Hacker News new | past | comments | ask | show | jobs | submit login

Former headline writer here.

Headlines are not one monolithic thing, any more than publishers are.

There are giant worldwide news organizations, there are personal blog publishers, and everything in between.

They all have different constraints and different motivations for their headlines.

Even if you look at just one type of publisher -- a traditional print newspaper with an online presence -- you are likely to find that they write multiple headlines for each story, depending on where that headline will be seen.

For instance, they might write one headline for the print version of the story that's constrained by page layout requirements.

Then then might write another headline that gets displayed on their homepage. It has to be short, punchy, and eye-catching.

Then they might write another longer, more complete, SEO-friendly headline that gets displayed when you actually click on the link to the story.

There might also be an alternate headline that gets displayed as the HTML/meta title of the page, which can be useful for when the link is shared via social media.

And all of those have their specific purposes and limitations.

Another wrinkle: In the case of news organizations, it's highly likely that the person who writes the headline is not the person who writes the article, whereas typically with personal blogs the same person writes both the story text and the title.

So if inaccurate or sensationalistic headlines are one problem, then another problem is treating headlines as if they are all the same. They're not.




...then another problem is treating headlines as if they are all the same. They're not.

What you indicated was that headlines are written in different circumstances. But it doesn't mean that they're not all, roughly, similar.

I betting there exists a strong correlation between clickbait-y-ness and pageviews.

So,

1. Gather all articles with a lot of pageviews

2. Do some NLP to get the really bad ones, i.e. "Look at what this <noun> did after <predicate>"

3. Remove outliers

And bang, you mark all of the sucky articles. That could be included in a Chrome extension, much like uBlock. Hell, why not just have it in uBlock?




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: