Google Rationale For Deal With Wires: “Duplicate Detection”
By Rafat Ali - Fri 31 Aug 2007 11:53 AM PST
Google (NSDQ: GOOG) has cut a deal with four wire services (our detailed post below), and now gives the rationale of the deal on Google News’ official blog here: “If many of those stories [in Google News] are actually the exact same article, it can end up burying those different perspectives. Enter “duplicate detection.” Duplicate detection means we’ll be able to display a better variety of sources with less duplication. Instead of 20 “different” articles (which actually used the exact same content), we’ll show the definitive original copy and give credit to the original journalist. (We launched a similar feature in Sort-by-Date and got great feedback about it.) Of course, if you want to see all the duplicates on other publisher websites with additional analysis and context, they’re only a click away.”
And then, a slightly convoluted logic from them: “By removing duplicate articles from our results, we’ll be able to surface even more stories and viewpoints from journalists and publishers from around the world. This change will provide more room on Google News for publishers’ most highly valued content: original content.”
One good thing though: “Duplicate detection isn’t just for our news agency partners—it also enables you to find the original copy of articles from publishers and news agencies that have their own destination site.”
Posted in: Companies, Google, Media, Newspapers






