Google News prioritizes AI-generated copycat articles above original content, including a Post exclusive

restricted details

Google News searches unveil AI-generated articles that brazenly plagiarize from legitimate media outlets – and The Post has already identified at least one such theft of its own published work.

The Post tested Google News late last week by seeking out recent articles about Federal Trade Commission nominee Melissa Holyoak and organizing the results by the most recent date of publication.

The Post’s exclusive Jan. 8 story about Holyoak was positioned lower in search results than a nearly identical theft – published by an outlet with the common name “Commerce News” and the strange domain address “biz.crast.net” that apparently churns out tons of AI-generated articles.

The counterfeit version of the article featured the same artwork – and even mentioned “The Post” in its plagiarized copy. The theft is ascribed to “Shawn Johnson,” whose byline showed up on over 17,800 pages of results and published dozens of articles just last Friday.

By late afternoon on Friday, a Google spokesperson affirmed the article “violates our policy and will be removed.”

Google also verified that AI-generated content is not against its policies, but that content can be removed if it is deemed to be “spam” that was published specifically to rank high in News results.

An AI-generated theft ranked above The Post’s original story in search results.

Independent outlet 404 Media called out the issue last week after obtaining screenshots that showed AI-generated thefts – including plagiarized versions of a “Star Wars”-related post published by Distractify and an article published by Heavy.com about an “execution-style murder” – appearing alongside real articles in Google News search results.

The spread of AI-generated articles is already a “real problem” for the industry, according to Danielle Coffey, CEO of the News/Media Alliance.

“It’s a broken system because it’s not rewarding the quality human-created content across the board.” Coffey told The Post. “It’s going to create conditions where it’s going to be impossible to generate any revenue to keep our newsrooms afloat.”

Coffey was one of several experts who testified before a Senate panel earlier this month on the dangers AI could pose to the future of journalism. At the same event, Conde Nast CEO Roger Lynch said AI chatbots are “built with stolen goods” and should be regulated by Congress.

The fake version of The Post’s Melissa Holyoak story even featured the same artwork.

The proliferation of AI-generated news content alongside real articles on a platform operated by Google — which controls 90% of the online search market — is “deeply concerning,” according to Jason Kint, CEO of Digital Content Next.

“Google exerts extraordinary market power in which news brands get discovered and funded for the American public,” Kint said. “AI is a tool and probably shouldn’t be seen as the villain here — but instead a risk accelerant when gatekeeper power is left unchecked.”

As The Post has reported, media outlets have expressed outrage in recent months over the use of AI-powered chatbots, such as OpenAI’s ChatGPT, to lift copyrighted work without proper credit or compensation.

Last month, the New York Times filed a major copyright infringement lawsuit to protect its business model, while others are locked in intense negotiations to secure payment for their content through licensing deals.

Google’s policy does not penalize AI-generated content. Thaspol – stock.adobe.com

404 Media’s report cited several specific examples of bluntly plagiarized content in which it said Google was “boosting” seemingly AI-powered sites regurgitated posts featuring identical or near-identical headlines, photos and text as articles from real media outlets.

The outlet said the examples in Google News were found by searching for the relevant topic and setting search parameters to content published in the last 24 hours.

“The presence of AI-generated content on Google News signals two things: first, the black box nature of Google News, with entry into Google News’ rankings in the first place an opaque, but apparently gameable, system,” 404 Media’s Joseph Cox wrote.

“Second, is how Google may not be ready for moderating its News service in the age of consumer-access AI, where essentially anyone is able to churn out a mass of content with little to no regard for its quality or originality,” Cox added.

Google said it takes search quality “extremely seriously.” Aleksei – stock.adobe.com

Google search liaison Danny Sullivan questioned the report’s methodology in a lengthy X thread, asserting that sorting search results by date was “expressingly asking our systems to ignore the regular relevance ranking.”

A Google spokesperson also pushed back on the report’s findings in a statement to The Post.

“Claiming that these sites were featured prominently in Google News is not accurate – the sites in question only appeared for artificially narrow queries, including queries that explicitly filtered out the date of an original article,” the spokesperson said in a statement.

“We take the quality of our results extremely seriously and have clear policies against content created for the primary purpose of ranking well on News and we remove sites that violate it,” the Google spokesperson added.

Google also argued it isn’t fair to claim an article is being “boosted” just because it appeared as a result in response to a specific user request.

When reached for comment, 404 Media’s Cox fired back, stating his article was “clear” and “does not contain factual inaccuracies.”

404 Media co-founder Jason Koebler also defended the report in a statement, asserting that “normal search terms with no time limitations still surface this content in Google News above the sites that they are ripping off.”




Load more…





https://nypost.com/2024/01/22/business/google-news-searches-ranked-ai-generated-ripoffs-above-real-articles-including-a-post-exclusive/?utm_source=url_sitebuttons&utm_medium=site%20buttons&utm_campaign=site%20buttons

Copy the URL to share

Leave a Reply

Your email address will not be published. Required fields are marked *