sa国际传媒

Skip to content
Join our Newsletter

The New York Times sues OpenAI and Microsoft for using its stories to train chatbots

NEW YORK (AP) 鈥 The New York Times is striking back against the threat that artificial intelligence poses to the news industry, filing a federal lawsuit Wednesday against OpenAI and Microsoft seeking to end the practice of using its stories to train
20231227091252-658c3ae74d71afde56746ba6jpeg
FILE - A sign for The New York Times hangs above the entrance to its building, Thursday, May 6, 2021 in New York. The New York Times filed a federal lawsuit against OpenAI and Microsoft on Wednesday, Dec. 27, 2023 seeking to end the practice of using published material to train chatbots. (AP Photo/Mark Lennihan, File)

NEW YORK (AP) 鈥 The New York Times is striking back against the threat that artificial intelligence poses to the news industry, filing a federal lawsuit Wednesday against OpenAI and Microsoft seeking to end the practice of using its stories to .

The Times says that the companies are threatening its livelihood by effectively stealing billions of dollars worth of work by its journalists, in some cases spitting out Times' material verbatim to people who seek answers from generative artificial intelligence like OpenAI's ChatGPT. The newspaper's lawsuit was filed in federal court in Manhattan.

OpenAI and Microsoft did not respond to requests for comment.

The media is one of many industries that could be upended by the rapid development of AI. Media organizations have already been pummeled by a migration of readers to online platforms and while many publications 鈥 most notably the Times 鈥 have successfully carved out a digital space, AI could become a significant threat.

鈥淭hese bots compete with the content they are trained on,鈥 said Ian B. Crosby, partner and lead counsel at Susman Godfrey, which is representing The Times.

Artificial intelligence companies scrape information available online, including articles published by news organizations, to train generative AI chatbots. The large language models are also trained on a huge trove of other human-written materials, such as instructional manuals and digital books. That helps them to build a strong command of language and grammar and to answer questions correctly.

But the technology is still under development and gets many things wrong. In its lawsuit, for example, the Times said OpenAI鈥檚 GPT-4 falsely attributed product recommendations to Wirecutter, the paper鈥檚 product reviews site, endangering its reputation.

OpenAI and other AI companies, including rival Anthropic, have attracted billions in investments very rapidly since public and business interest in the technology exploded, particularly this year.

Microsoft has a partnership with OpenAI that allows it to capitalize on the company鈥檚 AI technology. The Redmond, Washington, tech giant is also OpenAI鈥檚 biggest backer and has invested at least $13 billion into the company since the two began their partnership in 2019, according to the lawsuit. As part of the agreement, Microsoft鈥檚 supercomputers help power OpenAI鈥檚 AI research and the tech giant integrates the startup鈥檚 technology into its products.

The paper's complaint comes as the number of lawsuits filed against OpenAI for . The company has been sued by several writers 鈥 including comedian Sarah Silverman 鈥 who say their books were ingested to train OpenAI's AI models without their permission. In June, more than 4,000 writers signed a letter to the CEOs of OpenAI, Google, Microsoft, Meta and other AI developers accusing them of exploitative practices in building chatbots that 鈥渕imic and regurgitate鈥 their language, style and ideas.

As AI technology develops, growing fears over its use have also fueled labor strikes and lawsuits in other industries, . Different stakeholders are realizing the technology could disrupt their entire business model, but the question will be how to respond to it, said Sarah Kreps, director of Cornell University鈥檚 Tech Policy Institute.

Kreps said she agrees The New York Times is facing a threat from these chatbots, but she also argued solving the issue completely is going to be an uphill battle.

鈥淭here鈥檚 so many other language models out there that are doing the same thing,鈥 she said.

The lawsuit filed Wednesday claims that generative AI tools developed by OpenAI and Microsoft are closely summarizing content from the Times, mimicking its style and even reciting it verbatim. The complaint cited examples of OpenAI鈥檚 GPT-4 spitting out large portions of news articles from the Times, including a Pulitzer-Prize winning investigation into New York City鈥檚 taxi industry that was published in 2019 and took 18 months to complete. It also cited outputs from Bing Chat 鈥 now called Copilot 鈥 that it said included verbatim excerpts from Times articles.

The Times did not list specific damages that it is seeking, but said the legal action 鈥渟eeks to hold them responsible for the billions of dollars in statutory and actual damages that they owe for the unlawful copying and use of The Times鈥檚 uniquely valuable works.鈥 It is also asking the court to order the tech companies to destroy AI models or data sets that use its work.

Web traffic is an important component of the paper's advertising revenue and helps drive subscriptions to its online site. The outputs from AI chatbots divert that traffic away from the paper and other copyright holders, the Times says, making it less likely that users will visit the original source for the information.

Less traffic to the Times' Wirecutter articles, for example, means less people clicking on affiliate links, which in turn means less revenue for the paper's product review site.

The New York Times said it鈥檚 never given permission to anyone to use its content for generative AI purposes. The lawsuit also follows what appears to be breakdowns in talks between the newspaper and the two companies that began in April, and could be a way to kickstart talks on ending a business dispute.

The News/Media Alliance, a trade group representing more than 2,200 news organizations, applauded Wednesday's action by the Times.

鈥淨uality journalism and GenAI can complement each other if approached collaboratively,鈥 said Danielle Coffey, alliance president and CEO. 鈥淏ut using journalism without permission or payment is unlawful, and certainly not fair use.鈥

In July, OpenAI and The Associated Press for the artificial intelligence company to license AP鈥檚 archive of news stories. This month, OpenAI also signed a similar partnership with Axel Springer, a media company in Berlin that owns Politico and Business Insider. Under the deal, users of OpenAI鈥檚 ChatGPT will receive summaries of 鈥渟elected global news content鈥 from Axel Springer鈥檚 media brands. The companies said the answers to queries will include attribution and links to the original articles.

The Times has compared its action to a copyright lawsuit more than two decades ago against Napster, when record companies sued the file-sharing service for unlawful use of their material. The record companies won and Napster was soon gone, but it has had a major impact on the industry. Industry-endorsed streaming now dominates the music business.

___

AP Technology Writer Matt O'Brien contributed to this story.

Haleluya Hadero And David Bauder, The Associated Press