New York Times files lawsuit against OpenAI and Microsoft over use of articles to train AI chatbots

The New York Times filed a federal lawsuit against OpenAI and Microsoft on Wednesday, December 27, 2023. - Copyright Mark Lennihan/AP

By Associated Press

Published on 27/12/2023 - 18:03 GMT+1•Updated 28/12/2023 - 9:59 GMT+1

The New York Times has filed a lawsuit against the maker of ChatGPT in a bid to end the practice of using its published material to train chatbots.

The New York Times has filed a federal lawsuit against OpenAI and Microsoft seeking to end the practice of using its stories to train artificial intelligence (AI) chatbots, saying that copyright infringements at the paper alone could be worth billions.

The paper joins a growing list of individuals and publishers trying to stop OpenAI from using copyrighted material.

In the suit, filed on Wednesday in federal court in Manhattan, the Times said OpenAI and Microsoft are advancing their technology through the "unlawful use of The Times’s work to create artificial intelligence products that compete with it", a practice it said "threatens The Times' ability to provide that service".

OpenAI and Microsoft did not immediately respond to requests for comment.

Media organisations have been pummelled by the migration of readers to online platforms. While many publications have carved out a digital presence of their own, AI technology now threatens to upend numerous industries, including media.

AI companies scrape information available online, including articles published by media organisations, to train generative AI chatbots. Those companies have rapidly attracted billions in investment.

Rising number of lawsuits against OpenAI

Microsoft has a partnership with OpenAI that allows it to capitalise on the AI technology made by the artificial intelligence company.

The tech giant is also OpenAI's biggest backer and has invested billions of dollars into the company since the two began their partnership in 2019 with a $1 billion (€899.5 million) investment.

As part of the agreement, Microsoft’s supercomputers help power OpenAI’s AI research and the tech giant integrates the startup’s technology into its products.

The number of lawsuits filed against OpenAI for copyright infringement is growing.

The company has been sued by a number of writers - including comedian Sarah Silverman - who say their books were ingested to train OpenAI's AI models without their permission.

In June, more than 4,000 writers signed a letter to the CEOs of OpenAI, Google, Microsoft, Meta, and other AI developers accusing them of exploitative practices in building chatbots that "mimic and regurgitate" their language, style, and ideas.

The lawsuit filed on Wednesday said generative AI tools developed by OpenAI and Microsoft are closely summarising content from the Times, mimicking its style and even reciting it verbatim.

Using news articles to train GPT-4

The complaint cited examples of OpenAI's GPT-4 reproducing large portions of news articles from the Times, including a Pulitzer Prize-winning investigation into New York City’s taxi industry that was published in 2019 and took 18 months to complete.

It also cited outputs from Bing Chat that it said included verbatim excerpts from Times articles.

The Times did not list specific damages that it is seeking, but said the legal action "seeks to hold them responsible for the billions of dollars in statutory and actual damages that they owe for the unlawful copying and use of The Times’s uniquely valuable works".

The Times, however, is seeking the destruction of GPT and other large language models or training sets that incorporate its work.

In the complaint, the Times said Microsoft and OpenAI "seek to free-ride on The Times' massive investments in its journalism" by using it to build products without payment or permission.

Talks between publishers and OpenAI

In July, OpenAI and The Associated Press announced a deal for the AI company to license AP’s archive of news stories.

The New York Times said it has never permitted anyone to use its content for generative AI purposes.

The lawsuit also follows what appears to be a breakdown in talks between the newspaper and the two companies.

The Times said it reached out to Microsoft and OpenAI in April to raise concerns about the use of its intellectual property and reach a resolution on the issue.

During the talks, the newspaper said it sought to "ensure it received fair value" for the use of its content, "facilitate the continuation of a healthy news ecosystem, and help develop GenAI technology in a responsible way that benefits society and supports a well-informed public".

"These negotiations have not led to a resolution," the lawsuit said.
