ChatGPT-maker braces for fight with New York Times and authors on 'fair use' of copyrighted works

Matt O'Brien

Associated Press

Published: January 9, 2024 at 4:20 PM

Updated: January 10, 2024 at 4:05 PM

FILE - A sign for The New York Times hangs above the entrance to its building, May 6, 2021 in New York. A barrage of high-profile lawsuits in a New York federal court, including one by the New York Times, will test the future of ChatGPT and other artificial intelligence products. (AP Photo/Mark Lennihan, File)

FILE - The OpenAI logo is seen on a mobile phone in front of a computer screen displaying output from ChatGPT, March 21, 2023, in Boston. A barrage of high-profile lawsuits in a New York federal court, including one by the New York Times, will test the future of ChatGPT and other artificial intelligence products. (AP Photo/Michael Dwyer, File)

A barrage of high-profile lawsuits in a New York federal court will test the future of ChatGPT and other artificial intelligence products that wouldn't be so eloquent had they not ingested huge troves of copyrighted human works.

But are AI chatbots — in this case, widely commercialized products made by OpenAI and its business partner Microsoft — breaking copyright and fair competition laws? Professional writers and media outlets will face a difficult fight to win that argument in court.

“I would like to be optimistic on behalf of the authors, but I’m not. I just think they have an uphill battle here,” said copyright attorney Ashima Aggarwal, who used to work for academic publishing giant John Wiley & Sons.

One lawsuit comes from The New York Times. Another from a group of well-known novelists such as John Grisham, Jodi Picoult and George R.R. Martin. A third from bestselling nonfiction writers, including an author of the Pulitzer Prize-winning biography on which the hit movie “Oppenheimer” was based.

THE LAWSUITS

Each of the lawsuits makes different allegations, but they all center on the San Francisco-based company OpenAI “building this product on the back of other peoples’ intellectual property,” said attorney Justin Nelson, who is representing the nonfiction writers and whose law firm is also representing The Times.

“What OpenAI is saying is that they have a free ride to take anybody else’s intellectual property really since the dawn of time, as long as it’s been on the internet,” Nelson said.

The Times sued in December, arguing that ChatGPT and Microsoft's Copilot are competing with the same outlets they are trained on and diverting web traffic away from the newspaper and other copyright holders who depend on advertising revenue generated from their sites to keep producing their journalism. It also provided evidence of the chatbots spitting out Times articles word-for-word. At other times the chatbots falsely attributed misinformation to the paper in a way it said damaged its reputation.

One senior federal judge is so far presiding over all three cases, as well as a fourth from two more nonfiction authors who filed another lawsuit last week. U.S. District Judge Sidney H. Stein has been at the Manhattan-based court since 1995 when he was nominated by then-President Bill Clinton.

THE RESPONSE

OpenAI and Microsoft haven't yet filed formal counter-arguments on the New York cases, but OpenAI made a public statement this week describing The Times lawsuit as “without merit” and saying that the chatbot's ability to regurgitate some articles verbatim was a “rare bug.”

“Training AI models using publicly available internet materials is fair use, as supported by long-standing and widely accepted precedents,” said a Monday blog post from the company. It went on to suggest that The Times “either instructed the model to regurgitate or cherry-picked their examples from many attempts.”

OpenAI cited licensing agreements made last year with The Associated Press, the German media company Axel Springer and other organizations as offering a glimpse into how the company is trying to support a healthy news ecosystem. OpenAI is paying an undisclosed fee to license AP’s archive of news stories. The New York Times was engaged in similar talks before deciding to sue.

OpenAI said last year that access to AP's “high-quality, factual text archive” would improve the capabilities of its AI systems. But its blog post this week downplayed the importance of news content for AI training, arguing that large language models learn from an “enormous aggregate of human knowledge” and that “any single data source — including The New York Times — is not significant for the model’s intended learning.”

WHO'S GOING TO WIN?

Much of the AI industry's argument rests on the “fair use” doctrine of U.S. copyright law that allows for limited uses of copyrighted materials such as for teaching, research or transforming the copyrighted work into something different.

In response, the legal team representing The Times wrote Tuesday that what OpenAI and Microsoft are doing is “not fair use by any measure” because they're taking from the newspaper's investment in its journalism “to build substitutive products without permission or payment.”

So far, courts have largely sided with tech companies in interpreting how copyright laws should treat AI systems. In a defeat for visual artists, a federal judge in San Francisco last year dismissed much of the first big lawsuit against AI image-generators, though artists have since amended their complaint. Another California judge shot down part of comedian Sarah Silverman’s arguments against Facebook parent Meta, but her case was amended in December and joined with another one that includes writers Ta-Nehisi Coates and Michael Chabon.

The most recent lawsuits have brought more detailed evidence of alleged harms, but Aggarwal said when it comes to using copyrighted content to train AI systems that deliver a “small portion of that to users, the courts just don’t seem inclined to find that to be copyright infringement.”

Tech companies cite as precedent Google’s success in beating back legal challenges to its online book library. The U.S. Supreme Court in 2016 let stand lower court rulings that rejected authors’ claim that Google’s digitizing of millions of books and showing snippets of them to the public amounted to copyright infringement.

But judges interpret fair use arguments on a case-by-case basis and it is “actually very fact-dependent,” depending on economic impact and other factors, said Cathy Wolfe, an executive at the Dutch firm Wolters Kluwer who also sits on the board of the Copyright Clearance Center, which helps negotiate print and digital media licenses in the U.S.

"Just because something is free on the internet, on a website, doesn't mean you can copy it and email it, let alone use it to conduct commercial business," Wolfe said. "Who’s going to win, I don’t know, but I’m certainly a proponent for protecting copyright for all of us. It drives innovation."

BEYOND THE COURTS

Some media outlets and other content creators are looking beyond the courts and calling for lawmakers or the U.S. Copyright Office to strengthen copyright protections for the AI era. A panel of the U.S. Senate Judiciary Committee heard testimony Wednesday from media executives and advocates in a hearing dedicated to AI's effect on journalism.

Roger Lynch, chief executive of the Conde Nast magazine chain, planned to tell senators that generative AI companies “are using our stolen intellectual property to build tools of replacement.”

“We believe that a legislative fix can be simple — clarifying that the use of copyrighted content in conjunction with commercial Gen AI is not fair use and requires a license,” says a copy of Lynch's prepared remarks.

___

This story was first published on January 9, 2024. It was updated on January 10, 2024 to make clear that a lawsuit brought by artists against AI image-generators and another lawsuit against Meta brought by authors, including Sarah Silverman, have been amended after judges dismissed parts of each case.
