Authors file a lawsuit against OpenAI for unlawfully ‘ingesting’ their books

Ella Creamer

5 July 2023 at 10:33 am·4-min read

Two authors have filed a lawsuit against OpenAI, the company behind the artificial intelligence tool ChatGPT, claiming that the organisation breached copyright law by “training” its model on novels without the permission of authors.

Mona Awad, whose books include Bunny and 13 Ways of Looking at a Fat Girl, and Paul Tremblay, author of The Cabin at the End of the World, filed the class action complaint to a San Francisco federal court last week.

ChatGPT allows users to ask questions and type commands into a chatbot and responds with text that resembles human language patterns. The model underlying ChatGPT is trained with data that is publicly available on the internet.

Yet, Awad and Tremblay believe their books, which are copyrighted, were unlawfully “ingested” and “used to train” ChatGPT because the chatbot generated “very accurate summaries” of the novels, according to the complaint. Sample summaries are included in the lawsuit as exhibits.

This is the first lawsuit against ChatGPT that concerns copyright, according to Andres Guadamuz, a reader in intellectual property law at the University of Sussex. The lawsuit will explore the uncertain “borders of the legality” of actions within the generative AI space, he adds.

Books are ideal for training large language models because they tend to contain “high-quality, well-edited, long-form prose,” said the authors’ lawyers, Joseph Saveri and Matthew Butterick, in an email to the Guardian. “It’s the gold standard of idea storage for our species.”

The complaint said that OpenAI “unfairly” profits from “stolen writing and ideas” and calls for monetary damages on behalf of all US-based authors whose works were allegedly used to train ChatGPT. Though authors with copyrighted works have “great legal protection”, said Saveri and Butterick, they are confronting companies “like OpenAI who behave as if these laws don’t apply to them”.

However, it may be difficult to prove that authors have suffered financial losses specifically because of ChatGPT being trained on copyrighted material, even if the latter turned out to be true. ChatGPT may work “exactly the same” if it had not ingested the books, said Guadamuz, because it is trained on a wealth of internet information that includes, for example, internet users discussing the books.

OpenAI has become “increasingly secretive” about its training data, said Saveri and Butterick. In papers released alongside early iterations of ChatGPT, OpenAI gave some clues as to the size of the “internet-based books corpora” it used as training material, which it called only “Books2”. The lawyers deduce that the size of this dataset – estimated to contain 294,000 titles – means the books could only be drawn from shadow libraries such as Library Genesis (LibGen) and Z-Library, through which books can be secured in bulk via torrent systems.

This case will “likely rest on whether courts view the use of copyright material in this way as ‘fair use’”, said Lilian Edwards, professor of law, innovation and society at Newcastle University, “or as simple unauthorised copying.” Edwards and Guadamuz both emphasise that a similar lawsuit brought in the UK would not be decided in the same way, because the UK does not have the same “fair use” defence.

The UK government has been “keen on promoting an exception to copyright that would allow free use of copyright material for text and data mining, even for commercial purposes,” said Edwards, but the reform was “spiked” after authors, publishers and the music industry were “appalled”.

Since ChatGPT was launched in November 2022, the publishing industry has been in discussion over how to protect authors from the potential harms of AI technology. Last month, The Society of Authors (SoA) published a list of “practical steps for members” to “safeguard” themselves and their work. Yesterday, the SoA’s chief executive, Nicola Solomon told the trade magazine the Bookseller that the organisation was “very pleased” to see authors suing OpenAI, having “long been concerned” about the “wholesale copying” of authors’ work to train large language models.

Richard Combes, head of rights and licensing at the Authors’ Licensing and Collecting Society (ALCS), said that current regulation around AI is “fragmented, inconsistent across different jurisdictions and struggling to keep pace with technological developments”. He encouraged policymakers to consult principles that the ALCS has drawn up which “protect the true value that human authorship brings to our lives and, notably in the case of the UK, our economy and international identity”.

Saveri and Butterick believe that AI will eventually resemble “what happened with digital music and TV and movies” and comply with copyright law. “They will be based on licensed data, with the sources disclosed.”

The lawyers also noted it is “ironic” that “so-called ‘artificial intelligence’” tools rely on data made by humans. “Their systems depend entirely on human creativity. If they bankrupt human creators, they will soon bankrupt themselves.”

OpenAI were approached for comment.

Hello!
Prince William and Princess Kate interrupt Christmas break to make major announcement
The Prince and Princess of Wales have interrupted their Christmas break with their three children as the royal couple shared a major announcement
OK! Magazine
Michael Mosley's cause of death bombshell at inquest into tragic Greek island fall
Dr Michael Mosley's body was found in a rocky area on the Greek island of Symi in June following a four day search after the broadcaster went missing while on holiday
The Northern Echo
Plane takes off WITHOUT PILOT as they watch from runway as it heads out to sea
A plane took off without its pilot at a Northumberland airfield and is thought to have crashed into the sea.
Hello!
Prince William and Kate Middleton's photographer breaks his silence after royals suddenly delete Christmas card
Prince William and Princess Kate’s Christmas card drama has captured the attention of royal fans everywhere, with the family’s chosen photographer, Will Warr, breaking his silence. See details.
Manchester Evening News
Luke Littler loses World Darts Championship record to 'absolute nutcase' as shock statement sent
World Darts Championship records were broken this week at Ally Pally despite Luke Littler not playing at the tournament after receiving a bye to the second round
The Independent
Trump team warns Starmer’s ‘horrible, arrogant’ ambassador pick means Britain will be ‘locked out’ of key discussions
President-elect’s team deeply unhappy at Starmer’s choice for Britain’s new top diplomat
Hello!
The Chase star Mark Labbett furious after being 'benched' by ITV bosses
Mark Labbett speaks out after being benched on Beat the Chasers, sparking debate among fans about fairness and ITV's producer decisions.
Evening Standard
OPINION - I'm sorry, but this is why I have no sympathy for the woman whose £10,000 handbag was stolen in London
It did not, I have to say, take long for my sympathy for the woman who had her designer handbag stolen from a changing room in Oxford Street to evaporate into thin air. Nothing sums up more the vulgarity of the culture than the fact that it’s now de rigueur for a young woman to flaunt £10,000 worth of designer handbag as a measure of success. As an artefact, the Hermes JPG Shoulder Birkin, in red is an attractive piece; it’s well made and an intelligent take on the original designed for Jane Birkin.
Manchester Evening News
Tyson Fury to be stripped of £28m immediately after Oleksandr Usyk fight
The Wythenshawe-born boxer is out for revenge in Saudi Arabia on Saturday night as he takes on Oleksandr Usyk for his three heavyweight belts
Hello!
Real reason King Charles has not revealed the type of cancer he has
Palace sources have shared why the King hasn't shared details around his cancer diagnosis, amid reports Charles will continue his treatment into 2025.
The Independent
Strictly legend left ‘fragile’ after unexpected exit from the show
‘I gave myself 48 hours to kick, scream, cry and sob,’ the dancer said
The Independent
Two GB News hosts axed amid major shake-up on channel
‘GB News have made the decision to permanently relieve me of my duties at the channel’ said one presenter
Hello!
Strictly's Janette Manrara and Aljaž Škorjanec delight fans with exciting announcement
Strictly stars Janette Manrara and Aljaž Škorjanec delighted fans with an exciting announcement which comes just days after Aljaž competed in the Strictly final with Tasha Ghouri…
Manchester Evening News
We compared Baileys with M&S, Sainsbury's, Asda, Lidl, Aldi and Morrisons versions, no wonder this one's a sell-out
On taste and price you'd be silly not to snap this one up
Manchester Evening News
BBC faces backlash after households receive letters with 'terrifying' Christmas Day threats
People who received the letter over TV licences have hit out
Wales Online
Gavin and Stacey star James Corden opens up on 'hard' marriage before major family decison
Gavin and Stacey actor and co-creator James Corden has revealed the difficult aspect of his 12-year marriage to TV producer Julia Carey ahead of the show's final episode
Hello!
Bradley Walsh's son Barney sparks reaction with pair's latest joint TV appearance
Bradley Walsh and his son Barney appeared on The One Show on Thursday night – and fans all made the same comment about the 27-year-old actor…
HuffPost UK
This Year's Christmas Number 1 Is Here – And Chart History Has Just Been Made
Mariah Carey, Wham! and Tom Grennan were all in competition for the festive top spot this year.
Hello!
Prince Harry and Meghan Markle's second family Christmas card revealed
The Duke and Duchess of Sussex have sent separate holiday greetings to their close family and friends, after releasing a card with a rare photo of their children, Archie and Lilibet
BuzzFeed
27 Brutally Honest Confessions From A Non-Rich Kid Who Went To An Elite Private School With Super-Rich Kids
"I was one of the only kids in the whole school that wasn’t wealthy or had famous parents. I attended on scholarship."

Best John Lewis deals to bag ASAP, including up to 50% off Barbour, Mint Velvet and Dyson

Authors file a lawsuit against OpenAI for unlawfully ‘ingesting’ their books

Latest stories

Prince William and Princess Kate interrupt Christmas break to make major announcement

Michael Mosley's cause of death bombshell at inquest into tragic Greek island fall

Plane takes off WITHOUT PILOT as they watch from runway as it heads out to sea

Prince William and Kate Middleton's photographer breaks his silence after royals suddenly delete Christmas card

Luke Littler loses World Darts Championship record to 'absolute nutcase' as shock statement sent

Trump team warns Starmer’s ‘horrible, arrogant’ ambassador pick means Britain will be ‘locked out’ of key discussions

The Chase star Mark Labbett furious after being 'benched' by ITV bosses

OPINION - I'm sorry, but this is why I have no sympathy for the woman whose £10,000 handbag was stolen in London

Tyson Fury to be stripped of £28m immediately after Oleksandr Usyk fight

Real reason King Charles has not revealed the type of cancer he has

Strictly legend left ‘fragile’ after unexpected exit from the show

Two GB News hosts axed amid major shake-up on channel

Strictly's Janette Manrara and Aljaž Škorjanec delight fans with exciting announcement

We compared Baileys with M&S, Sainsbury's, Asda, Lidl, Aldi and Morrisons versions, no wonder this one's a sell-out

BBC faces backlash after households receive letters with 'terrifying' Christmas Day threats

Gavin and Stacey star James Corden opens up on 'hard' marriage before major family decison

Bradley Walsh's son Barney sparks reaction with pair's latest joint TV appearance

This Year's Christmas Number 1 Is Here – And Chart History Has Just Been Made

Prince Harry and Meghan Markle's second family Christmas card revealed

27 Brutally Honest Confessions From A Non-Rich Kid Who Went To An Elite Private School With Super-Rich Kids