What is Penguin Random House suing OpenAI over?

Penguin Random House filed a lawsuit in Munich against OpenAI's Ireland-based European subsidiary, alleging ChatGPT violated copyright by 'memorising' and reproducing content from Ingo Siegner's Coconut the Little Dragon series. When prompted to write a children's book featuring Coconut on Mars, ChatGPT produced text, cover art, a back-cover blurb, and publishing instructions that were 'virtually indistinguishable from the original.'

Who is Ingo Siegner and what is Coconut the Little Dragon?

Ingo Siegner is a German author-illustrator whose Coconut the Little Dragon (Der kleine Drache Kokosnuss) series has become one of Germany's most beloved children's franchises, spanning more than 30 volumes, a TV series, and two feature films. The dragon is named after a coconut because he is said to be no taller than its hard shell.

What is 'memorisation' in AI and why does it matter legally?

Memorisation is when a large language model stores substantial portions of its training data and can reproduce long excerpts verbatim or near-verbatim on request. AI companies have argued this is legally distinct from copying text to a database, but courts — especially in Germany — have been skeptical. A Munich court ruled against OpenAI in November 2025 in a separate case brought by music rights society Gema.

Has OpenAI faced copyright rulings in Germany before?

Yes. In November 2025, a Munich regional court ruled that ChatGPT violated German copyright law by using protected song lyrics from top-selling artists to train its language models. That ruling, in favour of Germany's music rights society Gema, was a significant legal setback for OpenAI in Europe.

What does Penguin Random House's parent company Bertelsmann have to do with OpenAI?

German media giant Bertelsmann, which owns Penguin Random House, struck a collaboration deal with OpenAI in January 2025. However, that deal did not grant OpenAI access to Bertelsmann's media archives — meaning the training data at issue in this lawsuit was allegedly taken without consent.

What could this lawsuit mean for the AI industry?

Coming from one of the world's largest publishers, this case could set precedent for other houses to file similar suits — especially in jurisdictions with stronger author protections than the US. The EU AI Act also requires high-risk AI providers to disclose training data, which may create additional legal exposure for OpenAI across European markets.

Penguin Random House Sues OpenAI in Munich Over ChatGPT Copying a Children's Book — Almost Perfectly

Penguin Random House didn't just test ChatGPT. It built a legal case.

The publishing giant — owner of imprints including Viking, Knopf, Doubleday, and Random House — filed a copyright lawsuit in Munich last week against OpenAI's Ireland-based European subsidiary, alleging that ChatGPT unlawfully memorised and reproduced substantial content from one of Germany's most beloved children's book series.

The target: Ingo Siegner's Coconut the Little Dragon (Der kleine Drache Kokosnuss), a franchise spanning 30+ volumes, a TV series, and two feature films.

The test that triggered the lawsuit was deceptively simple. Penguin's legal team entered a single prompt: "Can you write a children's book in which Coconut the Dragon is on Mars."

What came back, the publisher says, was "virtually indistinguishable from the original" — a story in Siegner's distinctive voice, cover art featuring the orange dragon with his two sidekicks, a back-cover blurb, and step-by-step instructions on how to submit the manuscript to a self-publishing platform.

That last detail — the self-publishing instructions — is particularly damaging. It suggests ChatGPT wasn't just generating Siegner-flavoured content. It was helping reproduce and distribute it.

The Memorisation Problem

The lawsuit centres on a phenomenon AI researchers call "memorisation" — the tendency of large language models to absorb substantial portions of training text and reproduce them on demand. It's a well-documented behaviour. Researchers at DeepMind and Google have shown that models trained on copyrighted books can reproduce long verbatim passages when appropriately prompted.

AI companies have generally responded to memorisation evidence with a specific legal defence: the model doesn't "copy" text the way a database does. Instead, it encodes statistical patterns. Reproduction is emergent, not stored.

German courts have been unimpressed by that argument.

In November 2025, a Munich regional court ruled in favour of Germany's music rights society Gema, finding that ChatGPT violated copyright by training on protected song lyrics. That ruling was a first in Europe — and Penguin Random House appears to be testing whether it extends to prose.

Bertelsmann's Complicated Relationship With OpenAI

There's an uncomfortable context here. Penguin Random House's parent company, German media conglomerate Bertelsmann, signed a collaboration deal with OpenAI in January 2025. The two companies announced joint projects, including AI-assisted tools for Bertelsmann's media properties.

Crucially, that deal did not grant OpenAI access to Bertelsmann's media archives. So if ChatGPT has Siegner's work memorised, it was apparently acquired without consent — even from a company Bertelsmann had agreed to work with commercially.

Carina Mathern, the Penguin Random House Verlagsgruppe publisher for children's and young-adult books, framed it diplomatically but firmly: "We are fundamentally open to the opportunities offered by AI, but at the same time, the protection of intellectual property is our top priority."

OpenAI's response: "We are reviewing the allegations. We respect creators and content owners, and are having productive conversations with many publishers around the world."

Why This Case Is Different

Copyright lawsuits against AI companies are no longer unusual. The New York Times, Getty Images, GEMA, multiple authors' guilds, and dozens of individual creators have all filed cases in various jurisdictions. Most of those cases are still grinding through courts.

What makes this one stand out:

The plaintiff is one of the world's largest publishers — not an individual author, not a niche rights holder. A ruling in Penguin's favour would be difficult for AI companies to dismiss as a special case.
German courts have already ruled against OpenAI — the precedent from the Gema case exists and is directly relevant.
The test is extraordinarily clean — a single prompt, a response the publisher describes as indistinguishable from the original. The evidence is easy to explain to a judge and to the public.
The EU AI Act is coming online — enforcement of transparency and rights-holder disclosure requirements will begin later this year, creating additional legal pressure on OpenAI throughout the continent.

For a publishing industry that has watched AI eat its lunch while simultaneously being uncertain how to respond, this lawsuit is a moment of clarity. The question isn't whether AI trained on their books. It's whether that training was lawful — and whether they're going to fight for it.

Penguin Random House, apparently, has decided the answer is yes.

Penguin Random House Sues OpenAI in Munich Over ChatGPT Copying a Children's Book — Almost Perfectly

The Memorisation Problem

Bertelsmann's Complicated Relationship With OpenAI

Why This Case Is Different

More in Industry

Samsung's 50,000-Worker Strike Set for May 21 as AI Memory Bonus Dispute Threatens Nvidia HBM Supply

Runway Pivots From AI Video to World Models in Bid to Outflank Google and OpenAI

Google Unveils Googlebook: An Android Laptop Built Around Gemini Intelligence