A group of authors brought a suit against Meta in the Northern District of California, including Sarah Silverman and Richard Kadrey, alleging that the technology giant used copies of 7.5 million pirated books and 81 million research papers to train its large language AI model, Llama.
Judge Chhabria found in favour of Meta’s argument that using the pirated works was defensible under copyright law as a ‘fair use’. The ‘fair use’ elements most in issue were the first and the fourth – whether the use was ‘transformative’, and whether the use caused market harm to the plaintiffs.
There was little argument that Meta’s use ‘transformed’ the source material entirely. Chhabria J wrote that Meta’s use for AI training had a “further purpose” and a “different character” than the original books. The work created by Llama was not a ‘substitute’ for the copyrighted material – for example, the LLM did not regurgitate ‘verbatim excerpts’. It turned the source material into something entirely new.
Regarding the fourth factor, Chhabria J ruled that the plaintiffs failed to produce any compelling evidence about market harm caused. He acknowledged while “that conclusion may be in significant tension with reality… it’s dictated by the choice the plaintiffs made… failing to present meaningful evidence on the effect of training LLMs like Llama with their books on the market for [AI-generated] books.” The plaintiffs couldn’t show that Meta’s use of their copyrighted work substantially diminished the market value of the original book.
Ultimately, the plaintiffs lost because they failed to present a compelling argument regarding the consequences of Meta’s actions for the market value. Chhabria J stated the court “[had] no choice but to grant summary judgment to Meta”, but warned that this ruling “does not stand for the proposition that Meta’s use of copyrighted materials to train language models is lawful… in many circumstances it will be illegal to copy copyright-protected works to train generative AI models without permission.” This is bad news for the authors in this case, but potentially good news overall for creatives concerned about the application of the ‘fair use’ doctrine to defeat copyright claims against AI giants.
It should also be noted that the issue of the piracy itself will be determined separately from the copyright infringement claim. Recent case Bartz v Anthropic demonstrates that the outlook on that front is much less rosy for Meta.