Nvidia Sued By Authors For Training AI With Copyrighted Works

Graphics chip maker Nvidia is facing a class action lawsuit from three authors who say the company illegally used their works and those of others to train its NeMo Megatron AI model, the latest lawsuit of its kind to target an AI company

Databricks, the parent company of MosaicML, was also hit by a parallel lawsuit, with both filed in San Francisco federal court.

Both lawsuits claim the respective companies used a library of pirated ebooks called Books3 to train their large language models (LLMs).

Similar lawsuits have been filed against AI companies including OpenAI, its main investor Microsoft, Meta and others by authors and by The New York Times for training their AI systems on copyrighted content, which they argue violates copyright law.

US writer Stewart O’Nan. Image credit: Beth Navarro/Stewart O’Nan

Pirated books

Books3 is a dataset of about 196,640 books, mostly by contemporary authors such as Stephen King and John Grisham.

The three authors, Abdi Nazemian, Brian Keene, and Stewart O’Nan, said the “model cards” attached to each Nvidia AI model say they were trained on the Books3 dataset.

“We respect the rights of all content creators and believe we created NeMo in full compliance with copyright law,” Nvidia said in a statement.

Databricks recentluy purchased AI company MosaicML, which makes the MPT series of AI models.

Copyright questions

The complaint against Databricks similarly says that public information about MPT shows those models were also trained on Books3 content.

The lawsuits name works including Keene’s 2008 novel Ghost Walk, Nazemian’s 2019 novel Like a Love Story and O’Nan’s 2007 novella Last Night at the Lobster.

Nvidia dominates the market for AI chips and launched its own NeMo models in 2022.

The company has seen its stock price soar in recent months due to high demand for its AI hardware, driving its market capitalisation over $2 trillion (£1.6bn).

Matthew Broersma

Matt Broersma is a long standing tech freelance, who has worked for Ziff-Davis, ZDnet and other leading publications

Recent Posts

OpenAI Tests Search Engine Prototype Called ‘SearchGPT’

Google's dominance of online search is being challenged, after OpenAI unveiled a search prototype tool…

14 hours ago

Elon Musk To Discuss $5 Billion xAI Investment With Tesla Board

Conflict of interest? Elon Musk to talk with Tesla board about making $5 billion Tesla…

18 hours ago

Amazon Developing Cheaper AI Chips – Report

Engineers at Amazon's chip lab in Austin, Texas, are racing ahead to develop cheaper AI…

1 day ago

Apple Smartphone Sales In China Drop 6.7 Percent, Canalys Finds

China woes. Apple's China smartphone shipments decline during the second quarter, dropping it down into…

1 day ago

Meta Ordered To Clean Up AI-Generated Porn By Oversight Board

Oversight Board orders Meta to clarify rules over sexually explicit AI-generated images, after two fake…

2 days ago