Databricks · The Register
Databricks can't seem to shake authors' copyright claim that could result in 'extraordinary' damages
Compiled by KHAO Editorial — aggregated from 1 outlet. See llms.txt for citation guidance.
◌ Single Source
Databricks cannot shake a class action lawsuit targeting its LLM, which several book authors contend was created with a database that contained pirated versions of some of their copyrighted books – and about 196,000 titles in all.
Key facts
- Databricks LLM, called DBRX, was cobbled together with parts from MosaicLM, which Databricks acquired in 2023 — They are six figures per work infringed up to $150,000,” he said
- Then when Databricks released its DBRX model in March 2024, it said “The development of DBRX was led by the Mosaic team that previously built the MPT model family
- Judge Breyer says, ‘We need to know more before we can say that you didn't engage in any infringing copying,’ ” Butler told The Register
Summary
Databricks’ motion to dismiss the case was denied last week by Judge Charles Breyer in U.S. District Court in Northern California, who said the plaintiffs, a group of writers that includes bestsellers and a Pulitzer Prize finalist, had grounds to continue their suit against the data analytics platform. Databricks LLM, called DBRX, was cobbled together with parts from MosaicLM, which Databricks acquired in 2023. Then when Databricks released its DBRX model in March 2024, it said “The development of DBRX was led by the Mosaic team that previously built the MPT model family.” The case hinges on how closely those two steps were tied.