OpenAI destroyed a trove of books used to train AI models. The employees who collected the data are gone.
https://www.businessinsider.com/openai-destroyed-ai-training-datasets-lawsuit-authors-books-copyright-2024-5
"A corporation destroyed a trove of books!"
"That's terrible, what books were lost?"
"Oh, none. It was a digital collection."
"But people will be inconvenienced by losing the repository, right?"
"No, I mean a corporation deleted a dataset they made, trained from a metaphorical trove of books."
"I'm leaving."
Nice to see Lemmy users don't read the articles before posting stupid comments. The internet never changes.
The article is about a lawsuit by book authors.
OpenAI trained their models on copyrighted material.
OpenAI deleted the sources so the list of copyrighted materials is unknown.
It feels like a significant portion of Lemmy users have not met a boot they did not like the taste of.
Oh no.
Anyway.
The free stuff we put out on the internet got scraped by a robot?
Now we're gonna sue... Google? No, it's fine when they do it.