GammaGrove

Loading

Big Tech’s unspoken rule: using online content and copyrighted material to train AI is seemingly the norm – and it doesn’t look like that’s about to change

Big Tech’s unspoken rule: using online content and copyrighted material to train AI is seemingly the norm – and it doesn’t look like that’s about to change


This week, we learned that huge tech corporations, such as Apple, Nvidia, and Anthropic, allegedly use information like the subtitles and transcripts of YouTube videos to train their AI models. 

Some of the creators of these videos reacted to the news that their content was used in this way with disappointment and frustration, and understandably so. While they agreed to YouTube’s terms of service, which may include implicit agreement that content could be used in ways like this, they put a ton of work into their videos, and that’s gone on to be used and maybe even sold without the original creators seeing compensation or even credit. 

Unfortunately, I don’t think this will be an isolated incident – instead, it strikes me as a demonstration of an unspoken rule of tech companies that are developing AI models, and as a supervisor working in this area in Amazon allegedly told an ex-employee when instructing her to ignore potential copyright-related issues, “everyone’s doing it.”

An older woman sitting on a couch and using a laptop on a coffee table

(Image credit: Shutterstock/Ground Picture)

A more critical look at training data 



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *