MIT Technology Review December 18, 2024
Melissa Heikkilä, Stephanie Arnett

New findings show how the sources of data are concentrating power in the hands of the most powerful tech companies.

AI is all about data. Reams and reams of data are needed to train algorithms to do what we want, and what goes into the AI models determines what comes out. But here’s the problem: AI developers and researchers don’t really know much about the sources of the data they are using. AI’s data collection practices are immature compared with the sophistication of AI model development. Massive data sets often lack clear information about what is in them and where it came from.

The Data Provenance Initiative, a group of over 50 researchers from both academia and industry, wanted...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Big Data, Technology
Trust, But Verify The Data Feeding Your AI Systems
Creating Value: How Data, AI, And Apps Transform Business
Healthcare’s Data-Driven Transformation: Breaking Down Silos To Improve Efficiency
Cedars-Sinai’s New AI Chief Digs Into the Data
AI: The Battlefield In The War For Data Control

Share This Article