MIT Technology Review, December 18, 2024
Melissa Heikkilä, Stephanie Arnett

New findings show how the sources of data are concentrating power in the hands of the most powerful tech companies.

AI is all about data. Reams and reams of data are needed to train algorithms to do what we want, and what goes into the AI models determines what comes out. But here’s the problem: AI developers and researchers don’t really know much about the sources of the data they are using. AI’s data collection practices are immature compared with the sophistication of AI model development. Massive data sets often lack clear information about what is in them and where it came from.

The Data Provenance Initiative, a group of over 50 researchers from both academia and industry, wanted...

Topics: AI (Artificial Intelligence), Big Data, Technology
