May 8, 2024
MIT Technology Review Insights

AI models that process multiple types of information at once bring even bigger opportunities, along with more complex challenges, than traditional unimodal AI.

Multimodality is a relatively new term for something extremely old: how people have learned about the world since humanity appeared. Individuals receive information from myriad sources via their senses, including sight, sound, and touch. Human brains combine these different modes of data into a highly nuanced, holistic picture of reality.

“Communication between humans is multimodal,” says Jina AI CEO Han Xiao. “They use text, voice, emotions, expressions, and sometimes photos.” Those are just a few of the more obvious means of sharing information. Given this, he adds, “it is very safe to assume that future communication between human and machine will...
