VentureBeat January 13, 2025
Michael Nuñez

Researchers at the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) have announced the release of LlamaV-o1, a state-of-the-art artificial intelligence model capable of tackling some of the most complex reasoning tasks across text and images.

By combining cutting-edge curriculum learning with advanced optimization techniques like Beam Search, LlamaV-o1 sets a new benchmark for step-by-step reasoning in multimodal AI systems.

“Reasoning is a fundamental capability for solving complex multi-step problems, particularly in visual contexts where sequential step-wise understanding is essential,” the researchers wrote in their technical report, published today. Fine-tuned for reasoning tasks that require precision and transparency, the AI model outperforms many of its peers on tasks ranging from interpreting financial charts to diagnosing medical images.

In tandem with...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Technology
SuperDial Acquires MajorBoost, Enhancing AI-Powered Phone Automation
AI driving uptick in venture capital investment in healthcare
NVIDIA Collaborates with IQVIA, Illumina, and Mayo Clinic to Drive Drug Discovery
In the future, we will all manage our own AI agents | Jensen Huang Q&A
Carta Healthcare to Power Oncology Research and Clinical Trials with Acquisition of Realyze Intelligence, a UPMC Enterprises Company

Share This Article