VentureBeat October 17, 2024
Michael Nuñez

H2O.ai, a provider of open-source AI platforms, announced today two new vision-language models designed to improve document analysis and optical character recognition (OCR) tasks.

The models, named H2OVL Mississippi-2B and H2OVL-Mississippi-0.8B, show competitive performance against much larger models from major tech companies, potentially offering a more efficient solution for businesses dealing with document-heavy workflows.

David vs. Goliath: How H2O.ai’s tiny models are outsmarting tech giants

The H2OVL Mississippi-0.8B model, with only 800 million parameters, surpassed all other models, including those with billions more parameters, on the OCRBench Text Recognition task. Meanwhile, the 2-billion parameter H2OVL Mississippi-2B model demonstrated strong general performance across a range of vision-language benchmarks.

“We’ve designed H2OVL Mississippi models to be a high-performance yet cost-effective solution, bringing...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Technology
Google digs deeper into healthcare AI: 5 notes
JP Morgan Annual Healthcare Conference 2025: What are the key talking points likely to be?
How AI Has And Will Continue To Transform Healthcare
AI Translates Nature into New Medicines | StartUp Health Insights: Week of Nov 26, 2024
Building AI trust: The key role of explainability

Share This Article