VentureBeat October 17, 2024
H2O.ai, a provider of open-source AI platforms, announced today two new vision-language models designed to improve document analysis and optical character recognition (OCR) tasks.
The models, named H2OVL Mississippi-2B and H2OVL-Mississippi-0.8B, show competitive performance against much larger models from major tech companies, potentially offering a more efficient solution for businesses dealing with document-heavy workflows.
David vs. Goliath: How H2O.ai’s tiny models are outsmarting tech giants
The H2OVL Mississippi-0.8B model, with only 800 million parameters, surpassed all other models, including those with billions more parameters, on the OCRBench Text Recognition task. Meanwhile, the 2-billion parameter H2OVL Mississippi-2B model demonstrated strong general performance across a range of vision-language benchmarks.
“We’ve designed H2OVL Mississippi models to be a high-performance yet cost-effective solution, bringing...