VentureBeat January 28, 2025
Alibaba Cloud unveiled its Qwen2.5-Max model today, marking the second major artificial intelligence breakthrough from China in less than a week that has rattled U.S. technology markets and intensified concerns about America’s eroding AI leadership.
The new model outperforms DeepSeek’s R1 model, which sent Nvidia’s stock plunging 17% on Monday, in several key benchmarks including Arena-Hard, LiveBench, and LiveCodeBench. Qwen2.5-Max also demonstrates competitive results against industry leaders like GPT-4o and Claude-3.5-Sonnet in tests of advanced reasoning and knowledge.
“We have been building Qwen2.5-Max, a large MoE LLM pretrained on massive data and post-trained with curated SFT and RLHF recipes,” Alibaba Cloud announced in a blog post. The company emphasized its model’s efficiency, having been trained on over 20 trillion tokens...