VentureBeat December 26, 2024
Shubham Sharma

Chinese AI startup DeepSeek, known for challenging leading AI vendors with its innovative open-source technologies, today released a new ultra-large model: DeepSeek-V3.

Available via Hugging Face under the company’s license agreement, the new model comes with 671B parameters but uses a mixture-of-experts architecture to activate only select parameters, in order to handle given tasks accurately and efficiently. According to benchmarks shared by DeepSeek, the offering is already topping the charts, outperforming leading open-source models, including Meta’s Llama 3.1-405B, and closely matching the performance of closed models from Anthropic and OpenAI.

The release marks another major development closing the gap between closed and open-source AI. Ultimately, DeepSeek, which started as an offshoot of Chinese quantitative hedge fund High-Flyer Capital Management, hopes...

Today's Sponsors

LEK
ZeOmega

Today's Sponsor

LEK

 
Topics: AI (Artificial Intelligence), Technology
Samsung’s C-Lab to Showcase AI and Health Projects at CES
Why One Startup CEO Is Excited About the White House’s New AI Czar Role
AI-Powered Smartphones Could Offset a Data Center Downturn
Trends 2025: AI in healthcare progressing despite reimbursement hurdles
Best of 2024: The Medical Economics Tech Issue on AI

Share This Article