Forbes December 24, 2024
John Werner

It’s the end of ‘shipmas’, almost Christmas, and OpenAI has given us some information about its upcoming o3 model and how it reasons.

One of the most prominent demos is in this YouTube video with Sam Altman, who is joined by Mark Chen, Hongyu Ren, and special guest Greg Kamradt to talk about o3 and related models.

“This model is incredible at programming,” Altman says as they look at benchmarks like GPQA Diamond for Ph.D.-level science questions and Epoch AI’s FrontierMath for math, where o3 demonstrates breakout results.

As demonstrated, the model scores well on tests designed to challenge skilled human professionals.

The group also discussed the use of these new models for SWE-bench operations, or in...

Topics: AI (Artificial Intelligence), Technology