OpenAI’s o3 AI Model Falls Short of Benchmark Claims in FrontierMath Test

OpenAI’s o3 artificial intelligence (AI) model, which was released last week, is underperforming on the FrontierMath benchmark. Epoch AI, the company behind FrontierMath, highlighted that the publicly available version of o3 scored 10 percent on the test, much lower than the score OpenAI claimed at launch.

from Gadgets 360 https://ift.tt/FJuXZti
