News
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and ...
Claude Opus 4, now alleged as the world’s best coding model, delivers a record-setting 72.5% on SWE-bench and 43.2% on Terminal-bench, outperforming all competitors on long-running and complex ...
Anthropic launches its Claude 4 series, featuring Opus 4 and Sonnet 4, delivering new AI benchmarks in coding, advanced ...
Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid ...
Learn how Claude 4’s advanced AI features make it a game-changer in writing, data analysis, and human-AI collaboration.
In a benchmark for performance comparing models on software engineering tasks, Anthropic’s latest models, Claude Opus 4 and Claude Sonnet 4, outperformed OpenAI and Google’s Gemini 2.5 Pro.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results