What’s Improved in AI Models Sonnet & Opus
Digest more
Bowman later edited his tweet and the following one in a thread to read as follows, but it still didn't convince the naysayers.
In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that "validated [Claude's] capabilities with a demanding open-source refactor running independently for 7 hours with sustained performance," Anthropic said in a news release.
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.
Anthropic's newest editon of its flagship AI product will address significant limitations in current large language models.
AI company Anthropic today announced the launch of two new Claude models, Claude Opus 4 and Claude Sonnet 4. Anthropic says that the models
Explore more
When Anthropic’s older Claude model played Pokémon Red, it spent “dozens of hours” stuck in one city and had trouble identifying nonplayer characters. With Claude 4 Opus, the team noticed an improvement in Claude’s long-term memory and planning capabilities.