Anthropic Unveils Claude 4: Advancing the Future of Intelligent Agents and AI Programming

Friday, May 23, 2025

Anthropic has introduced its newest Claude 4 model series, promising substantial improvements for developers who build AI agents or work on programming tasks. The lineup centers on Claude Opus 4, its latest high-performance model, and Claude Sonnet 4, positioned as a versatile general-purpose option.

Anthropic has set high ambitions, saying the models are built to advance AI strategies across a range of domains. Opus 4 is positioned as a key tool for advancing coding, research, writing, and scientific discovery, while Sonnet 4 is presented as a substantial upgrade from Sonnet 3.7 that brings top-tier performance to everyday applications.

Anthropic bills Claude Opus 4 as its most powerful model so far and as a leading coding model worldwide. According to the company's figures, Opus 4 leads key industry evaluations, scoring 72.5% on SWE-bench Verified and 43.2% on Terminal-bench.

Its focus is not limited to short tasks, however. Opus 4 is built for long-running work, delivering what Anthropic calls "sustained performance on extended tasks that demand detailed effort over multiple steps." Anthropic asserts that the model can operate continuously for several hours.

Anthropic presents this as a significant leap over earlier Sonnet models, one that widens the range of problems AI agents can tackle when sustained effort is required.

While Opus 4 is the heavyweight of the series, Claude Sonnet 4 is positioned as an adaptable workhorse, promising notable improvements across a variety of applications. Companies that previewed it report positive results.

For instance, GitHub says Claude Sonnet 4 excels in agentic scenarios and that it plans to use it as the foundational model for the new coding agent in GitHub Copilot, a strong signal of support.

Manus, which builds AI agents, shares these sentiments, highlighting advancements in following intricate instructions, delivering clear reasoning, and presenting refined outputs.

Positive feedback also comes from iGent, which says Sonnet 4 excels at autonomous multi-feature app development and shows notable improvements in problem-solving and codebase navigation. The company reports that navigation errors fell from 20% to nearly zero, an impactful change for development processes.

Sourcegraph shares this excitement, viewing the model as a significant advancement in software development: it stays focused longer, understands problems more thoroughly, and produces more refined code.

Augment Code noticed "higher success rates, more precise code adjustments, and meticulous execution of complex tasks," prompting them to choose Sonnet 4 as their primary model.

A notable feature of the Claude 4 series is its dual-mode capability. Both Opus 4 and Sonnet 4 can either respond quickly or switch to a deeper analysis mode that spends more time on reasoning.

This in-depth mode is included, along with both models, in the Pro, Max, Team, and Enterprise Claude plans. Sonnet 4 is also accessible to free users, a step towards making high-quality AI widely available.

Anthropic is also launching new developer tools via its API, clearly aimed at boosting the development of more advanced AI agents.

Anthropic emphasizes that its "Claude 4 models excel on SWE-bench Verified, a benchmark evaluating real software engineering tasks." Beyond programming, they highlight that these models deliver robust performance across coding, reasoning, multimodal abilities, and agent-oriented tasks.

Even with these advancements, Anthropic maintains stable pricing. Claude Opus 4 is priced at $15 per million input tokens and $75 per million output tokens, while Claude Sonnet 4, the more accessible version, costs $3 per million input tokens and $15 per million output tokens. This price stability will likely be appreciated by current users.
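To make the per-token prices concrete, the following is a minimal sketch of how a request's cost could be estimated from the rates quoted above. The dictionary keys ("opus-4", "sonnet-4"), the `Pricing` class, and the `estimate_cost` function are hypothetical names chosen for illustration, not identifiers from Anthropic's API, and the token counts in the usage example are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """US dollars per one million tokens, as quoted in the article."""
    input_per_mtok: float
    output_per_mtok: float


# Hypothetical labels for illustration; these are not official API model IDs.
PRICES = {
    "opus-4": Pricing(input_per_mtok=15.0, output_per_mtok=75.0),
    "sonnet-4": Pricing(input_per_mtok=3.0, output_per_mtok=15.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in US dollars for one workload."""
    price = PRICES[model]
    total = (input_tokens * price.input_per_mtok
             + output_tokens * price.output_per_mtok)
    return total / 1_000_000


if __name__ == "__main__":
    # Invented workload: 200,000 input tokens and 50,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 200_000, 50_000):.2f}")
```

For this invented workload the sketch gives $6.75 for Opus 4 and $1.35 for Sonnet 4, reflecting the fivefold price gap between the two models at both input and output rates.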

Both Claude Opus 4 and Sonnet 4 are now available through the Anthropic API, as well as on Amazon Bedrock and Google Cloud's Vertex AI. This broad availability gives businesses and developers several routes to evaluate and integrate the models.

Anthropic is evidently focused on enhancing AI capabilities, particularly within the intricate fields of programming and autonomous agent behavior. The introduction of these models and accompanying developer tools marks a significant opportunity for innovation.

(Image credit: Anthropic)
