Back to stories
Models

Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Michael Ouroumis2 min read
Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Anthropic has released Claude Opus 4.6, a frontier model that the company says represents a step change in how AI handles real-world software engineering. The model is designed specifically for agentic workflows — tasks where the AI operates autonomously over extended periods, planning its approach, writing code, running tests, and iterating on failures without constant human guidance.

What Makes It Different

Previous Claude models were strong at answering coding questions and generating individual functions. Opus 4.6 is built for a different mode of operation: sustained, autonomous work across entire codebases.

The model can hold context across hundreds of files, reason about architectural decisions, and execute multi-step plans that span dozens of tool calls. Anthropic says the key improvement is what it calls "execution persistence" — the model's ability to maintain coherent intent across long task sequences without drifting or losing track of its original goal.

"We trained Opus 4.6 to think like a senior engineer working through a complex ticket," said Dario Amodei, Anthropic CEO. "It doesn't just generate code. It plans, executes, tests, and course-corrects."

Benchmark Results

Anthropic reports that Opus 4.6 achieves 72.3% on SWE-bench Verified, a benchmark that measures a model's ability to resolve real GitHub issues from popular open-source repositories. This tops GPT-5's reported 68.1% and represents a significant jump from Claude Opus 4's 64.0%.

On internal agentic benchmarks that measure multi-step task completion across repositories with over 100,000 lines of code, Opus 4.6 scores 58.7% — nearly double the 31.2% achieved by Claude Sonnet 4.

Claude Code Integration

The most immediate impact will be felt through Claude Code, Anthropic's command-line coding assistant. Claude Code with Opus 4.6 can now handle tasks like migrating a codebase from one framework to another, implementing features described in GitHub issues, and resolving complex bugs that span multiple services.

Anthropic demonstrated Claude Code autonomously implementing a full authentication system — database schema, API routes, middleware, frontend components, and tests — from a single natural language description in under eight minutes.

Pricing and Availability

Opus 4.6 is available immediately through Anthropic's API at $15 per million input tokens and $75 per million output tokens. It is also accessible through Amazon Bedrock and Google Cloud Vertex AI.

For Claude Code users on the Max plan, Opus 4.6 is the new default model. Pro plan users can access it with usage limits.

Industry Response

The release intensifies the competition in AI-assisted development. GitHub Copilot, Cursor, and other tools now face a model that is purpose-built for the agentic paradigm they are all racing toward. The question is no longer whether AI can write code — it is how much of the software development lifecycle AI can handle autonomously.

Learn AI for Free — FreeAcademy.ai

Take "AI Essentials: Understanding AI in 2026" — a free course with certificate to master the skills behind this story.

More in Models

Moonshot Kimi K2.6 lands open-source, scales to 300 sub-agents and 4,000 coordinated steps
Models

Moonshot Kimi K2.6 lands open-source, scales to 300 sub-agents and 4,000 coordinated steps

Moonshot AI shipped Kimi K2.6 as a generally available open-source model on April 20, posting 58.6 on SWE-Bench Pro — ahead of GPT-5.4 and Claude Opus 4.6 — while scaling agent swarms to 300 sub-agents and 4,000 coordinated steps.

9 hours ago3 min read
OpenAI's 'Spud' Caught Live in API Testing, Polymarket Jumps to 81% for April 23 Launch
Models

OpenAI's 'Spud' Caught Live in API Testing, Polymarket Jumps to 81% for April 23 Launch

API monitors detected OpenAI's next frontier model — codenamed Spud — running in production-scale testing on April 19, sending Polymarket traders to an 81% implied probability of a public launch on April 23.

1 day ago2 min read
OpenAI Launches GPT-Rosalind, Its First Domain-Specific Model Built for Life Sciences
Models

OpenAI Launches GPT-Rosalind, Its First Domain-Specific Model Built for Life Sciences

OpenAI debuts GPT-Rosalind, a specialized AI model for biology, drug discovery, and genomics, with launch partners including Amgen, Moderna, and Los Alamos National Laboratory.

4 days ago2 min read