Back to stories
Models

Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Michael Ouroumis2 min read
Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Anthropic has released Claude Opus 4.6, a frontier model that the company says represents a step change in how AI handles real-world software engineering. The model is designed specifically for agentic workflows — tasks where the AI operates autonomously over extended periods, planning its approach, writing code, running tests, and iterating on failures without constant human guidance.

What Makes It Different

Previous Claude models were strong at answering coding questions and generating individual functions. Opus 4.6 is built for a different mode of operation: sustained, autonomous work across entire codebases.

The model can hold context across hundreds of files, reason about architectural decisions, and execute multi-step plans that span dozens of tool calls. Anthropic says the key improvement is what it calls "execution persistence" — the model's ability to maintain coherent intent across long task sequences without drifting or losing track of its original goal.

"We trained Opus 4.6 to think like a senior engineer working through a complex ticket," said Dario Amodei, Anthropic CEO. "It doesn't just generate code. It plans, executes, tests, and course-corrects."

Benchmark Results

Anthropic reports that Opus 4.6 achieves 72.3% on SWE-bench Verified, a benchmark that measures a model's ability to resolve real GitHub issues from popular open-source repositories. This tops GPT-5's reported 68.1% and represents a significant jump from Claude Opus 4's 64.0%.

On internal agentic benchmarks that measure multi-step task completion across repositories with over 100,000 lines of code, Opus 4.6 scores 58.7% — nearly double the 31.2% achieved by Claude Sonnet 4.

Claude Code Integration

The most immediate impact will be felt through Claude Code, Anthropic's command-line coding assistant. Claude Code with Opus 4.6 can now handle tasks like migrating a codebase from one framework to another, implementing features described in GitHub issues, and resolving complex bugs that span multiple services.

Anthropic demonstrated Claude Code autonomously implementing a full authentication system — database schema, API routes, middleware, frontend components, and tests — from a single natural language description in under eight minutes.

Pricing and Availability

Opus 4.6 is available immediately through Anthropic's API at $15 per million input tokens and $75 per million output tokens. It is also accessible through Amazon Bedrock and Google Cloud Vertex AI.

For Claude Code users on the Max plan, Opus 4.6 is the new default model. Pro plan users can access it with usage limits.

Industry Response

The release intensifies the competition in AI-assisted development. GitHub Copilot, Cursor, and other tools now face a model that is purpose-built for the agentic paradigm they are all racing toward. The question is no longer whether AI can write code — it is how much of the software development lifecycle AI can handle autonomously.

More in Models

Microsoft Releases Phi-4-Reasoning-Vision-15B: A Small Model That Knows When to Think
Models

Microsoft Releases Phi-4-Reasoning-Vision-15B: A Small Model That Knows When to Think

Microsoft open-sources Phi-4-reasoning-vision-15B, a compact 15B-parameter multimodal model that selectively activates chain-of-thought reasoning and rivals models many times its size.

8 hours ago2 min read
Meta Releases Llama 4 Maverick With 400B Parameters Under Open Weights
Models

Meta Releases Llama 4 Maverick With 400B Parameters Under Open Weights

Meta releases Llama 4 Maverick, a 400-billion parameter mixture-of-experts model under its open weights license, matching GPT-5 on key benchmarks and reigniting the open-source AI debate.

1 day ago2 min read
OpenAI Releases GPT-5.3 Instant: 27% Fewer Hallucinations and a Less 'Cringe' Personality
Models

OpenAI Releases GPT-5.3 Instant: 27% Fewer Hallucinations and a Less 'Cringe' Personality

OpenAI rolls out GPT-5.3 Instant as ChatGPT's new default model, delivering significant reductions in hallucination rates, fewer unnecessary refusals, and a more natural conversational tone.

1 day ago2 min read