Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Michael Ouroumis | 2 min read
Anthropic has released Claude Opus 4.6, a frontier model that the company says represents a step change in how AI handles real-world software engineering. The model is designed specifically for agentic workflows — tasks where the AI operates autonomously over extended periods, planning its approach, writing code, running tests, and iterating on failures without constant human guidance.

What Makes It Different

Previous Claude models were strong at answering coding questions and generating individual functions. Opus 4.6 is built for a different mode of operation: sustained, autonomous work across entire codebases.

The model can hold context across hundreds of files, reason about architectural decisions, and execute multi-step plans that span dozens of tool calls. Anthropic says the key improvement is what it calls "execution persistence" — the model's ability to maintain coherent intent across long task sequences without drifting or losing track of its original goal.
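The plan, execute, test, and iterate cycle described above can be pictured as a simple control loop. The sketch below is purely illustrative and is not Anthropic's implementation: `run_agent`, `propose_patch`, and `run_tests` are hypothetical names, and the model and test runner are replaced by stub callables.

```python
from typing import Callable, Optional


def run_agent(
    goal: str,
    propose_patch: Callable[[str, Optional[str]], str],
    run_tests: Callable[[str], Optional[str]],
    max_iterations: int = 5,
) -> Optional[str]:
    """Iterate until the tests pass or the iteration budget runs out.

    propose_patch(goal, last_failure) returns a candidate patch.
    run_tests(patch) returns None on success, or a failure message.
    """
    failure: Optional[str] = None
    for _ in range(max_iterations):
        # The original goal is passed in on every step, so it is never dropped.
        patch = propose_patch(goal, failure)
        failure = run_tests(patch)
        if failure is None:
            return patch  # tests pass: stop iterating
    return None  # budget exhausted without a passing patch


if __name__ == "__main__":
    # Stubs standing in for a model and a test suite (assumptions for the demo).
    def fake_model(goal: str, failure: Optional[str]) -> str:
        return "fixed" if failure else "first attempt"

    def fake_tests(patch: str) -> Optional[str]:
        return None if patch == "fixed" else "test_login failed"

    print(run_agent("fix login bug", fake_model, fake_tests))
```

In this toy version, "not drifting" simply means the same `goal` string is fed back on every iteration alongside the latest failure; a real agent would carry far richer state.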

"We trained Opus 4.6 to think like a senior engineer working through a complex ticket," said Dario Amodei, Anthropic CEO. "It doesn't just generate code. It plans, executes, tests, and course-corrects."

Benchmark Results

Anthropic reports that Opus 4.6 achieves 72.3% on SWE-bench Verified, a benchmark that measures a model's ability to resolve real GitHub issues from popular open-source repositories. That is 4.2 percentage points above GPT-5's reported 68.1% and 8.3 points above Claude Opus 4's 64.0%.

On internal agentic benchmarks that measure multi-step task completion across repositories with over 100,000 lines of code, Opus 4.6 scores 58.7% — nearly double the 31.2% achieved by Claude Sonnet 4.
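The comparisons in this section reduce to simple arithmetic on the reported scores. A small sketch, using only the figures quoted in this article (the variable and function names are illustrative):

```python
# Scores as reported in the article, in percent.
swe_bench_verified = {
    "Claude Opus 4.6": 72.3,
    "GPT-5": 68.1,
    "Claude Opus 4": 64.0,
}
internal_agentic = {
    "Claude Opus 4.6": 58.7,
    "Claude Sonnet 4": 31.2,
}


def point_gap(scores: dict, a: str, b: str) -> float:
    """Difference between two models' scores, in percentage points."""
    return round(scores[a] - scores[b], 1)


def ratio(scores: dict, a: str, b: str) -> float:
    """How many times larger model a's score is than model b's."""
    return round(scores[a] / scores[b], 2)


if __name__ == "__main__":
    print(point_gap(swe_bench_verified, "Claude Opus 4.6", "GPT-5"))          # 4.2
    print(point_gap(swe_bench_verified, "Claude Opus 4.6", "Claude Opus 4"))  # 8.3
    print(ratio(internal_agentic, "Claude Opus 4.6", "Claude Sonnet 4"))      # 1.88
```

A ratio of about 1.88 is what the article rounds to "nearly double".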

Claude Code Integration

The most immediate impact will come through Claude Code, Anthropic's command-line coding assistant. With Opus 4.6, Claude Code can now handle tasks such as migrating a codebase from one framework to another, implementing features described in GitHub issues, and resolving complex bugs that span multiple services.

Anthropic demonstrated Claude Code autonomously implementing a full authentication system — database schema, API routes, middleware, frontend components, and tests — from a single natural language description in under eight minutes.

Pricing and Availability

Opus 4.6 is available immediately through Anthropic's API at $15 per million input tokens and $75 per million output tokens. It is also accessible through Amazon Bedrock and Google Cloud Vertex AI.
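As a rough illustration of what these rates mean per task, the sketch below estimates the cost of a single request from its token counts. The function name `estimate_cost_usd` and the example token counts are hypothetical; the rates are the ones quoted above.

```python
# Rates quoted in the article, in USD per million tokens.
INPUT_USD_PER_MILLION = 15.0
OUTPUT_USD_PER_MILLION = 75.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical agentic session: 2M tokens read, 200k tokens written.
    print(f"${estimate_cost_usd(2_000_000, 200_000):.2f}")  # $45.00
```

Because output tokens cost five times as much as input tokens at these rates, sessions that generate a lot of code will be dominated by the output side of the bill.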

For Claude Code users on the Max plan, Opus 4.6 is the new default model. Pro plan users can access it with usage limits.

Industry Response

The release intensifies the competition in AI-assisted development. GitHub Copilot, Cursor, and other tools now face a model that is purpose-built for the agentic paradigm they are all racing toward. The question is no longer whether AI can write code — it is how much of the software development lifecycle AI can handle autonomously.
