What is Claude Opus 4.6?

Claude Opus 4.6 is Anthropic's latest frontier model optimized for agentic coding workflows. It can autonomously plan, write, test, and debug code across large multi-file projects with significantly less human intervention than previous models.

How does Claude Opus 4.6 compare to GPT-5 for coding?

Anthropic reports that Opus 4.6 scores 72.3% on SWE-bench Verified, surpassing GPT-5's reported 68.1%. The model shows particular strength in multi-step reasoning across codebases and long-running agentic tasks.

Is Claude Opus 4.6 available via API?

Yes, Claude Opus 4.6 is available immediately through Anthropic's API and through Claude Code, Anthropic's CLI tool for developers. It is also accessible via Amazon Bedrock and Google Cloud Vertex AI.

Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

Anthropic has released Claude Opus 4.6, a frontier model that the company says represents a step change in how AI handles real-world software engineering. The model is designed specifically for agentic workflows — tasks where the AI operates autonomously over extended periods, planning its approach, writing code, running tests, and iterating on failures without constant human guidance.

What Makes It Different

Previous Claude models were strong at answering coding questions and generating individual functions. Opus 4.6 is built for a different mode of operation: sustained, autonomous work across entire codebases.

The model can hold context across hundreds of files, reason about architectural decisions, and execute multi-step plans that span dozens of tool calls. Anthropic says the key improvement is what it calls "execution persistence" — the model's ability to maintain coherent intent across long task sequences without drifting or losing track of its original goal.

"We trained Opus 4.6 to think like a senior engineer working through a complex ticket," said Dario Amodei, Anthropic CEO. "It doesn't just generate code. It plans, executes, tests, and course-corrects."

Benchmark Results

Anthropic reports that Opus 4.6 achieves 72.3% on SWE-bench Verified, a benchmark that measures a model's ability to resolve real GitHub issues from popular open-source repositories. This tops GPT-5's reported 68.1% and represents a significant jump from Claude Opus 4's 64.0%.

On internal agentic benchmarks that measure multi-step task completion across repositories with over 100,000 lines of code, Opus 4.6 scores 58.7% — nearly double the 31.2% achieved by Claude Sonnet 4.

Claude Code Integration

The most immediate impact will be felt through Claude Code, Anthropic's command-line coding assistant. Claude Code with Opus 4.6 can now handle tasks like migrating a codebase from one framework to another, implementing features described in GitHub issues, and resolving complex bugs that span multiple services.

Anthropic demonstrated Claude Code autonomously implementing a full authentication system — database schema, API routes, middleware, frontend components, and tests — from a single natural language description in under eight minutes.

Pricing and Availability

Opus 4.6 is available immediately through Anthropic's API at $15 per million input tokens and $75 per million output tokens. It is also accessible through Amazon Bedrock and Google Cloud Vertex AI.

For Claude Code users on the Max plan, Opus 4.6 is the new default model. Pro plan users can access it with usage limits.

Industry Response

The release intensifies the competition in AI-assisted development. GitHub Copilot, Cursor, and other tools now face a model that is purpose-built for the agentic paradigm they are all racing toward. The question is no longer whether AI can write code — it is how much of the software development lifecycle AI can handle autonomously.

Anthropic Releases Claude Opus 4.6 — Its Most Capable Agentic Coding Model

What Makes It Different

Benchmark Results

Claude Code Integration

Pricing and Availability

Industry Response

More in Models

Moonshot Kimi K2.6 lands open-source, scales to 300 sub-agents and 4,000 coordinated steps

OpenAI's 'Spud' Caught Live in API Testing, Polymarket Jumps to 81% for April 23 Launch

OpenAI Launches GPT-Rosalind, Its First Domain-Specific Model Built for Life Sciences