Anthropic

Claude Opus 4.6

Arena Score: 1506
Context Window: 1M
Pricing (in/out): $15/$75
Category: Proprietary

Claude Opus 4.6 with Thinking is Anthropic's flagship model, released in April 2026. It tops the LMSYS Chatbot Arena with a score of 1506 and scores 80.8% on SWE-bench Verified, placing it among the strongest models for agentic coding tasks. It offers a 1M-token context window with 76% recall on MRCR v2 (8-needle), adaptive thinking with effort controls, and a 128K-token maximum output. Opus 4.6 excels at complex reasoning, long-document analysis, and autonomous multi-step workflows via Claude Code.
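To make the listed pricing and limits concrete, here is a minimal cost-estimate sketch. It assumes the "$15/$75" figure means US dollars per 1M input and output tokens at a flat rate, with no caching discounts or long-context surcharges; the page does not state the unit, so treat that as an assumption. The function name and the limit checks are illustrative, not part of any official SDK.

```python
# Assumption: "$15/$75" is USD per 1M input / output tokens, billed flat.
INPUT_USD_PER_MTOK = 15.0
OUTPUT_USD_PER_MTOK = 75.0

# Limits quoted on this page (1M-token context, 128K max output tokens).
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 128_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: the prompt alone must fit in the context window.
    if input_tokens > CONTEXT_WINDOW_TOKENS:
        raise ValueError("input exceeds the 1M-token context window")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the 128K-token maximum")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # 200,000 prompt tokens and 4,000 reply tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.2f}")
```

Under these assumptions, a 200,000-token prompt with a 4,000-token reply would cost about $3.30, with the prompt accounting for $3.00 of that.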

Benchmarks

Benchmark                 Score
SWE-bench Verified        80.8%
MRCR v2 (1M, 8-needle)    76%
Terminal-Bench 2.0        65.4%
LMSYS Arena               1506
