GPT-5.2 vs Claude Opus 4.6
Complete benchmark comparison on coding, reasoning, tool use, cost, and latency. Updated March 2026.
GPT-5.2 (xhigh)
by OpenAI
Coding: 9.6
Reasoning: 9.5
Tool Use: 9.5
Input: $1.75 / 1M tokens
Output: $14 / 1M tokens
Context: 400K tokens
Strengths
- Best coding performance
- Strong agentic capabilities
- Excellent tool integration
- 400K context
Weaknesses
- Output pricing ($14/1M) is 8x its input rate
- Newer model, less battle-tested
Claude Opus 4.6 (Adaptive)
by Anthropic
Coding: 9.6
Reasoning: 9.7
Tool Use: 9.4
Input: $18.75 / 1M tokens
Output: $93.75 / 1M tokens
Context: 200K tokens
Strengths
- Top-tier intelligence
- Adaptive thinking
- Best for complex reasoning
- Prompt caching
Weaknesses
- Most expensive
- 200K context vs GPT-5.2's 400K
🏆 The Verdict
Both models tie on coding at 9.6, and both are exceptional. GPT-5.2's slightly stronger tool use and far lower pricing make it the pick for coding and agentic work; choose Claude Opus 4.6 for deep reasoning and analysis.
- Best for Coding: GPT-5.2
- Best for Reasoning: Claude Opus 4.6
- Best for Agents: GPT-5.2
- Best for Cost: GPT-5.2
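To make the cost gap concrete, here is a rough per-request calculation using the listed per-1M-token prices. The workload size (50K input tokens, 5K output tokens) is a hypothetical example, not a benchmark figure.

```python
# Per-1M-token prices (input, output) taken from the comparison above.
PRICES = {
    "GPT-5.2": (1.75, 14.00),
    "Claude Opus 4.6": (18.75, 93.75),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request for the given token counts."""
    inp_price, out_price = PRICES[model]
    return input_tokens / 1e6 * inp_price + output_tokens / 1e6 * out_price

for model in PRICES:
    cost = request_cost(model, input_tokens=50_000, output_tokens=5_000)
    print(f"{model}: ${cost:.4f} per request")
```

For this sample workload, GPT-5.2 comes to about $0.16 per request versus roughly $1.41 for Claude Opus 4.6, around a 9x difference, which is what drives the cost verdict above.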