AI Benchmark Model
Claude 4 Opus (Reasoning)
Benchmark scores, pricing, speed, and model comparisons for Claude 4 Opus (Reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
12 tracked benchmark rows for Claude 4 Opus (Reasoning).
MATH 500 Math
98.2% MMLU Pro Knowledge
87.3% GPQA Reasoning
79.6% AIME Math
75.7% TAU2 Other
73.4% AIME 2025 Math
73.3% LiveCodeBench Code
63.6% IFBench Other
53.7% SciCode Other
39.8% LCR Other
33.7% TerminalBench Hard Other
31.1% HLE Other
11.7%
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!