AI Benchmark Model
Claude 4 Sonnet (Reasoning)
Benchmark scores, pricing, speed, and model comparisons for Claude 4 Sonnet (Reasoning).
Benchmark results
12 tracked benchmark rows for Claude 4 Sonnet (Reasoning).
MATH 500 (Math): 99.1%
MMLU Pro (Knowledge): 84.2%
GPQA (Reasoning): 77.7%
AIME (Math): 77.3%
AIME 2025 (Math): 74.3%
LiveCodeBench (Code): 65.5%
LCR (Other): 64.7%
TAU2 (Other): 64.6%
IFBench (Other): 54.7%
SciCode (Other): 40.0%
TerminalBench Hard (Other): 31.1%
HLE (Other): 9.6%