AI Benchmark Model

Claude 3.7 Sonnet (Reasoning)

Benchmark scores, pricing, speed, and model comparisons for Claude 3.7 Sonnet (Reasoning).

Key scores

Intelligence: 34.7 (index score)
Coding: 27.6 (index score)
Math: 56.3 (index score)

Benchmark results

12 tracked benchmark rows for Claude 3.7 Sonnet (Reasoning).

MATH 500 (Math): 94.7%
MMLU Pro (Knowledge): 83.7%
GPQA (Reasoning): 77.2%
LCR (Other): 60.7%
AIME 2025 (Math): 56.3%
TAU2 (Other): 54.7%
AIME (Math): 48.7%
IFBench (Other): 48.3%
LiveCodeBench (Code): 47.3%
SciCode (Other): 40.3%
TerminalBench Hard (Other): 21.2%
HLE (Other): 10.3%
