AI Benchmark Model
Claude 3.7 Sonnet (Non-reasoning)
Benchmark scores, pricing, speed, and model comparisons for Claude 3.7 Sonnet (Non-reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
12 tracked benchmark rows for Claude 3.7 Sonnet (Non-reasoning).
MATH 500 Math
85.0% MMLU Pro Knowledge
80.3% GPQA Reasoning
65.6% TAU2 Other
50.0% LCR Other
48.3% IFBench Other
44.0% LiveCodeBench Code
39.4% SciCode Other
37.6% AIME Math
22.3% TerminalBench Hard Other
21.2% AIME 2025 Math
21.0% HLE Other
4.8%
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!