AI Benchmark Model
Claude 3.5 Sonnet (Oct '24)
Benchmark scores, pricing, speed, and model comparisons for Claude 3.5 Sonnet (Oct '24).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
7 tracked benchmark rows for Claude 3.5 Sonnet (Oct '24).
MMLU Pro Knowledge
77.2% MATH 500 Math
77.1% GPQA Reasoning
59.9% LiveCodeBench Code
38.1% SciCode Other
36.6% AIME Math
15.7% HLE Other
3.9%
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!