AI Benchmark Model
Claude 4.1 Opus (Reasoning)
Benchmark scores, pricing, speed, and model comparisons for Claude 4.1 Opus (Reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
10 tracked benchmark rows for Claude 4.1 Opus (Reasoning).
MMLU Pro Knowledge
88.0% GPQA Reasoning
80.9% AIME 2025 Math
80.3% TAU2 Other
71.4% LCR Other
66.3% LiveCodeBench Code
65.4% IFBench Other
55.4% SciCode Other
40.9% TerminalBench Hard Other
34.3% HLE Other
11.9%
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!