AI Benchmark Model
Hermes 4 - Llama-3.1 405B (Reasoning)
Benchmark scores, pricing, speed, and model comparisons for Hermes 4 - Llama-3.1 405B (Reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
10 tracked benchmarks for Hermes 4 - Llama-3.1 405B (Reasoning).
Benchmark            Category    Score
MMLU Pro             Knowledge   82.9%
GPQA                 Reasoning   72.7%
AIME 2025            Math        69.7%
LiveCodeBench        Code        68.6%
IFBench              Other       32.7%
SciCode              Other       25.2%
TAU2                 Other       22.2%
LCR                  Other       20.7%
TerminalBench Hard   Other       11.4%
HLE                  Other       10.3%