AI Benchmark Model
Hermes 4 - Llama-3.1 70B (Reasoning)
Benchmark scores, pricing, speed, and model comparisons for Hermes 4 - Llama-3.1 70B (Reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
10 tracked benchmark rows for Hermes 4 - Llama-3.1 70B (Reasoning).
Benchmark           Category    Score
MMLU Pro            Knowledge   81.1%
GPQA                Reasoning   69.9%
AIME 2025           Math        68.7%
LiveCodeBench       Code        65.3%
SciCode             Other       34.1%
IFBench             Other       31.3%
TAU2                Other       22.5%
HLE                 Other       7.9%
LCR                 Other       6.7%
TerminalBench Hard  Other       4.5%