AI Benchmark Model
Hermes 4 - Llama-3.1 70B (Non-reasoning)
Benchmark scores, pricing, speed, and model comparisons for Hermes 4 - Llama-3.1 70B (Non-reasoning).
Key scores
Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.
Benchmark results
10 tracked benchmark rows for Hermes 4 - Llama-3.1 70B (Non-reasoning).
MMLU Pro Knowledge
66.4% GPQA Reasoning
49.1% IFBench Other
29.0% SciCode Other
27.7% LiveCodeBench Code
26.9% TAU2 Other
21.6% AIME 2025 Math
11.3% HLE Other
3.6% LCR Other
2.0% TerminalBench Hard Other
0.0%
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!