Hermes 4 - Llama-3.1 405B (Non-reasoning)
Benchmark scores, pricing, speed, and model comparisons for Hermes 4 - Llama-3.1 405B (Non-reasoning).
Benchmark results
10 tracked benchmark rows for Hermes 4 - Llama-3.1 405B (Non-reasoning).
MMLU Pro (Knowledge): 72.9%
LiveCodeBench (Code): 54.6%
GPQA (Reasoning): 53.6%
IFBench (Other): 34.8%
SciCode (Other): 34.6%
TAU2 (Other): 26.6%
LCR (Other): 20.0%
AIME 2025 (Math): 15.3%
TerminalBench Hard (Other): 9.8%
HLE (Other): 4.2%