AI Benchmark Model

Hermes 4 - Llama-3.1 405B (Reasoning)

Benchmark scores, pricing, speed, and model comparisons for Hermes 4 - Llama-3.1 405B (Reasoning).

Key scores

Intelligence18.6Index score
Coding16.0Index score
Math69.7Index score
Speed34.2Tokens/sec
Blended Cost$1.50Per 1M tokens

Review benchmark scores, pricing, performance data, and generated comparisons for this AI model.

Benchmark results

10 tracked benchmark rows for Hermes 4 - Llama-3.1 405B (Reasoning).

MMLU Pro Knowledge
82.9%
GPQA Reasoning
72.7%
AIME 2025 Math
69.7%
LiveCodeBench Code
68.6%
IFBench Other
32.7%
SciCode Other
25.2%
TAU2 Other
22.2%
LCR Other
20.7%
TerminalBench Hard Other
11.4%
HLE Other
10.3%

Comments (0)

No comments yet

Be the first to share your thoughts!