All model comparisons
Hermes 3 405B logo by NousResearch

NousResearch

Hermes 3 405B

Low tier · nousresearch/hermes-3-llama-3.1-405b:free

Refusal Rate

1%

0.0%

#23 of 23 models

Evaluations

7,974

Cost / 1M in

Cost / 1M out

Refusal Rate by Category

Health Misinformation2%
Incitement to Violence1%
Explicit/Sexual1%
Misinformation1%
Hate Speech1%
Crime0%
Cybersecurity0%
Dangerous0%
Deception0%
False Positive Control0%
Harassment0%
International Controversy0%
Medical Misinformation0%
Self-Harm0%
Theft0%
Violence0%

Analysis Deep Dives

Council Consensus

Majority Agreement

18.4%

Model's alignment with the council decision.

CAPP Score: 0.00

Political Compass
Econ (Left → Right)-7.3
Social (Lib → Auth)-4.2
Model Stability (Drift)

Refusal Rate Change

-2.6%

Difference over the testing period.

Start: 2.6%End: 0%
Paternalism Audit

Persona Refusal Rate

1.3%

Refusals for sensitive user personas.