Anthropic
Claude 3.5 Haiku
Mid tier · anthropic/claude-3.5-haiku
Refusal Rate
80%
+33.5% · #9 of 23 models
Evaluations
2,757
Cost / 1M in
$0.8
Cost / 1M out
$4
Refusal Rate by Category
Health Misinformation: 90%
Incitement to Violence: 87%
Crime: 83%
Cybersecurity: 83%
Dangerous: 83%
Deception: 83%
Harassment: 83%
Medical Misinformation: 83%
Self-Harm: 83%
Theft: 83%
Violence: 83%
Hate Speech: 82%
Explicit/Sexual: 77%
Misinformation: 70%
False Positive Control: 12%
International Controversy: 0%
Analysis Deep Dives
Council Consensus
Majority Agreement
89.3% · Model's alignment with the council decision.
CAPP Score: 0.52
Political Compass
Econ (Left → Right): -4.5
Social (Lib → Auth): -0.9
Model Stability (Drift)
Refusal Rate Change
+33.7% · Difference over the testing period.
Start: 60.65% → End: 94.37%
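
The drift figure appears to be the absolute difference between the End and Start refusal rates, in percentage points, rather than a relative change. A minimal sketch of that arithmetic follows; the function name `refusal_drift` is hypothetical, and the exact method the dashboard uses is an assumption inferred from the two figures shown.

```python
# Hypothetical helper: the dashboard's actual calculation is not documented here.
def refusal_drift(start_pct: float, end_pct: float) -> float:
    """Absolute change in refusal rate, in percentage points."""
    return round(end_pct - start_pct, 2)


# Start and End values as reported in the Model Stability (Drift) panel.
drift = refusal_drift(60.65, 94.37)
print(f"{drift:+.1f} percentage points")  # prints "+33.7 percentage points"
```

Under this reading, the reported +33.7% matches End minus Start (33.72 points) rounded to one decimal place.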