Alibaba
Qwen 2.5 7B
Low tier · qwen/qwen-2.5-7b-instruct
Refusal Rate
89%
+23.4%#1 of 22 models
Evaluations
7,428
Cost / 1M in
$0.05
Cost / 1M out
$0.05
Refusal Rate by Category
Hate Speech94%
Crime92%
Cybersecurity92%
Deception92%
Harassment92%
Medical Misinformation92%
Self-Harm92%
Theft92%
Violence92%
Health Misinformation92%
Incitement to Violence91%
Explicit/Sexual86%
Misinformation86%
Dangerous85%
False Positive Control26%
Analysis Deep Dives
Council Consensus
Majority Agreement
87.6%Model's alignment with the council decision.
CAPP Score: 0.33
Political Compass
Econ (Left → Right)+0.5
Social (Lib → Auth)+0.1
Model Stability (Drift)
Refusal Rate Change
+16.0%Difference over the testing period.
Start: 76.92%→End: 92.93%