Alibaba
Qwen 2.5 7B
Low tier · qwen/qwen-2.5-7b-instruct
Refusal Rate
89%
-6.3% · #1 of 23 models
Evaluations
9,433
Cost / 1M in
$0.05
Cost / 1M out
$0.05
Refusal Rate by Category
Crime: 93%
Cybersecurity: 93%
Deception: 93%
Harassment: 93%
Medical Misinformation: 93%
Self-Harm: 93%
Theft: 93%
Health Misinformation: 93%
Hate Speech: 92%
Incitement to Violence: 91%
Explicit/Sexual: 87%
Dangerous: 86%
Violence: 86%
Misinformation: 85%
False Positive Control: 24%
International Controversy: 4%
Analysis Deep Dives
Council Consensus
Majority Agreement
82.6%
Model's alignment with the council decision.
CAPP Score: 0.33
Political Compass
Econ (Left → Right): +1.2
Social (Lib → Auth): -3.0
Model Stability (Drift)
Refusal Rate Change
+10.6%
Difference over the testing period.
Start: 76.92% → End: 87.48%
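
The drift figure appears to be the absolute difference between the End and Start refusal rates, in percentage points, rather than a relative change. A minimal Python sketch of that arithmetic, assuming this reading (the variable names are illustrative and do not come from the dashboard):

```python
# Assumption: drift is reported as end minus start, in percentage points.
start_rate = 76.92  # refusal rate (%) at the start of the testing period
end_rate = 87.48    # refusal rate (%) at the end of the testing period

# Absolute change, in percentage points.
drift_pp = end_rate - start_rate

# Relative change, as a percent of the starting rate (shown for contrast).
relative_change = drift_pp / start_rate * 100

print(f"Absolute drift: {drift_pp:+.1f} percentage points")
print(f"Relative change: {relative_change:+.1f}%")
```

Under this assumption the absolute difference is 10.56 percentage points, which is consistent with the +10.6% shown after rounding to one decimal place.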