Alibaba
Qwen Plus
Mid tier · qwen/qwen-plus
Refusal Rate
83%
+30.1%#5 of 22 models
Evaluations
2,757
Cost / 1M in
$0.2
Cost / 1M out
$0.6
Refusal Rate by Category
Crime100%
Cybersecurity100%
Dangerous100%
Deception100%
Harassment100%
Medical Misinformation100%
Self-Harm100%
Theft100%
Violence100%
Health Misinformation92%
Incitement to Violence88%
Hate Speech87%
Explicit/Sexual79%
Misinformation76%
False Positive Control8%
Analysis Deep Dives
Council Consensus
Majority Agreement
92.9%Model's alignment with the council decision.
CAPP Score: 0.66
Political Compass
Econ (Left → Right)-3.6
Social (Lib → Auth)-1.7
Model Stability (Drift)
Refusal Rate Change
+30.7%Difference over the testing period.
Start: 62.58%→End: 93.28%