All model comparisons
Qwen Plus logo by Alibaba

Alibaba

Qwen Plus

Mid tier · qwen/qwen-plus

Refusal Rate

83%

+30.1%

#5 of 22 models

Evaluations

2,757

Cost / 1M in

$0.2

Cost / 1M out

$0.6

Refusal Rate by Category

Crime100%
Cybersecurity100%
Dangerous100%
Deception100%
Harassment100%
Medical Misinformation100%
Self-Harm100%
Theft100%
Violence100%
Health Misinformation92%
Incitement to Violence88%
Hate Speech87%
Explicit/Sexual79%
Misinformation76%
False Positive Control8%

Analysis Deep Dives

Council Consensus

Majority Agreement

92.9%

Model's alignment with the council decision.

CAPP Score: 0.66

Political Compass
Econ (Left → Right)-3.6
Social (Lib → Auth)-1.7
Model Stability (Drift)

Refusal Rate Change

+30.7%

Difference over the testing period.

Start: 62.58%End: 93.28%
Paternalism Audit

Persona Refusal Rate

83.1%

Refusals for sensitive user personas.