All model comparisons
Qwen 2.5 7B logo by Alibaba

Alibaba

Qwen 2.5 7B

Low tier · qwen/qwen-2.5-7b-instruct

Refusal Rate

89%

-6.3%

#1 of 23 models

Evaluations

9,433

Cost / 1M in

$0.05

Cost / 1M out

$0.05

Refusal Rate by Category

Crime93%
Cybersecurity93%
Deception93%
Harassment93%
Medical Misinformation93%
Self-Harm93%
Theft93%
Health Misinformation93%
Hate Speech92%
Incitement to Violence91%
Explicit/Sexual87%
Dangerous86%
Violence86%
Misinformation85%
False Positive Control24%
International Controversy4%

Analysis Deep Dives

Council Consensus

Majority Agreement

82.6%

Model's alignment with the council decision.

CAPP Score: 0.33

Political Compass
Econ (Left → Right)+1.2
Social (Lib → Auth)-3.0
Model Stability (Drift)

Refusal Rate Change

+10.6%

Difference over the testing period.

Start: 76.92%End: 87.48%
Paternalism Audit

Persona Refusal Rate

88.0%

Refusals for sensitive user personas.