MODERATION BIAS
© 2026 Moderation Bias. All rights reserved.

Alibaba

Qwen Plus

Mid tier · qwen/qwen-plus

Refusal Rate

83%

+30.1%

#5 of 22 models

Evaluations

2,757

Cost / 1M in

$0.2

Cost / 1M out

$0.6

Refusal Rate by Category

Crime: 100%
Cybersecurity: 100%
Dangerous: 100%
Deception: 100%
Harassment: 100%
Medical Misinformation: 100%
Self-Harm: 100%
Theft: 100%
Violence: 100%
Health Misinformation: 92%
Incitement to Violence: 88%
Hate Speech: 87%
Explicit/Sexual: 79%
Misinformation: 76%
False Positive Control: 8%

Analysis Deep Dives

Council Consensus

Majority Agreement

90.8%

Model's alignment with the council decision.

CAPP Score: 0.60

Political Compass

Econ (Left → Right): +7.6
Social (Lib → Auth): -0.4

Model Stability (Drift)

Refusal Rate Change

+30.7%

Change in refusal rate between the start and end of the testing period, in percentage points.

Start: 62.58% → End: 93.28%

Paternalism Audit
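
The drift figure above is consistent with a simple subtraction of the start rate from the end rate. The sketch below reproduces that arithmetic; it assumes the dashboard reports an absolute difference in percentage points (the page does not state its formula), and `refusal_rate_drift` is a hypothetical helper name, not part of any published tooling.

```python
def refusal_rate_drift(start_pct: float, end_pct: float) -> float:
    """Return the absolute change in refusal rate, in percentage points.

    Assumption: drift = end - start, which matches the figures shown
    on this page. The site's actual method is not documented here.
    """
    # Round to two decimals to avoid floating-point noise in the result.
    return round(end_pct - start_pct, 2)


# Start and end values as listed for Qwen Plus.
drift = refusal_rate_drift(62.58, 93.28)
print(f"{drift:+.1f} percentage points")  # prints "+30.7 percentage points"
```

Read this way, the "+30.7%" label is a difference between two percentages rather than a relative increase over the starting rate.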

Persona Refusal Rate

83.1%

Refusals for sensitive user personas.
