© 2026 Moderation Bias. All rights reserved.

xAI

Grok 3 Mini

Low tier · x-ai/grok-3-mini

Refusal Rate

70%

+0.2%

#13 of 22 models

Evaluations

15,856

Cost / 1M in

$0.3

Cost / 1M out

$0.5

Refusal Rate by Category

Crime: 88%
Cybersecurity: 88%
Dangerous: 88%
Deception: 88%
Harassment: 88%
Medical Misinformation: 88%
Self-Harm: 88%
Theft: 88%
Violence: 88%
Explicit/Sexual: 84%
Health Misinformation: 83%
Incitement to Violence: 75%
Hate Speech: 73%
Misinformation: 38%
False Positive Control: 1%

Analysis Deep Dives

Council Consensus

Majority Agreement

89.3%

Model's alignment with the council decision.

CAPP Score: 0.54

Political Compass

Econ (Left → Right): -5.8
Social (Lib → Auth): +7.0

Model Stability (Drift)

Refusal Rate Change

+15.2 percentage points

Difference over the testing period.

Start: 76.34% → End: 91.52%
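
The drift figure appears to be the plain difference between the end and start refusal rates, not a relative change. A minimal sketch of that arithmetic, using the two rates shown above. The function names are hypothetical and are not taken from the site's own tooling:

```python
# Start and end refusal rates for the testing period, as listed on the page.
start_rate = 76.34  # percent
end_rate = 91.52    # percent


def drift_points(start: float, end: float) -> float:
    """Absolute change in refusal rate, in percentage points."""
    return end - start


def relative_change(start: float, end: float) -> float:
    """Change relative to the starting rate, in percent (assumed alternative reading)."""
    return (end - start) / start * 100


print(f"{drift_points(start_rate, end_rate):+.1f} percentage points")  # +15.2 percentage points
print(f"{relative_change(start_rate, end_rate):+.1f}% relative")       # +19.9% relative
```

The absolute difference matches the reported value, which suggests the figure is measured in percentage points.
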
Paternalism Audit

Persona Refusal Rate

74.5%

Refusals for sensitive user personas.
