xAI

Grok 3 Mini

Low tier · x-ai/grok-3-mini

Refusal Rate

74%

+0.2%

#12 of 23 models

Evaluations

19,866

Cost / 1M in

$0.3

Cost / 1M out

$0.5
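
As a rough illustration of how the two prices above combine, the sketch below estimates the cost of a workload. It assumes the "1M" figures refer to one million tokens and that prices are in US dollars; the function name and the example token counts are hypothetical and not part of the dashboard.

```python
# Prices as listed on this page; assumed to be USD per one million tokens.
INPUT_USD_PER_MILLION = 0.3
OUTPUT_USD_PER_MILLION = 0.5


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate total cost in USD for a given number of input and output tokens."""
    input_cost = input_tokens * INPUT_USD_PER_MILLION
    output_cost = output_tokens * OUTPUT_USD_PER_MILLION
    return (input_cost + output_cost) / 1_000_000


# Hypothetical workload: 2M input tokens and 1M output tokens.
example_cost = estimate_cost_usd(2_000_000, 1_000_000)
print(f"Estimated cost: ${example_cost:.2f}")
```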

Refusal Rate by Category

Crime: 90%
Cybersecurity: 90%
Dangerous: 90%
Deception: 90%
Harassment: 90%
Medical Misinformation: 90%
Self-Harm: 90%
Theft: 90%
Violence: 90%
Explicit/Sexual: 87%
Health Misinformation: 86%
Incitement to Violence: 79%
Hate Speech: 77%
Misinformation: 46%
False Positive Control: 1%
International Controversy: 0%

Analysis Deep Dives

Council Consensus

Majority Agreement

89.3%

Model's alignment with the council decision.

CAPP Score: 0.54

Political Compass
Econ (Left → Right): -2.4
Social (Lib → Auth): +7.3
Model Stability (Drift)

Refusal Rate Change

+15.2%

Difference in refusal rate between the start and end of the testing period, in percentage points.

Start: 76.34% · End: 91.52%
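
The drift figure appears to be the end rate minus the start rate, rounded to one decimal place. The sketch below reproduces that arithmetic from the two values shown; the function name is hypothetical, and the rounding rule is an assumption inferred from the displayed numbers rather than a documented method.

```python
def refusal_rate_change(start_pct: float, end_pct: float) -> float:
    """Return the change in refusal rate in percentage points, rounded to one decimal."""
    # Assumption: drift = end - start, rounded to one decimal for display.
    return round(end_pct - start_pct, 1)


change = refusal_rate_change(76.34, 91.52)
print(f"{change:+.1f}")  # +15.2
```

Note that this is an absolute difference in percentage points, not a relative change against the starting rate.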
Paternalism Audit

Persona Refusal Rate

74.5%

Refusals for sensitive user personas.