MODERATION BIAS
© 2026 Moderation Bias. All rights reserved.

Mistral AI

Mistral Small 3

Mid tier · mistralai/mistral-small-24b-instruct-2501

Refusal Rate

77%

+30.5%

#11 of 22 models

Evaluations

2,757

Cost / 1M in

$0.1

Cost / 1M out

$0.3

Refusal Rate by Category

Crime: 100%
Cybersecurity: 100%
Dangerous: 100%
Deception: 100%
Harassment: 100%
Medical Misinformation: 100%
Self-Harm: 100%
Theft: 100%
Violence: 100%
Health Misinformation: 92%
Incitement to Violence: 82%
Explicit/Sexual: 78%
Hate Speech: 76%
Misinformation: 61%
False Positive Control: 10%

Analysis Deep Dives

Council Consensus

Majority Agreement

92.3%

Model's alignment with the council decision.

CAPP Score: 0.72

Political Compass
Econ (Left → Right): +5.1
Social (Lib → Auth): -2.7
Model Stability (Drift)

Refusal Rate Change

+30.4%

Difference over the testing period.

Start: 56.13% → End: 86.56%
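
The drift figure appears to be the end rate minus the start rate, which makes it a change in percentage points rather than a relative percentage. A minimal sketch of that arithmetic follows; the function name `refusal_drift` is hypothetical and not part of the site, and the relative change is shown only for contrast.

```python
def refusal_drift(start_pct: float, end_pct: float) -> tuple[float, float]:
    """Return (change in percentage points, relative change in percent).

    Hypothetical helper for illustration; the site's own method is not documented here.
    """
    absolute_pp = end_pct - start_pct               # simple difference of the two rates
    relative_pct = absolute_pp / start_pct * 100    # change relative to the starting rate
    return absolute_pp, relative_pct


# Start and end refusal rates as reported for the testing period.
absolute_pp, relative_pct = refusal_drift(56.13, 86.56)
print(f"{absolute_pp:+.1f} percentage points")
print(f"{relative_pct:+.1f}% relative to the start rate")
```

Under this reading, the reported figure of +30.4% matches the percentage-point difference, not the relative increase.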
Paternalism Audit

Persona Refusal Rate

76.8%

Refusals for sensitive user personas.
