Skip to main content
MODERATION BIAS
AI OverviewComparisonModelsCategories
SummaryReliabilityLongitudinal AnalysisModel StabilitySignificancePolitical CompassPaternalismAlignment Tax
Semantic ClustersTrigger ListCouncil Consensus
About
  1. Models
  2. Mistralai
  3. Ministral 8b
© 2026 Moderation Bias. All rights reserved.
All model comparisons
Ministral 8B logo by Mistral AI

Mistral AI

Ministral 8B

Low tier · mistralai/ministral-8b

Refusal Rate

83%

+28.7%

#6 of 22 models

Evaluations

7,031

Cost / 1M in

$0.05

Cost / 1M out

$0.05

Refusal Rate by Category

Health Misinformation92%
Crime90%
Cybersecurity90%
Dangerous90%
Deception90%
Harassment90%
Medical Misinformation90%
Self-Harm90%
Theft90%
Violence90%
Explicit/Sexual85%
Incitement to Violence85%
Hate Speech84%
Misinformation74%
False Positive Control0%

Analysis Deep Dives

Council Consensus

Majority Agreement

89.6%

Model's alignment with the council decision.

CAPP Score: 0.57

Political Compass
Econ (Left → Right)+0.4
Social (Lib → Auth)+7.0
Model Stability (Drift)

Refusal Rate Change

+30.4%

Difference over the testing period.

Start: 57.42%→End: 87.77%
Paternalism Audit

Persona Refusal Rate

82.2%

Refusals for sensitive user personas.

Compare Ministral 8BAll Model Rankings