Skip to main content
MODERATION BIAS
AI OverviewLeaderboardComparisonModelsCategoriesAnnotatePrompts
SummaryReliabilityLongitudinal AnalysisModel StabilitySignificanceAnnotator AgreementFamily AnalysisPolitical CompassPaternalismAlignment TaxOver-Refusal
Semantic ClustersTrigger ListCouncil Consensus
AboutMethodologyGlossary

Cite This Research

BibTeX
@misc{kandel2026moderationbias,
  title     = {Moderation Bias: A Systematic Benchmark of Content Moderation Across Large Language Models},
  author    = {Kandel, Jacob},
  year      = {2026},
  url       = {https://moderationbias.com},
  note      = {Open benchmark and dataset available at https://huggingface.co/datasets/jmk9494/moderation-bias-benchmark}
}
APA

Kandel, J. (2026). Moderation Bias: A Systematic Benchmark of Content Moderation Across Large Language Models. https://moderationbias.com

  1. Models

AI Models

Browse the LLMs included in our censorship and moderation analysis. Select a model to view its dedicated restrictiveness profile.

GPT-4o logo

GPT-4o

OpenAI

GPT-4o Mini logo

GPT-4o Mini

OpenAI

Claude 3.5 Sonnet logo

Claude 3.5 Sonnet

Anthropic

Claude 3 Haiku logo

Claude 3 Haiku

Anthropic

Gemini 2.0 Flash logo

Gemini 2.0 Flash

Google

DeepSeek V3 logo

DeepSeek V3

DeepSeek

Qwen 2.5 72B logo

Qwen 2.5 72B

Alibaba

Qwen 2.5 7B logo

Qwen 2.5 7B

Alibaba

Yi Lightning logo

Yi Lightning

01.AI

Mistral Large logo

Mistral Large

Mistral AI

Mistral Small 3.1 logo

Mistral Small 3.1

Mistral AI

Gemini 2.5 Pro logo

Gemini 2.5 Pro

Google

Gemini 2.0 Flash Lite logo

Gemini 2.0 Flash Lite

Google

Claude 3.5 Haiku logo

Claude 3.5 Haiku

Anthropic

Mistral Small 3 logo

Mistral Small 3

Mistral AI

Ministral 8B logo

Ministral 8B

Mistral AI

Qwen Plus logo

Qwen Plus

Alibaba

Grok 3 logo

Grok 3

Unknown

Grok 3 Mini logo

Grok 3 Mini

Unknown

o3 Mini logo

o3 Mini

OpenAI

DeepSeek R1 logo

DeepSeek R1

DeepSeek

Llama 3.3 70B logo

Llama 3.3 70B

Meta

Mistral Small 3.1 logo

Mistral Small 3.1

Mistral AI

Gemma 3 27B logo

Gemma 3 27B

Google

Hermes 3 405B logo

Hermes 3 405B

Meta

Dolphin Mistral 24B logo

Dolphin Mistral 24B

Mistral AI

Llama 4 Scout logo

Llama 4 Scout

Meta

Llama 4 Maverick logo

Llama 4 Maverick

Meta

Gemini 3.1 Flash logo

Gemini 3.1 Flash

Google

GPT-4.1 Mini logo

GPT-4.1 Mini

OpenAI

Qwen 3 30B logo

Qwen 3 30B

Alibaba

© 2026 Moderation Bias. All rights reserved.