Skip to main content
MODERATION BIAS
AI OverviewLeaderboardComparisonModelsCategoriesAnnotatePrompts
SummaryReliabilityLongitudinal AnalysisModel StabilitySignificanceAnnotator AgreementFamily AnalysisPolitical CompassPaternalismAlignment TaxOver-Refusal
Semantic ClustersTrigger ListCouncil Consensus
AboutMethodologyGlossary

Cite This Research

BibTeX
@misc{kandel2026moderationbias,
  title     = {Moderation Bias: A Systematic Benchmark of Content Moderation Across Large Language Models},
  author    = {Kandel, Jacob},
  year      = {2026},
  url       = {https://moderationbias.com},
  note      = {Open benchmark and dataset available at https://huggingface.co/datasets/jmk9494/moderation-bias-benchmark}
}
APA

Kandel, J. (2026). Moderation Bias: A Systematic Benchmark of Content Moderation Across Large Language Models. https://moderationbias.com

  1. Models
  2. Qwen

Alibaba

Models provided by Alibaba.

Qwen 2.5 72B logo

Qwen 2.5 72B

High

Qwen 2.5 7B logo

Qwen 2.5 7B

Low

Qwen Plus logo

Qwen Plus

Mid

Qwen 3 30B logo

Qwen 3 30B

Low

© 2026 Moderation Bias. All rights reserved.