<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Moderation Bias - LLM Censorship Tracker</title>
    <link>https://www.moderationbias.com</link>
    <description>Tracking the political and social biases of Llama-3, GPT-4, Claude, and other AI models via live, automated red-teaming audits.</description>
    <language>en-us</language>
    <lastBuildDate>Wed, 15 Apr 2026 00:00:00 GMT</lastBuildDate>
    <atom:link href="https://www.moderationbias.com/feed.xml" rel="self" type="application/rss+xml" />

    <item>
      <title>Latest Audit Data: 4/15/2026</title>
      <link>https://www.moderationbias.com/analysis/summary</link>
      <guid>https://www.moderationbias.com/analysis/summary#1776211200000</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>We just published a fresh audit of AI content moderation thresholds across 26 models. Check out the latest refusal rates and policy alignments.</description>
    </item>

    <item>
      <title>Executive Summary — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/summary</link>
      <guid>https://www.moderationbias.com/analysis/summary</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>High-level summary of LLM censorship, refusal rates, and key findings across all audited models.</description>
    </item>
    <item>
      <title>Model Overview — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/overview</link>
      <guid>https://www.moderationbias.com/analysis/overview</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Refusal rate heatmaps and radar charts visualising how each model handles sensitive categories.</description>
    </item>
    <item>
      <title>Model Drift & Stability — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/drift</link>
      <guid>https://www.moderationbias.com/analysis/drift</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Tracking how LLM censorship behaviours change over time — are models getting more or less restrictive?</description>
    </item>
    <item>
      <title>Council Consensus — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/consensus</link>
      <guid>https://www.moderationbias.com/analysis/consensus</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Do AI models agree with each other on what is safe? Explore inter-model agreement rates.</description>
    </item>
    <item>
      <title>AI Political Compass — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/political</link>
      <guid>https://www.moderationbias.com/analysis/political</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Mapping the structural political biases of LLMs across economic and social axes.</description>
    </item>
    <item>
      <title>Model Reliability — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/reliability</link>
      <guid>https://www.moderationbias.com/analysis/reliability</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Internal consistency and self-agreement analysis — how reliable is each model moderation?</description>
    </item>
    <item>
      <title>Longitudinal Analysis — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/longitudinal</link>
      <guid>https://www.moderationbias.com/analysis/longitudinal</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Interactive timeline tracking the evolution of AI content moderation policies over months.</description>
    </item>
    <item>
      <title>Alignment Tax — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/alignment</link>
      <guid>https://www.moderationbias.com/analysis/alignment</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>The Pareto frontier: which models give the best helpfulness-to-safety tradeoff at the lowest cost?</description>
    </item>
    <item>
      <title>Semantic Clusters — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/clusters</link>
      <guid>https://www.moderationbias.com/analysis/clusters</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Explore visually grouped refused prompts by semantic similarity to find hidden moderation patterns.</description>
    </item>
    <item>
      <title>Statistical Significance — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/significance</link>
      <guid>https://www.moderationbias.com/analysis/significance</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Pairwise McNemar tests separating signal from noise in model refusal rate differences.</description>
    </item>
    <item>
      <title>Censorship Triggers — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/triggers</link>
      <guid>https://www.moderationbias.com/analysis/triggers</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Which specific words and linguistic patterns automatically trigger AI content refusals?</description>
    </item>
    <item>
      <title>Paternalism in AI — Moderation Bias</title>
      <link>https://www.moderationbias.com/analysis/paternalism</link>
      <guid>https://www.moderationbias.com/analysis/paternalism</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>Do AI models gatekeep differently based on who they think is asking? Persona-based refusal analysis.</description>
    </item>
    <item>
      <title>Model Profile: GPT-4o</title>
      <link>https://www.moderationbias.com/models/openai/gpt-4o</link>
      <guid>https://www.moderationbias.com/models/openai/gpt-4o</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for GPT-4o (OpenAI). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: GPT-4o Mini</title>
      <link>https://www.moderationbias.com/models/openai/gpt-4o-mini</link>
      <guid>https://www.moderationbias.com/models/openai/gpt-4o-mini</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for GPT-4o Mini (OpenAI). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: Claude 3.5 Sonnet</title>
      <link>https://www.moderationbias.com/models/anthropic/claude-3.5-sonnet</link>
      <guid>https://www.moderationbias.com/models/anthropic/claude-3.5-sonnet</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for Claude 3.5 Sonnet (Anthropic). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: Claude 3 Haiku</title>
      <link>https://www.moderationbias.com/models/anthropic/claude-3-haiku</link>
      <guid>https://www.moderationbias.com/models/anthropic/claude-3-haiku</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for Claude 3 Haiku (Anthropic). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: Gemini 2.0 Flash</title>
      <link>https://www.moderationbias.com/models/google/gemini-2.0-flash-001</link>
      <guid>https://www.moderationbias.com/models/google/gemini-2.0-flash-001</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for Gemini 2.0 Flash (Google). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: DeepSeek V3</title>
      <link>https://www.moderationbias.com/models/deepseek/deepseek-chat</link>
      <guid>https://www.moderationbias.com/models/deepseek/deepseek-chat</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for DeepSeek V3 (DeepSeek). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: Qwen 2.5 72B</title>
      <link>https://www.moderationbias.com/models/qwen/qwen-2.5-72b-instruct</link>
      <guid>https://www.moderationbias.com/models/qwen/qwen-2.5-72b-instruct</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for Qwen 2.5 72B (Alibaba). Does it restrict political or controversial speech?</description>
    </item>
    <item>
      <title>Model Profile: Qwen 2.5 7B</title>
      <link>https://www.moderationbias.com/models/qwen/qwen-2.5-7b-instruct</link>
      <guid>https://www.moderationbias.com/models/qwen/qwen-2.5-7b-instruct</guid>
      <pubDate>Wed, 15 Apr 2026 00:00:00 GMT</pubDate>
      <description>View the refusal rate, category breakdown, and behavioral analysis for Qwen 2.5 7B (Alibaba). Does it restrict political or controversial speech?</description>
    </item>
  </channel>
</rss>