RACE: Fine-Grained AI Text Detection That Distinguishes Human-Written, LLM-Polished, and Humanized AI Content
A paper accepted at ACL 2026 introduces RACE (Rhetorical Analysis for Creator-Editor Modeling), a method that moves beyond binary "human vs. AI" classification to detect fine-grained types of LLM involvement in text. The distinction matters for policy enforcement, where LLM-polished human text and humanized LLM text carry different regulatory implications.
The Problem
Existing AI text detectors perform only binary (human vs. AI) or ternary classification. This is insufficient because two hybrid categories are increasingly common:
- LLM-polished human text — a human writes, AI improves the style. Common in professional writing.
- Humanized LLM text — AI generates, a human edits. Common in content farms.
These two categories carry different policy consequences, yet existing detectors can't tell them apart.
The Four Classes
- Pure human text — Written entirely by humans
- LLM-generated text — Written entirely by LLMs
- LLM-polished human text — Human foundation, AI style editing
- Humanized LLM text — AI generation, human editing
How RACE Works
- Creator modeling — Uses Rhetorical Structure Theory to build a logic graph representing the original writer's foundation
- Editor modeling — Extracts Elementary Discourse Unit-level features to identify editing style
- Outperforms 12 baselines with low false alarm rates
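The creator/editor split above can be illustrated with a toy sketch. This is not the paper's implementation: the RST logic graph is stood in for by document-level length statistics, EDU segmentation is approximated by sentence splitting, the editor features are a crude connective-frequency heuristic, and the decision thresholds are arbitrary. It only shows how separating "who created the structure" from "who edited the style" yields four classes instead of two.

```python
# Illustrative four-class sketch (NOT the RACE implementation):
# "creator" features model document-level structure, "editor" features
# model per-unit editing style; combining both yields four labels.
import re
from statistics import mean, pstdev

LABELS = ["human", "llm_generated", "llm_polished_human", "humanized_llm"]

def split_edus(text):
    # Crude stand-in for Elementary Discourse Unit segmentation:
    # split on sentence-final punctuation.
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]

def creator_features(edus):
    # Document-level "creator" signal: variability of unit lengths
    # (a toy proxy for the rhetorical-structure logic graph).
    lengths = [len(e.split()) for e in edus]
    return {"len_mean": mean(lengths), "len_std": pstdev(lengths)}

def editor_features(edus):
    # Per-unit "editor" signal: fraction of units containing
    # connectives typical of polished prose (toy heuristic).
    connectives = {"however", "moreover", "furthermore", "therefore"}
    hits = sum(
        any(w.lower().strip(",") in connectives for w in e.split())
        for e in edus
    )
    return {"connective_rate": hits / len(edus)}

def classify(text):
    # Toy decision rule; thresholds are arbitrary and for illustration only.
    edus = split_edus(text)
    c, e = creator_features(edus), editor_features(edus)
    uniform = c["len_std"] < 3.0           # uniform structure -> machine creator
    polished = e["connective_rate"] > 0.3  # heavy connectives -> machine editor
    if uniform and polished:
        return "llm_generated"
    if uniform:
        return "humanized_llm"      # machine creator, human editor
    if polished:
        return "llm_polished_human" # human creator, machine editor
    return "human"
```

The point of the structure, not the heuristics: the creator signal and the editor signal are computed independently, so each of the four combinations maps to a distinct label, which is exactly what a binary detector cannot express.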
Why It Matters
As AI writing tools become ubiquitous, regulation needs precision. A student using Grammarly-style polishing shouldn't be treated the same as someone submitting fully AI-generated content. RACE provides the granularity needed for fair enforcement.