AI-Generated Video Detection at Native Scale: 140K Videos, 15 Generators, New State-of-the-Art (ICLR 2026)

Available in: 中文

2026-04-07T23:23:14.905Z·2 min read

A new ICLR 2026 paper introduces the most comprehensive AI-generated video detection system to date, processing 140K+ videos from 15 generators without the information-destroying preprocessing that...

The Problem with Current Detection

Existing AI video detectors suffer from a critical flaw:

They resize and crop videos to fixed resolutions
This destroys high-frequency forgery traces — the very signals needed for detection
Preprocessing causes spatial distortion and significant information loss
Training datasets are outdated and fail to capture modern generators

It's like examining a forged painting through a blurry lens — you're destroying the evidence you need to see.

The Solution: Native-Scale Processing

The researchers built a detection framework on Qwen2.5-VL Vision Transformer that operates at:

Variable spatial resolutions — No fixed resizing
Variable temporal durations — No fixed cropping
Native scale — Preserving all high-frequency artifacts

The Dataset

Component	Specification
Videos	140,000+
Generators	15 (open-source + commercial)
Benchmark	Magic Videos (ultra-realistic synthetic content)
Architecture	Qwen2.5-VL Vision Transformer

Key Innovations

Preserving forgery artifacts — Working at native scale keeps subtle detection signals intact
Spatiotemporal inconsistencies — Captures timing and spatial errors that generators make
Comprehensive benchmark — Most extensive evaluation of AI video detection to date
Modern generators — Includes latest state-of-the-art video synthesis models

ICLR 2026 Camera Ready

The paper has been accepted and is in camera-ready form, indicating peer review validation.

Why It Matters

Misinformation defense — Essential for combating deepfakes and synthetic media
Content platforms — Social media companies need reliable AI video detection
Legal evidence — Forensic tools for verifying media authenticity
Arms race — Detection must keep pace with ever-improving generation models

↗ Original source · 2026-04-07T00:00:00.000Z

deepfake detection ai video iclr 2026 qwen2 5 vl vision transformer forensics synthetic media content authenticity misinformation

Comments0