ActionParty: First Multi-Agent Video World Model Controls Seven Players Simultaneously

2026-04-04T00:40:36.321Z·1 min read
Researchers have introduced ActionParty, the first video world model capable of controlling up to seven players simultaneously across 46 diverse environments, tackling a fundamental limitation of e...

Researchers have introduced ActionParty, the first video world model capable of controlling up to seven players simultaneously across 46 diverse environments, tackling a fundamental limitation of existing video diffusion models.

The Problem

Current "world model" video systems are largely restricted to single-agent settings. They fail to control multiple agents simultaneously because of an action binding problem — the model struggles to associate specific actions with their corresponding subjects.

The Solution

ActionParty introduces subject state tokens — persistent latent variables that capture the state of each subject in the scene. Combined with a spatial biasing mechanism, it disentangles:

Key Results

MetricAchievement
Max simultaneous players7
Test environments46 (Melting Pot benchmark)
Action-following accuracySignificant improvement
Identity consistencyRobust through interactions
Autoregressive trackingComplex interactions handled

How It Works

  1. Each subject gets persistent state tokens in latent space
  2. Spatial biasing mechanism routes actions to correct subjects
  3. Video diffusion generates frames respecting all subject states simultaneously
  4. Autoregressive tracking maintains identity through interactions

Implications

Authors

Alexander Pondaven, Ziyi Wu, Igor Gilitschenski, Philip Torr, Sergey Tulyakov, Fabio Pizzati, Aliaksandr Siarohin.

Paper: arXiv:2604.02330 | Project: action-party.github.io

↗ Original source · 2026-04-03T00:00:00.000Z
← Previous: EFF Sues FAA Over 21-Month Nationwide Drone Ban Near ICE Vehicles, Calling It First Amendment ViolationNext: Grounded Token Initialization: Fixing a Key Bottleneck in Extending Language Models with New Vocabulary →
Comments0