Drifting MPC: Trajectory Optimization Without a Simulator Using Offline Dataset Learning

Available in: 中文
2026-04-07T16:05:05.880Z·1 min read
A new framework called Drifting MPC solves a fundamental problem in robotics and control: how to plan optimal trajectories when you don't have a simulator and can't model the system dynamics. The a...

A new framework called Drifting MPC solves a fundamental problem in robotics and control: how to plan optimal trajectories when you don't have a simulator and can't model the system dynamics. The answer: learn from existing trajectory data alone.

The Challenge

Traditional model predictive control (MPC) requires:

But what if:

Drifting MPC Approach

The framework combines:

  1. Drifting generative models — Learn a conditional distribution over trajectories from offline data
  2. Receding-horizon planning — Plan ahead while adapting as new information arrives
  3. Optimality-data tradeoff — Balance between generating optimal plans and staying close to demonstrated behavior

Key Properties

Applications

↗ Original source · 2026-04-07T00:00:00.000Z
← Previous: MC-CPO: Preventing AI Tutoring Systems from Reward Hacking by Enforcing Mastery-Based Safety ConstraintsNext: Don't Blink: Vision-Language Models Can Become More Accurate While Losing Visual Grounding — "Evidence Collapse" →
Comments0