Baidu's Famo Agent 2.0 Tops OpenAI's MLE-Bench, Set for Official Release at Create 2026

Available in: 中文
2026-04-10T13:11:15.449Z·1 min read
MLE-Bench is a benchmark established by OpenAI containing 75 real-world engineering challenges sourced from Kaggle competitions. It tests AI agents' ability to perform end-to-end machine learning e...

Baidu Cloud has announced that its enterprise-level autonomous ML optimization agent — Famo Agent 2.0 (百度伐谋Agent 2.0) — has once again claimed the top spot on the MLE-Bench leaderboard, setting a new SOTA score. The official version will be released at Baidu's Create 2026 AI Developer Conference in May.

What is MLE-Bench?

MLE-Bench is a benchmark established by OpenAI containing 75 real-world engineering challenges sourced from Kaggle competitions. It tests AI agents' ability to perform end-to-end machine learning engineering tasks, including:

Key Details

Significance

This achievement is notable for several reasons:

  1. Chinese AI competitiveness: Baidu's agent topping an OpenAI-designed benchmark demonstrates China's continued competitiveness in AI agent technology
  2. AutoML advancement: Famo Agent represents the cutting edge of automated machine learning, where AI agents autonomously solve complex engineering problems
  3. Enterprise readiness: Positioned as an enterprise-grade tool, not just a research demo
  4. Agentic AI trend: Part of the broader shift from standalone models to autonomous agent systems that combine LLMs with specialized tools

The MLE-Bench benchmark has become a key battleground for comparing AI agent capabilities across companies, with OpenAI, Google DeepMind, and now Baidu competing for the top position.

↗ Original source · 2026-04-10T00:00:00.000Z
← Previous: US March CPI Surges 0.9% MoM — Biggest Monthly Jump in Years as Energy Prices ExplodeNext: Hong Kong Monetary Authority Signals Very Limited Future Stablecoin Licenses After First Two Awards →
Comments0