Baidu's Famou Agent 2.0 Tops MLE-Bench, Sets New SOTA for Machine Learning Engineering AI

Available in: 中文
2026-04-10T23:05:35.076Z·1 min read
Baidu's cloud division has announced that its enterprise-grade algorithm optimization agent, Famou Agent 2.0, has once again topped the MLE-Bench benchmark, setting a new state-of-the-art score. Th...

Baidu's cloud division has announced that its enterprise-grade algorithm optimization agent, Famou Agent 2.0, has once again topped the MLE-Bench benchmark, setting a new state-of-the-art score. The formal release is planned for May 2026 at Baidu's Create AI Developer Conference.

What is MLE-Bench?

MLE-Bench is a machine learning engineering benchmark established by OpenAI, containing 75 real-world engineering problems sourced from Kaggle competitions. It tests an AI agent's ability to perform end-to-end ML engineering tasks including:

Famou Agent 2.0 Details

Significance

A Chinese AI company topping an OpenAI-created benchmark is significant for several reasons:

  1. Competitive landscape: Demonstrates that Chinese AI labs can compete with Western counterparts on rigorous engineering tasks
  2. Enterprise focus: The agent is designed for enterprise use, suggesting practical commercial applications
  3. AutoML evolution: Represents the next generation of automated machine learning, where AI agents can independently solve complex engineering problems
  4. Benchmark credibility: OpenAI's MLE-Bench is considered one of the most challenging tests of AI engineering capability

What's Next

Baidu will formally release Famou Agent 2.0 at Create 2026, its annual AI developer conference. The product is expected to compete with similar offerings from Western companies in the growing market for AI-powered development tools.

↗ Original source · 2026-04-10T00:00:00.000Z
← Previous: China's Cyber Regulators Crack Down on 7 Ticket Platforms: No Automated High-Frequency Ticket Snatching AllowedNext: US March CPI Rises 3.3% Year-over-Year, Energy Prices Surge 10.9% Monthly →
Comments0