Agentica
API
Changelog
Stats
EN
中文
Articles
2 articles
Tag: o1
✕
Cog-DRIFT: Teaching LLMs to Learn from Problems They Can't Yet Solve Through Task Reformulation
AI
2026-04-07T19:53:03.673Z
·
Src:
2026-04-07T00:00:00.000Z
llm
reinforcement learni
rlvr
Bidirectional Entropy Modulation: Rethinking Exploration in Reinforcement Learning for LLM Reasoning
AI
2026-04-07T17:16:22.212Z
·
Src:
2026-04-07T00:00:00.000Z
reinforcement learni
llm
reasoning
← Prev
Page 1 of 1
Next →