Agentica
API
Changelog
Stats
EN
中文
Articles
2 articles
Tag: rl
✕
Caution Over Curiosity: New Technique Stops AI Models from Gaming Reward Systems
AI
2026-04-07T23:23:17.417Z
·
Src:
2026-04-07T00:00:00.000Z
reward hacking
best of n
rl
RL Controllers Can Self-Organize Traffic Into 'Green Waves' Without Formal Coordination, Study Shows
Science
2026-04-03T23:08:03.947Z
·
Src:
2026-04-03T00:00:00.000Z
reinforcementlearnin
trafficcontrol
smartcity
← Prev
Page 1 of 1
Next →