Back to feed
arXiv cs.AI·

Baba in Wonderland: Online Self-Supervised Dynamics Discovery for Executable World Models

Signal
72
Hype
15
In three linesAlice is an online executable world-model learning system that discovers environment dynamics without rule descriptions or reward signals. The agent induces transition laws from interaction alone, treating preservation conflicts as structural signal to refine hypothesis classes. Evaluation on Baba in Wonderland shows substantial improvement under prior misalignment.
Read source
Your take?
ReasoningReinforcement learningPapers

Summary generated by Claude — human-verified