Abstract
Autonomy is a hallmark of animal intelligence, enabling adaptive and intelligent behavior in complex environments without relying on external reward or task structure. Existing reinforcement learning approaches to exploration in reward-free environments, including a class of methods known as model-based intrinsic motivation, exhibit inconsistent exploration patterns and do not converge to an exploratory policy, thus failing to capture robust autonomous behaviors observed in animals. Moreover, systems neuroscience has largely overlooked the neural basis of autonomy, focusing instead on experimental paradigms where animals are motivated by external reward rather than engaging in ethological, naturalistic and task-independent behavior. To bridge these gaps, we introduce a novel model-based intrinsic drive explicitly designed after the principles of autonomous exploration in animals. Our method (3M-Progress) achieves animal-like exploration by tracking divergence between an online world model and a fixed prior learned from an ecological niche. To the best of our knowledge, we introduce the first autonomous embodied agent that predicts brain data entirely from self-supervised optimization of an intrinsic goal -- without any behavioral or neural training data -- demonstrating that 3M-Progress agents capture the explainable variance in behavioral patterns and whole-brain neural-glial dynamics recorded from autonomously behaving larval zebrafish, thereby providing the first goal-driven, population-level model of neural-glial computation. Our findings establish a computational framework connecting model-based intrinsic motivation to naturalistic behavior, providing a foundation for building artificial agents with animal-like autonomy.