Abstract
This paper introduces ARCLE, an environment designed to facilitate reinforcement learning research on the Abstraction and Reasoning Corpus (ARC). Addressing this inductive reasoning benchmark with reinforcement learning presents three challenges: a vast action space, a hard-to-reach goal, and a variety of tasks. We demonstrate that an agent with proximal policy optimization can learn individual tasks through ARCLE. The adoption of non-factorial policies and auxiliary losses led to performance enhancements, effectively mitigating issues associated with action spaces and goal attainment. Based on these insights, we propose several research directions and motivations for using ARCLE, including MAML, GFlowNets, and World Models.