Abstract
The 3D scene graph models spatial relationships between objects, enabling theagent to efficiently navigate in a partially observable environment and predictthe location of the target object.This paper proposes an original frameworknamed SGN-CIRL (3D Scene Graph-Based Reinforcement Learning Navigation) formapless reinforcement learning-based robot navigation with learnablerepresentation of open-vocabulary 3D scene graph. To accelerate and stabilizethe training of reinforcement learning-based algorithms, the framework alsoemploys imitation learning and curriculum learning. The first one enables theagent to learn from demonstrations, while the second one structures thetraining process by gradually increasing task complexity from simple to moreadvanced scenarios. Numerical experiments conducted in the Isaac Simenvironment showed that using a 3D scene graph for reinforcement learningsignificantly increased the success rate in difficult navigation cases. Thecode is open-sourced and available at: https://github.com/Xisonik/Aloha\_graph.