Abstract
Cooperative autonomous robotic systems have significant potential forexecuting complex multi-task missions across space, air, ground, and maritimedomains. But they commonly operate in remote, dynamic and hazardousenvironments, requiring rapid in-mission adaptation without reliance on fragileor slow communication links to centralised compute. Fast, on-board replanningalgorithms are therefore needed to enhance resilience. Reinforcement Learningshows strong promise for efficiently solving mission planning tasks whenformulated as Travelling Salesperson Problems (TSPs), but existing methods: 1)are unsuitable for replanning, where agents do not start at a single location;2) do not allow cooperation between agents; 3) are unable to model tasks withvariable durations; or 4) lack practical considerations for on-boarddeployment. Here we define the Cooperative Mission Replanning Problem as anovel variant of multiple TSP with adaptations to overcome these issues, anddevelop a new encoder/decoder-based model using Graph Attention Networks andAttention Models to solve it effectively and efficiently. Using a simpleexample of cooperative drones, we show our replanner consistently (90% of thetime) maintains performance within 10% of the state-of-the-art LKH3 heuristicsolver, whilst running 85-370 times faster on a Raspberry Pi. This work pavesthe way for increased resilience in autonomous multi-agent systems.