TLE-Based A2C Agent for Terrestrial Coverage Orbital Path Planning

Abstract

The increasing congestion of Low Earth Orbit (LEO) poses persistentchallenges to the efficient deployment and safe operation of Earth observationsatellites. Mission planners must now account not only for mission-specificrequirements but also for the increasing collision risk with active satellitesand space debris. This work presents a reinforcement learning framework usingthe Advantage Actor-Critic (A2C) algorithm to optimize satellite orbitalparameters for precise terrestrial coverage within predefined surface radii. Byformulating the problem as a Markov Decision Process (MDP) within a customOpenAI Gymnasium environment, our method simulates orbital dynamics usingclassical Keplerian elements. The agent progressively learns to adjust five ofthe orbital parameters - semi-major axis, eccentricity, inclination, rightascension of ascending node, and the argument of perigee-to achieve targetedterrestrial coverage. Comparative evaluation against Proximal PolicyOptimization (PPO) demonstrates A2C's superior performance, achieving 5.8xhigher cumulative rewards (10.0 vs 9.263025) while converging in 31.5x fewertimesteps (2,000 vs 63,000). The A2C agent consistently meets missionobjectives across diverse target coordinates while maintaining computationalefficiency suitable for real-time mission planning applications. Keycontributions include: (1) a TLE-based orbital simulation environmentincorporating physics constraints, (2) validation of actor-critic methods'superiority over trust region approaches in continuous orbital control, and (3)demonstration of rapid convergence enabling adaptive satellite deployment. Thisapproach establishes reinforcement learning as a computationally efficientalternative for scalable and intelligent LEO mission planning.

Quick Read (beta)

loading the full paper ...