Abstract
Satellite communication is a key technology in our modern connected world.With increasingly complex hardware, one challenge is to efficiently configurelinks (connections) on a satellite transponder. Planning an optimal linkconfiguration is extremely complex and depends on many parameters and metrics.The optimal use of the limited resources, bandwidth and power of thetransponder is crucial. Such an optimization problem can be approximated usingmetaheuristic methods such as simulated annealing, but recent research resultsalso show that reinforcement learning can achieve comparable or even betterperformance in optimization methods. However, there have not yet been anystudies on link configuration on satellite transponders. In order to close thisresearch gap, a transponder environment was developed as part of this work. Forthis environment, the performance of the reinforcement learning algorithm PPOwas compared with the metaheuristic simulated annealing in two experiments. Theresults show that Simulated Annealing delivers better results for this staticproblem than the PPO algorithm, however, the research in turn also underlinesthe potential of reinforcement learning for optimization problems.