Abstract
Multi-Agent Reinforcement Learning is becoming increasingly more important intimes of autonomous driving and other smart industrial applications.Simultaneously a promising new approach to Reinforcement Learning arises usingthe inherent properties of quantum mechanics, reducing the trainable parametersof a model significantly. However, gradient-based Multi-Agent QuantumReinforcement Learning methods often have to struggle with barren plateaus,holding them back from matching the performance of classical approaches. Webuild upon an existing approach for gradient free Quantum ReinforcementLearning and propose three genetic variations with Variational Quantum Circuitsfor Multi-Agent Reinforcement Learning using evolutionary optimization. Weevaluate our genetic variations in the Coin Game environment and also comparethem to classical approaches. We showed that our Variational Quantum Circuitapproaches perform significantly better compared to a neural network with asimilar amount of trainable parameters. Compared to the larger neural network,our approaches archive similar results using $97.88\%$ less parameters.