Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization

Abstract

In recent years, Multi-Agent Reinforcement Learning (MARL) has foundapplication in numerous areas of science and industry, such as autonomousdriving, telecommunications, and global health. Nevertheless, MARL suffersfrom, for instance, an exponential growth of dimensions. Inherent properties ofquantum mechanics help to overcome these limitations, e.g., by significantlyreducing the number of trainable parameters. Previous studies have developed anapproach that uses gradient-free quantum Reinforcement Learning andevolutionary optimization for variational quantum circuits (VQCs) to reduce thetrainable parameters and avoid barren plateaus as well as vanishing gradients.This leads to a significantly better performance of VQCs compared to classicalneural networks with a similar number of trainable parameters and a reductionin the number of parameters by more than 97 \% compared to similarly goodneural networks. We extend an approach of K\"olle et al. by proposing aGate-Based, a Layer-Based, and a Prototype-Based concept to mutate andrecombine VQCs. Our results show the best performance for mutation-onlystrategies and the Gate-Based approach. In particular, we observe asignificantly better score, higher total and own collected coins, as well as asuperior own coin rate for the best agent when evaluated in the Coin Gameenvironment.

Quick Read (beta)

loading the full paper ...