MA-CDMR: An Intelligent Cross-domain Multicast Routing Method based on Multiagent Deep Reinforcement Learning in Multi-domain SDWN

  • 2024-09-11 14:52:05
  • Miao Ye, Hongwen Hu, Xiaoli Wang, Yuping Wang, Yong Wang, Wen Peng, Jihao Zheng
  • 0

Abstract

The cross-domain multicast routing problem in a software-defined wirelessnetwork with multiple controllers is a classic NP-hard optimization problem. Asthe network size increases, designing and implementing cross-domain multicastrouting paths in the network requires not only designing efficient solutionalgorithms to obtain the optimal cross-domain multicast tree but also ensuringthe timely and flexible acquisition and maintenance of global network stateinformation. However, existing solutions have a limited ability to sense thenetwork traffic state, affecting the quality of service of multicast services.In addition, these methods have difficulty adapting to the highly dynamicallychanging network states and have slow convergence speeds. To this end, thispaper aims to design and implement a multiagent deep reinforcement learningbased cross-domain multicast routing method for SDWN with multicontrollerdomains. First, a multicontroller communication mechanism and a multicast groupmanagement module are designed to transfer and synchronize network informationbetween different control domains of the SDWN, thus effectively managing thejoining and classification of members in the cross-domain multicast group.Second, a theoretical analysis and proof show that the optimal cross-domainmulticast tree includes an interdomain multicast tree and an intradomainmulticast tree. An agent is established for each controller, and a cooperationmechanism between multiple agents is designed to effectively optimizecross-domain multicast routing and ensure consistency and validity in therepresentation of network state information for cross-domain multicast routingdecisions. Third, a multiagent reinforcement learning-based method thatcombines online and offline training is designed to reduce the dependence onthe real-time environment and increase the convergence speed of multipleagents.

 

Quick Read (beta)

loading the full paper ...