Deep Reinforcement Learning for Distributed Dynamic Power Allocation in Wireless Networks

  • 2018-08-01 18:16:57
  • Yasar Sinan Nasir, Dongning Guo
  • 6

Abstract

This work demonstrates the potential of deep reinforcement learningtechniques for transmit power control in emerging and future wireless networks.Various techniques have been proposed in the literature to find near-optimalpower allocations, often by solving a challenging optimization problem. Most ofthese algorithms are not scalable to large networks in real-world scenariosbecause of their computational complexity and instantaneous cross-cell channelstate information (CSI) requirement. In this paper, a model-free distributeddynamic power allocation scheme is developed based on deep reinforcementlearning. Each transmitter collects CSI and quality of service (QoS)information from several neighbors and adapts its own transmit poweraccordingly. The objective is to maximize a weighted sum-rate utility function,which can be particularized to achieve maximum sum-rate or proportionally fairscheduling (with weights that are changing over time). Both random variationsand delays in the CSI are inherently addressed using deep Q-learning. For atypical network architecture, the proposed algorithm is shown to achievenear-optimal power allocation in real time based on delayed CSI measurementsavailable to the agents. This work indicates that deep reinforcement learningbased radio resource management can be very fast and deliver highly competitiveperformance, especially in practical scenarios where the system model isinaccurate and CSI delay is non-negligible.

 

Quick Read (beta)

loading the full paper ...