In this paper, we propose a distributed solution to design a multi-hop ad hocnetwork where mobile relay nodes strategically determine their wirelesstransmission ranges based on a deep reinforcement learning approach. Weconsider scenarios where only a limited networking infrastructure is availablebut a large number of wireless mobile relay nodes are deployed in building amulti-hop ad hoc network to deliver source data to the destination. A mobilerelay node is considered as a decision-making agent that strategicallydetermines its transmission range in a way that maximizes network throughputwhile minimizing the corresponding transmission power consumption. Each relaynode collects information from its partial observations and learns itsenvironment through a sequence of experiences. Hence, the proposed solutionrequires only a minimal amount of information from the system. We show that theactions that the relay nodes take from its policy are determined as to activateor inactivate its transmission, i.e., only necessary relay nodes are activatedwith the maximum transmit power, and nonessential nodes are deactivated tominimize power consumption. Using extensive experiments, we confirm that theproposed solution builds a network with higher network performance than currentstate-of-the-art solutions in terms of system goodput and connectivity ratio.