Abstract
Reinforcement learning (RL) holds significant promise for adaptive trafficsignal control. While existing RL-based methods demonstrate effectiveness inreducing vehicular congestion, their predominant focus on vehicle-centricoptimization leaves pedestrian mobility needs and safety challengesunaddressed. In this paper, we present a deep RL framework for adaptive controlof eight traffic signals along a real-world urban corridor, jointly optimizingboth pedestrian and vehicular efficiency. Our single-agent policy is trainedusing real-world pedestrian and vehicle demand data derived from Wi-Fi logs andvideo analysis. The results demonstrate significant performance improvementsover traditional fixed-time signals, reducing average wait times per pedestrianand per vehicle by up to 67% and 52%, respectively, while simultaneouslydecreasing total accumulated wait times for both groups by up to 67% and 53%.Additionally, our results demonstrate generalization capabilities acrossvarying traffic demands, including conditions entirely unseen during training,validating RL's potential for developing transportation systems that serve allroad users.