Partially Observable Reinforcement Learning for Intelligent Transportation Systems

Abstract

Intelligent Transportation Systems (ITS) have attracted the attention of researchers and the general public alike as a means to alleviate traffic congestion. Recently, the maturity of wireless technology has enabled a cost-efficient way to achieve ITS by detecting vehicles using Vehicle-to-Infrastructure (V2I) communications. Traditional ITS algorithms, in most cases, assume that every vehicle is observed, such as by a camera or a loop detector, but a V2I implementation would detect only those vehicles with wireless communications capability. We examine a family of transportation systems, which we will refer to as "Partially Detected Intelligent Transportation Systems". An algorithm that performs well under a low detection rate is highly desirable because the underlying wireless technologies, such as Dedicated Short Range Communications (DSRC), are being adopted only gradually. Artificial Intelligence (AI) techniques for Reinforcement Learning (RL) are suitable tools for finding such an algorithm because they can use varied inputs and do not require explicit analytic understanding or modeling of the underlying system dynamics. In this paper, we report an RL algorithm for partially observable ITS based on DSRC. The performance of this system is studied under different car flows, detection rates, and topologies of the road network. Our system is able to efficiently reduce the average waiting time of vehicles at an intersection, even with a low detection rate.
