Abstract
The confluence of the advancement of Autonomous Vehicles (AVs) and thematurity of Vehicle-to-Everything (V2X) communication has enabled thecapability of cooperative connected and automated vehicles (CAVs). Building ontop of cooperative perception, this paper explores the feasibility andeffectiveness of cooperative motion prediction. Our method, CMP, takes LiDARsignals as input to enhance tracking and prediction capabilities. Unlikeprevious work that focuses separately on either cooperative perception ormotion prediction, our framework, to the best of our knowledge, is the first toaddress the unified problem where CAVs share information in both perception andprediction modules. Incorporated into our design is the unique capability totolerate realistic V2X bandwidth limitations and transmission delays, whiledealing with bulky perception representations. We also propose a predictionaggregation module, which unifies the predictions obtained by different CAVsand generates the final prediction. Through extensive experiments and ablationstudies, we demonstrate the effectiveness of our method in cooperativeperception, tracking, and motion prediction tasks. In particular, CMP reducesthe average prediction error by 17.2\% with fewer missing detections comparedwith the no cooperation setting. Our work marks a significant step forward inthe cooperative capabilities of CAVs, showcasing enhanced performance incomplex scenarios.