VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Abstract

Autonomous driving requires a comprehensive understanding of the surroundingenvironment for reliable trajectory planning. Previous works rely on denserasterized scene representation (e.g., agent occupancy and semantic map) toperform planning, which is computationally intensive and misses theinstance-level structure information. In this paper, we propose VAD, anend-to-end vectorized paradigm for autonomous driving, which models the drivingscene as fully vectorized representation. The proposed vectorized paradigm hastwo significant advantages. On one hand, VAD exploits the vectorized agentmotion and map elements as explicit instance-level planning constraints whicheffectively improves planning safety. On the other hand, VAD runs much fasterthan previous end-to-end planning methods by getting rid ofcomputation-intensive rasterized representation and hand-designedpost-processing steps. VAD achieves state-of-the-art end-to-end planningperformance on the nuScenes dataset, outperforming the previous best method bya large margin (reducing the average collision rate by 48.4%). Besides, VADgreatly improves the inference speed (up to 9.3x), which is critical for thereal-world deployment of an autonomous driving system. Code and models will bereleased for facilitating future research.

Quick Read (beta)

loading the full paper ...