Abstract
Human drivers exhibit individual preferences regarding driving style.Adapting autonomous vehicles to these preferences is essential for user trustand satisfaction. However, existing end-to-end driving approaches often rely onpredefined driving styles or require continuous user feedback for adaptation,limiting their ability to support dynamic, context-dependent preferences. Wepropose a novel approach using multi-objective reinforcement learning (MORL)with preference-driven optimization for end-to-end autonomous driving thatenables runtime adaptation to driving style preferences. Preferences areencoded as continuous weight vectors to modulate behavior along interpretablestyle objectives$\unicode{x2013}$including efficiency, comfort, speed, andaggressiveness$\unicode{x2013}$without requiring policy retraining. Oursingle-policy agent integrates vision-based perception in complex mixed-trafficscenarios and is evaluated in diverse urban environments using the CARLAsimulator. Experimental results demonstrate that the agent dynamically adaptsits driving behavior according to changing preferences while maintainingperformance in terms of collision avoidance and route completion.