Abstract
This paper presents a novel Lyapunov-Based Quantum Reinforcement Learning(LQRL) framework that integrates quantum policy optimization with Lyapunovstability analysis for continuous-time vehicle control. The proposed approachcombines the representational power of variational quantum circuits (VQCs) witha stability-aware policy gradient mechanism to ensure asymptotic convergenceand safe decision-making under dynamic environments. The vehicle longitudinalcontrol problem was formulated as a continuous-state reinforcement learningtask, where the quantum policy network generates control actions subject toLyapunov stability constraints. Simulation experiments were conducted in aclosed-loop adaptive cruise control scenario using a quantum-inspired policytrained under stability feedback. The results demonstrate that the LQRLframework successfully embeds Lyapunov stability verification into quantumpolicy learning, enabling interpretable and stability-aware controlperformance. Although transient overshoot and Lyapunov divergence were observedunder aggressive acceleration, the system maintained bounded state evolution,validating the feasibility of integrating safety guarantees within quantumreinforcement learning architectures. The proposed framework provides afoundational step toward provably safe quantum control in autonomous systemsand hybrid quantum-classical optimization domains.