Abstract
Quantum Support Vector Machines face scalability challenges due to high-dimensional quantum states and hardware limitations. We propose an embedding-aware quantum-classical pipeline combining class-balanced k-means distillation with pretrained Vision Transformer embeddings. Our key finding: ViT embeddings uniquely enable quantum advantage, achieving up to 8.02% accuracy improvements over classical SVMs on Fashion-MNIST and 4.42% on MNIST, while CNN features show performance degradation. Using 16-qubit tensor network simulation via cuTensorNet, we provide the first systematic evidence that quantum kernel advantage depends critically on embedding choice, revealing fundamental synergy between transformer attention and quantum feature spaces. This provides a practical pathway for scalable quantum machine learning that leverages modern neural architectures.
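
As a rough illustration of the class-balanced k-means distillation step named above, the following sketch runs a plain Lloyd's k-means separately within each class and keeps the same number of centroids per class. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `kmeans` and `class_balanced_distill`, the `per_class` parameter, and the choice to keep centroids (rather than the real samples nearest to them) are all hypothetical, and the inputs are assumed to be precomputed embedding vectors.

```python
import random
from collections import defaultdict


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on a list of equal-length float vectors.

    Returns a list of k centroids. Hypothetical helper for illustration.
    """
    rng = random.Random(seed)
    # Initialise centroids from k distinct input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest centroid (squared Euclidean distance).
            nearest = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            clusters[nearest].append(p)
        for j, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def class_balanced_distill(embeddings, labels, per_class, seed=0):
    """Reduce a dataset to at most `per_class` k-means centroids per class.

    Assumption: the distilled set consists of centroids, each labelled
    with the class it was computed from.
    """
    by_class = defaultdict(list)
    for x, y in zip(embeddings, labels):
        by_class[y].append(x)

    distilled_x, distilled_y = [], []
    for y in sorted(by_class):
        k = min(per_class, len(by_class[y]))  # cannot ask for more centroids than points
        for centroid in kmeans(by_class[y], k, seed=seed):
            distilled_x.append(centroid)
            distilled_y.append(y)
    return distilled_x, distilled_y
```

Because clustering is done per class, an imbalanced input (say 50 samples of one class and 10 of another) still yields the same number of representatives for each class, which keeps the kernel matrix passed to the downstream SVM small and balanced.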