Abstract
Augmented reality (AR) offers immersive interaction but remains inaccessiblefor users with motor impairments or limited dexterity due to reliance onprecise input methods. This study proposes a gesture-based interaction systemfor AR environments, leveraging deep learning to recognize hand and bodygestures from wearable sensors and cameras, adapting interfaces to usercapabilities. The system employs vision transformers (ViTs), temporalconvolutional networks (TCNs), and graph attention networks (GATs) for gestureprocessing, with federated learning ensuring privacy-preserving model trainingacross diverse users. Reinforcement learning optimizes interface elements likemenu layouts and interaction modes. Experiments demonstrate a 20% improvementin task completion efficiency and a 25% increase in user satisfaction formotor-impaired users compared to baseline AR systems. This approach enhances ARaccessibility and scalability. Keywords: Deep learning, Federated learning,Gesture recognition, Augmented reality, Accessibility, Human-computerinteraction