SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation

  • 2025-08-01 07:35:48
  • Hengxing Cai, Jinhan Dong, Yijie Rao, Jingcheng Deng, Jingjun Tan, Qien Chen, Haidong Wang, Zhen Wang, Shiyu Huang, Agachai Sumalee, Renxin Zhong

Abstract

Unmanned Aerial Vehicle (UAV) Vision-Language Navigation (VLN) aims to enable agents to accurately localize targets and plan flight paths in complex environments based on natural language instructions, with broad applications in intelligent inspection, disaster rescue, and urban monitoring. Recent progress in Vision-Language Models (VLMs) has provided strong semantic understanding for this task, while reinforcement learning (RL) has emerged as a promising post-training strategy to further improve generalization. However, existing RL methods often suffer from inefficient use of training data, slow convergence, and insufficient consideration of the difficulty variation among training samples, which limits further performance improvement. To address these challenges, we propose Semantic-Aware Gaussian Curriculum Scheduling (SA-GCS), a novel training framework that systematically integrates Curriculum Learning (CL) into RL. SA-GCS employs a Semantic-Aware Difficulty Estimator (SA-DE) to quantify the complexity of training samples and a Gaussian Curriculum Scheduler (GCS) to dynamically adjust the sampling distribution, enabling a smooth progression from easy to challenging tasks. This design significantly improves training efficiency, accelerates convergence, and enhances overall model performance. Extensive experiments on the CityNav benchmark demonstrate that SA-GCS consistently outperforms strong baselines across all metrics, achieves faster and more stable convergence, and generalizes well across models of different scales, highlighting its robustness and scalability. The implementation of our approach is publicly available.
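
The abstract describes the Gaussian Curriculum Scheduler only at a high level, so the following is a minimal illustrative sketch and not the authors' implementation. It assumes each training sample already has a difficulty score in [0, 1] (standing in for the output of SA-DE), and it assumes the Gaussian mean simply equals the training progress so that the sampling mass drifts from easy to hard samples. The function names, the `sigma` default, and the linear schedule are all hypothetical.

```python
import math
import random


def gaussian_curriculum_weights(difficulties, progress, sigma=0.15):
    """Return normalized sampling weights, one per training sample.

    difficulties: floats in [0, 1], 0 = easiest (assumed to be precomputed).
    progress: training progress in [0, 1]; used here as the Gaussian mean
              (an assumption, the abstract does not give the schedule).
    sigma: width of the Gaussian (hypothetical default).
    """
    raw = [
        math.exp(-((d - progress) ** 2) / (2 * sigma ** 2))
        for d in difficulties
    ]
    total = sum(raw)
    return [w / total for w in raw]


def sample_batch(difficulties, progress, batch_size, rng, sigma=0.15):
    """Draw sample indices (with replacement) under the current curriculum."""
    weights = gaussian_curriculum_weights(difficulties, progress, sigma)
    return rng.choices(range(len(difficulties)), weights=weights, k=batch_size)


if __name__ == "__main__":
    difficulties = [0.0, 0.25, 0.5, 0.75, 1.0]
    rng = random.Random(0)
    for progress in (0.0, 0.5, 1.0):
        weights = gaussian_curriculum_weights(difficulties, progress)
        batch = sample_batch(difficulties, progress, batch_size=8, rng=rng)
        print(progress, [round(w, 3) for w in weights], batch)
```

Early in training (`progress` near 0) the easiest samples receive most of the probability mass, and late in training (`progress` near 1) the hardest ones do. A wider `sigma` keeps more of the difficulty range in play at each stage, which is one plausible way to obtain the smooth progression the abstract mentions.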

 
