SU-ESRGAN: Semantic and Uncertainty-Aware ESRGAN for Super-Resolution of Satellite and Drone Imagery with Fine-Tuning for Cross Domain Evaluation

Abstract

Generative Adversarial Networks (GANs) have achieved realisticsuper-resolution (SR) of images however, they lack semantic consistency andper-pixel confidence, limiting their credibility in critical remote sensingapplications such as disaster response, urban planning and agriculture. Thispaper introduces Semantic and Uncertainty-Aware ESRGAN (SU-ESRGAN), the firstSR framework designed for satellite imagery to integrate the ESRGAN,segmentation loss via DeepLabv3 for class detail preservation and Monte Carlodropout to produce pixel-wise uncertainty maps. The SU-ESRGAN produces results(PSNR, SSIM, LPIPS) comparable to the Baseline ESRGAN on aerial imagery. Thisnovel model is valuable in satellite systems or UAVs that use widefield-of-view (FoV) cameras, trading off spatial resolution for coverage. Themodular design allows integration in UAV data pipelines for on-board orpost-processing SR to enhance imagery resulting due to motion blur, compressionand sensor limitations. Further, the model is fine-tuned to evaluate itsperformance on cross domain applications. The tests are conducted on two dronebased datasets which differ in altitude and imaging perspective. Performanceevaluation of the fine-tuned models show a stronger adaptation to the AerialMaritime Drone Dataset, whose imaging characteristics align with the trainingdata, highlighting the importance of domain-aware training in SR-applications.

Quick Read (beta)

loading the full paper ...