Abstract
Super-resolution (SR), a classical inverse problem in computer vision, isinherently ill-posed, inducing a distribution of plausible solutions for everyinput. However, the desired result is not simply the expectation of thisdistribution, which is the blurry image obtained by minimizing pixelwise error,but rather the sample with the highest image quality. A variety of techniques,from perceptual metrics to adversarial losses, are employed to this end. Inthis work, we explore an alternative: utilizing powerful non-reference imagequality assessment (NR-IQA) models in the SR context. We begin with acomprehensive analysis of NR-IQA metrics on human-derived SR data, identifyingboth the accuracy (human alignment) and complementarity of different metrics.Then, we explore two methods of applying NR-IQA models to SR learning: (i)altering data sampling, by building on an existing multi-ground-truth SRframework, and (ii) directly optimizing a differentiable quality score. Ourresults demonstrate a more human-centric perception-distortion tradeoff,focusing less on non-perceptual pixel-wise distortion, instead improving thebalance between perceptual fidelity and human-tuned NR-IQA measures.