Deep Tone Mapping Operator for High Dynamic Range Images

Abstract

A computationally fast tone mapping operator (TMO) that can quickly adapt toa wide spectrum of high dynamic range (HDR) content is quintessential forvisualization on varied low dynamic range (LDR) output devices such as moviescreens or standard displays. Existing TMOs can successfully tone-map only alimited number of HDR content and require an extensive parameter tuning toyield the best subjective-quality tone-mapped output. In this paper, we addressthis problem by proposing a fast, parameter-free and scene-adaptable deep tonemapping operator (DeepTMO) that yields a high-resolution and high-subjectivequality tone mapped output. Based on conditional generative adversarial network(cGAN), DeepTMO not only learns to adapt to vast scenic-content (e.g., outdoor,indoor, human, structures, etc.) but also tackles the HDR relatedscene-specific challenges such as contrast and brightness, while preserving thefine-grained details. We explore 4 possible combinations ofGenerator-Discriminator architectural designs to specifically address someprominent issues in HDR related deep-learning frameworks like blurring, tilingpatterns and saturation artifacts. By exploring different influences of scales,loss-functions and normalization layers under a cGAN setting, we conclude withadopting a multi-scale model for our task. To further leverage on thelarge-scale availability of unlabeled HDR data, we train our network bygenerating targets using an objective HDR quality metric, namely Tone MappingImage Quality Index (TMQI). We demonstrate results both quantitatively andqualitatively, and showcase that our DeepTMO generates high-resolution,high-quality output images over a large spectrum of real-world scenes. Finally,we evaluate the perceived quality of our results by conducting a pair-wisesubjective study which confirms the versatility of our method.

Quick Read (beta)

loading the full paper ...