Del-Net: A Single-Stage Network for Mobile Camera ISP

Abstract

The quality of images captured by smartphones is an important specificationsince smartphones are becoming ubiquitous as primary capturing devices. Thetraditional image signal processing (ISP) pipeline in a smartphone cameraconsists of several image processing steps performed sequentially toreconstruct a high quality sRGB image from the raw sensor data. These stepsconsist of demosaicing, denoising, white balancing, gamma correction, colourenhancement, etc. Since each of them are performed sequentially usinghand-crafted algorithms, the residual error from each processing moduleaccumulates in the final reconstructed signal. Thus, the traditional ISPpipeline has limited reconstruction quality in terms of generalizability acrossdifferent lighting conditions and associated noise levels while capturing theimage. Deep learning methods using convolutional neural networks (CNN) havebecome popular in solving many image-related tasks such as image denoising,contrast enhancement, super resolution, deblurring, etc. Furthermore, recentapproaches for the RAW to sRGB conversion using deep learning methods have alsobeen published, however, their immense complexity in terms of their memoryrequirement and number of Mult-Adds make them unsuitable for mobile camera ISP.In this paper we propose DelNet - a single end-to-end deep learning model - tolearn the entire ISP pipeline within reasonable complexity for smartphonedeployment. Del-Net is a multi-scale architecture that uses spatial and channelattention to capture global features like colour, as well as a series oflightweight modified residual attention blocks to help with denoising. Forvalidation, we provide results to show the proposed Del-Net achieves compellingreconstruction quality.

Quick Read (beta)

loading the full paper ...