Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models

  • 2023-01-28 23:50:50
  • Matthew J. Muckley, Alaaeldin El-Nouby, Karen Ullrich, Hervé Jégou, Jakob Verbeek
  • 0


Lossy image compression aims to represent images in as few bits as possiblewhile maintaining fidelity to the original. Theoretical results indicate thatoptimizing distortion metrics such as PSNR or MS-SSIM necessarily leads to adiscrepancy in the statistics of original images from those of reconstructions,in particular at low bitrates, often manifested by the blurring of thecompressed images. Previous work has leveraged adversarial discriminators toimprove statistical fidelity. Yet these binary discriminators adopted fromgenerative modeling tasks may not be ideal for image compression. In thispaper, we introduce a non-binary discriminator that is conditioned on quantizedlocal image representations obtained via VQ-VAE autoencoders. Our evaluationson the CLIC2020, DIV2K and Kodak datasets show that our discriminator is moreeffective for jointly optimizing distortion (e.g., PSNR) and statisticalfidelity (e.g., FID) than the state-of-the-art HiFiC model. On the CLIC2020test set, we obtain the same FID as HiFiC with 30-40% fewer bits.


Quick Read (beta)

loading the full paper ...