Guided Upsampling Network for Real-Time Semantic Segmentation

  • 2018-07-19 14:40:14
  • Davide Mazzini
  • 10

Abstract

Semantic segmentation architectures are mainly built upon an encoder-decoderstructure. These models perform subsequent downsampling operations in theencoder. Since operations on high-resolution activation maps arecomputationally expensive, usually the decoder produces output segmentationmaps by upsampling with parameters-free operators like bilinear ornearest-neighbor. We propose a Neural Network named Guided Upsampling Networkwhich consists of a multiresolution architecture that jointly exploitshigh-resolution and large context information. Then we introduce a new modulenamed Guided Upsampling Module (GUM) that enriches upsampling operators byintroducing a learnable transformation for semantic maps. It can be pluggedinto any existing encoder-decoder architecture with little modifications andlow additional computation cost. We show with quantitative and qualitativeexperiments how our network benefits from the use of GUM module. Acomprehensive set of experiments on the publicly available Cityscapes datasetdemonstrates that Guided Upsampling Network can efficiently processhigh-resolution images in real-time while attaining state-of-the artperformances.

 

Quick Read (beta)

loading the full paper ...