Abstract
Beneficial from Fully Convolutional Neural Networks (FCNs), saliencydetection methods have achieved promising results. However, it is stillchallenging to learn effective features for detecting salient objects incomplicated scenarios, in which i) non-salient regions may have "salient-like"appearance; ii) the salient objects may have different-looking regions. Tohandle these complex scenarios, we propose a Feature Guide Network whichexploits the nature of low-level and high-level features to i) make foregroundand background regions more distinct and suppress the non-salient regions whichhave "salient-like" appearance; ii) assign foreground label todifferent-looking salient regions. Furthermore, we utilize a Multi-scaleFeature Extraction Module (MFEM) for each level of abstraction to obtainmulti-scale contextual information. Finally, we design a loss function whichoutperforms the widely-used Cross-entropy loss. By adopting four differentpre-trained models as the backbone, we prove that our method is very generalwith respect to the choice of the backbone model. Experiments on fivechallenging datasets demonstrate that our method achieves the state-of-the-artperformance in terms of different evaluation metrics. Additionally, ourapproach contains fewer parameters than the existing ones, does not need anypost-processing, and runs fast at a real-time speed of 28 FPS when processing a480 x 480 image.