Fast Point R-CNN - Paper Detail

Abstract

We present a unified, efficient and effective framework for point-cloud based 3D object detection. Our two-stage approach utilizes both voxel representation and raw point cloud data to exploit their respective advantages. The first-stage network, with voxel representation as input, consists only of light convolutional operations and produces a small number of high-quality initial predictions. The coordinates and indexed convolutional features of each point in an initial prediction are fused with an attention mechanism, preserving both accurate localization and context information. The second stage works on the interior points with their fused features to further refine the prediction. Our method is evaluated on the KITTI dataset, in terms of both 3D and Bird's Eye View (BEV) detection, and achieves state-of-the-art performance at a detection rate of 15 FPS.
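
The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`points_in_box`, `fuse_point`), the sigmoid gating form, the weight layout, and the axis-aligned box test are all assumptions made for illustration. Real 3D detection boxes are oriented, and the abstract does not specify the exact form of the attention mechanism.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def points_in_box(points, box_min, box_max):
    """Return indices of points inside an axis-aligned box.

    Assumption: an axis-aligned box stands in for an oriented 3D box.
    """
    return [
        i for i, p in enumerate(points)
        if all(lo <= c <= hi for c, lo, hi in zip(p, box_min, box_max))
    ]


def fuse_point(coord, feat, weights, bias):
    """Gate the concatenated [coord, feat] vector channel by channel.

    Assumption: attention is modeled as one sigmoid gate per channel,
    computed from a linear function of the joint vector. `weights` is a
    hypothetical square matrix with one row per channel.
    """
    joint = list(coord) + list(feat)
    gates = [
        sigmoid(sum(w * v for w, v in zip(row, joint)) + b)
        for row, b in zip(weights, bias)
    ]
    return [g * v for g, v in zip(gates, joint)]


# Toy data: two points, each with a 2-channel feature looked up from
# the first-stage feature map (values here are made up).
points = [(0.5, 0.5, 0.5), (2.0, 2.0, 2.0)]
feats = [[1.0, -1.0], [0.3, 0.3]]

# Zero weights and biases give every gate the value sigmoid(0) = 0.5.
dim = 3 + 2
weights = [[0.0] * dim for _ in range(dim)]
bias = [0.0] * dim

interior = points_in_box(points, (0.0, 0.0, 0.0), (1.0, 1.0, 1.0))
fused = [fuse_point(points[i], feats[i], weights, bias) for i in interior]
```

In this sketch, only the first point falls inside the box, and its fused vector keeps both the coordinate channels and the feature channels, each scaled by its gate. A second-stage network would then consume such per-point vectors to refine the box.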
