PointFlowNet: Learning Representations for 3D Scene Flow Estimation from Point Clouds

  • 2018-09-17 14:20:26
  • Aseem Behl, Despoina Paschalidou, Simon Donné, Andreas Geiger

Abstract

Despite significant progress in image-based 3D scene flow estimation, the performance of such approaches has not yet reached the fidelity required by many applications. Simultaneously, these applications are often not restricted to image-based estimation: laser scanners provide a popular alternative to traditional cameras, for example in the context of self-driving cars, as they directly yield a 3D point cloud. In this paper, we propose to estimate 3D scene flow from such unstructured point clouds using a deep neural network. In a single forward pass, our model jointly predicts 3D scene flow as well as the 3D bounding box and rigid body motion of objects in the scene. While the prospect of estimating 3D scene flow from unstructured point clouds is promising, it is also a challenging task. We show that the traditional global representation of rigid body motion prohibits inference by CNNs, and propose a translation equivariant representation to circumvent this problem. For training our deep network, a large dataset is required. Because of this, we augment real scans from KITTI with virtual objects, realistically modeling occlusions and simulating sensor noise. A thorough comparison with classic and learning-based techniques highlights the robustness of the proposed approach.

 
