3D Object Detection Using Scale Invariant and Feature Reweighting Networks

Authors

  • Xin Zhao, Chinese Academy of Sciences
  • Zhe Liu, Huazhong University of Science and Technology
  • Ruolan Hu, Huazhong University of Science and Technology
  • Kaiqi Huang, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v33i01.33019267

Abstract

3D object detection plays an important role in many real-world applications. It requires estimating the locations and orientations of 3D objects in real scenes. In this paper, we present a new network architecture that uses front-view images and frustum point clouds to generate 3D detection results. On the one hand, a PointSIFT module is used to improve the performance of 3D segmentation. It captures information from different orientations in space and is robust to shapes of different scales. On the other hand, our network uses a SENet module to retain useful features and suppress less informative ones. This module reweights channel features so that the 3D bounding boxes are estimated more effectively. Our method is evaluated on both the KITTI dataset for outdoor scenes and the SUN-RGBD dataset for indoor scenes. The experimental results show that our method achieves better performance than state-of-the-art methods, especially when point clouds are highly sparse.
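The channel reweighting idea mentioned in the abstract can be illustrated with a minimal squeeze-and-excitation sketch. This is not the authors' implementation: the function name `se_reweight`, the randomly initialised weights, the reduction ratio of 4, and the use of plain Python lists instead of a deep learning framework are all assumptions made for illustration. It only shows the general pattern of averaging each channel over the points, passing the result through a small bottleneck, and scaling every channel by the resulting gate.

```python
import math
import random


def se_reweight(features, w1, b1, w2, b2):
    """Squeeze-and-excitation style reweighting of per-point features.

    features: list of points, each a list of `channels` floats.
    w1, b1:   bottleneck layer (channels -> hidden), hypothetical weights.
    w2, b2:   expansion layer (hidden -> channels), hypothetical weights.
    """
    num_points = len(features)
    channels = len(features[0])
    hidden_size = len(b1)

    # Squeeze: average each channel over all points.
    squeezed = [sum(p[c] for p in features) / num_points
                for c in range(channels)]

    # Excitation, step 1: bottleneck layer followed by ReLU.
    hidden = [max(0.0, sum(squeezed[c] * w1[c][h] for c in range(channels)) + b1[h])
              for h in range(hidden_size)]

    # Excitation, step 2: expansion layer followed by a sigmoid gate per channel.
    gates = [1.0 / (1.0 + math.exp(-(sum(hidden[h] * w2[h][c]
                                         for h in range(hidden_size)) + b2[c])))
             for c in range(channels)]

    # Reweight: scale every channel of every point by its gate.
    reweighted = [[p[c] * gates[c] for c in range(channels)] for p in features]
    return reweighted, gates


# Toy data: 32 points with 8 feature channels (sizes chosen arbitrarily).
rng = random.Random(0)
num_points, channels, reduction = 32, 8, 4
hidden_size = channels // reduction

features = [[rng.gauss(0.0, 1.0) for _ in range(channels)] for _ in range(num_points)]
w1 = [[rng.gauss(0.0, 0.1) for _ in range(hidden_size)] for _ in range(channels)]
b1 = [0.0] * hidden_size
w2 = [[rng.gauss(0.0, 0.1) for _ in range(channels)] for _ in range(hidden_size)]
b2 = [0.0] * channels

reweighted, gates = se_reweight(features, w1, b1, w2, b2)
```

Each gate lies strictly between 0 and 1, so channels with small gates are suppressed relative to the others. In a trained network the weights would be learned rather than sampled at random.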

Published

2019-07-17

How to Cite

Zhao, X., Liu, Z., Hu, R., & Huang, K. (2019). 3D Object Detection Using Scale Invariant and Feature Reweighting Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 9267-9274. https://doi.org/10.1609/aaai.v33i01.33019267

Section

AAAI Technical Track: Vision