3D Object Detection Using Scale Invariant and Feature Reweighting Networks

Xin Zhao; Zhe Liu; Ruolan Hu; Kaiqi Huang

doi:10.1609/aaai.v33i01.33019267

Authors

Xin Zhao, Chinese Academy of Sciences
Zhe Liu, Huazhong University of Science and Technology
Ruolan Hu, Huazhong University of Science and Technology
Kaiqi Huang, Chinese Academy of Sciences

Abstract

3D object detection plays an important role in a large number of real-world applications. It requires estimating the locations and orientations of 3D objects in real scenes. In this paper, we present a new network architecture that uses front-view images and frustum point clouds to generate 3D detection results. On the one hand, a PointSIFT module is used to improve the performance of 3D segmentation. It captures information from different orientations in space and is robust to shapes of different scales. On the other hand, our network uses a SENet module to retain useful features and suppress less informative ones. This module reweights channel features, which lets the network estimate 3D bounding boxes more effectively. Our method is evaluated on both the KITTI dataset for outdoor scenes and the SUN-RGBD dataset for indoor scenes. The experimental results show that our method achieves better performance than state-of-the-art methods, especially when point clouds are highly sparse.
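
The channel reweighting idea mentioned in the abstract can be illustrated with a generic squeeze-and-excitation step applied to per-point features. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `se_reweight`, the feature shapes, the bottleneck size, and the random untrained weights are all hypothetical and chosen only for illustration.

```python
import math
import random


def se_reweight(features, w1, b1, w2, b2):
    """Squeeze-and-excitation style channel reweighting (illustrative only).

    features: list of N points, each a list of C channel values.
    w1: C x H weights, b1: H biases (bottleneck layer, ReLU).
    w2: H x C weights, b2: C biases (gate layer, sigmoid).
    Returns (reweighted features, per-channel gates).
    """
    n, c, h = len(features), len(features[0]), len(b1)
    # Squeeze: average each channel over all points.
    squeezed = [sum(p[j] for p in features) / n for j in range(c)]
    # Excitation, step 1: bottleneck fully connected layer with ReLU.
    hidden = [
        max(0.0, sum(squeezed[j] * w1[j][k] for j in range(c)) + b1[k])
        for k in range(h)
    ]
    # Excitation, step 2: fully connected layer with sigmoid, gates in (0, 1).
    gates = []
    for j in range(c):
        z = sum(hidden[k] * w2[k][j] for k in range(h)) + b2[j]
        gates.append(1.0 / (1.0 + math.exp(-z)))
    # Scale: multiply channel j of every point by gate j.
    scaled = [[p[j] * gates[j] for j in range(c)] for p in features]
    return scaled, gates


# Toy usage with assumed sizes: 5 points, 4 channels, bottleneck of 2.
random.seed(0)
N, C, H = 5, 4, 2
feats = [[random.uniform(-1, 1) for _ in range(C)] for _ in range(N)]
w1 = [[random.uniform(-1, 1) for _ in range(H)] for _ in range(C)]
w2 = [[random.uniform(-1, 1) for _ in range(C)] for _ in range(H)]
out, gates = se_reweight(feats, w1, [0.0] * H, w2, [0.0] * C)
```

Each gate lies strictly between 0 and 1, so a channel with a gate near 0 is suppressed while a channel with a gate near 1 passes through almost unchanged. In a trained network the weights would be learned, so the gates would reflect which channels are informative for the task.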
