"Anchor-free 3D single stage detector with mask-guided attention for point cloud" by Jiale Li

Selected Works of Hang Dai

Article

Anchor-free 3D single stage detector with mask-guided attention for point cloud

MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia

Jiale Li, Zhejiang University
Hang Dai, Mohamed Bin Zayed University of Artificial Intelligence
Ling Shao, Inception Institute of Artificial Intelligence
Yong Ding, Zhejiang University

Link

Document Type

Conference Proceeding

Abstract

Most of the existing single-stage and two-stage 3D object detectors are anchor-based methods, while the efficient but challenging anchor-free single-stage 3D object detection is not well investigated. Recent studies on 2D object detection show that the anchor-free methods also are of great potential. However, the unordered and sparse properties of point clouds prevent us from directly leveraging the advanced 2D methods on 3D point clouds. We overcome this by converting the voxel-based sparse 3D feature volumes into the sparse 2D feature maps. We propose an attentive module to fit the sparse feature maps to dense mostly on the object regions through the deformable convolution tower and the supervised mask-guided attention. By directly regressing the 3D bounding box from the enhanced and dense feature maps, we construct a novel single-stage 3D detector for point clouds in an anchor-free manner. We propose an IoU-based detection confidence re-calibration scheme to improve the correlation between the detection confidence score and the accuracy of the bounding box regression. Our code is publicly available at https://github.com/jialeli1/MGAF-3DSSD.

DOI

10.1145/3474085.3475208

Publication Date

10-17-2021

Keywords

3D object detection,
anchor-free,
point cloud,
single stage

Disciplines

Comments

IR Deposit conditions: non-described

Citation Information

J. Li, H. Dai, L. Shao and Y. Ding, "Anchor-free 3D single stage detector with mask-guided attention for point cloud", in Proceedings of the 29th ACM International Conference on Multimedia, New York, 2021, pp. 553–562. Available: 10.1145/3474085.3475208