首页 /研究 /DiTNet: End-to-End 3D Object Detection and Track ID Assignment in Spatio-Temporal World
PERCEPTION

DiTNet: End-to-End 3D Object Detection and Track ID Assignment in Spatio-Temporal World

Sukai Wang, Peide Cai, Lujia Wang, Ming Liu

发表年份
2021
引用次数
26

摘要

End-to-end 3D object detection and tracking based on point clouds is receiving more and more attention in many robotics applications, such as autonomous driving. Compared with 2D images, 3D point clouds do not have enough texture information for data association. Thus, we propose an end-to-end point cloud-based network, DiTNet, to directly assign a track ID to each object across the whole sequence, without the data association step. DiTNet is made location-invariant by using relative location and embeddings to learn each object's spatial and temporal features in the Spatio-temporal world. The features from the detection module helps to improve the tracking performance, and the tracking module with final trajectories also helps to refine the detection results. We train and evaluate our network on the CARLA simulation environment and KITTI dataset. Our approach achieves competitive performance over the state-of-the-art methods on the KITTI benchmark.

关键词

Point cloudArtificial intelligenceEnd-to-end principleComputer scienceComputer visionObject detectionBenchmark (surveying)Tracking (education)Track (disk drive)Association (psychology)

相关论文

查看 PERCEPTION 分类全部论文