GitHub - NEU-REAL/MMF-Tracker: [TIV'23] MMF-Track: Multi-modal Multi-level Fusion for 3D Single Object Tracking

MMF-Track: Multi-modal Multi-level Fusion for 3D Single Object Tracking

This is the official code release of "MMF-Track: Multi-modal Multi-level Fusion for 3D Single Object Tracking"

Abstract

3D single object tracking plays an important role in computer vision and autonomous driving. The mainstream methods mainly rely on point clouds to achieve geometry matching between target template and search area. However, textureless and incomplete point clouds make it difficult for single-modal trackers to distinguish objects with similar structures. To overcome the mentioned limitations of geometry matching, we propose a Multi-modal Multi-level Fusion Tracker (MMF-Track), which exploits the image texture and geometry characteristic of point clouds to track 3D target. Specifically, we first propose a Space Alignment Module (SAM) to align RGB images with point clouds in 3D space, which is the prerequisite for constructing inter-modal associations. After that, in feature interaction level, we present a Feature Interaction Module (FIM) based on dual-stream structure, which enhances intra-modal features in parallel and constructs inter-modal semantic associations. Meanwhile, in order to refine each modal feature, we propose a Coarse-to-Fine Interaction Module (CFIM) to realize the hierarchical feature interaction at different scales. Finally, in similarity fusion level, we introduce a Similarity Fusion Module (SFM) to aggregate geometry and texture similarity from the target. Extensive experiments show that our method achieves competitive performance on KITTI and NuScenes datasets.

Method

Code

Coming soon...

Acknowledgment

This repo is built upon OpenPCDet.
Thank traveller59 for his implementation of Spconv.
Thank tianweiy for his implementation of CenterPoint.
Thank 3bobo for his implementation of LTTR.

Citation

If you find the project useful for your research, you may cite,

@ARTICLE{10292917,
  author={Li, Zhiheng and Cui, Yubo and Lin, Yu and Fang, Zheng},
  journal={IEEE Transactions on Intelligent Vehicles}, 
  title={MMF-Track: Multi-Modal Multi-Level Fusion for 3D Single Object Tracking}, 
  year={2024},
  volume={9},
  number={1},
  pages={1817-1829},
  keywords={Three-dimensional displays;Target tracking;Geometry;Point cloud compression;Feature extraction;Object tracking;Transformers;3D single object tracking (SOT);multi-modal data fusion;transformer;siamese network},
  doi={10.1109/TIV.2023.3326790}}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Picture		Picture
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MMF-Track: Multi-modal Multi-level Fusion for 3D Single Object Tracking

Abstract

Method

Code

Acknowledgment

Citation

About

Releases

Packages

NEU-REAL/MMF-Tracker

Folders and files

Latest commit

History

Repository files navigation

MMF-Track: Multi-modal Multi-level Fusion for 3D Single Object Tracking

Abstract

Method

Code

Acknowledgment

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages