Wang, Z., Xie, Q. orcid.org/0000-0001-9901-0396, Lai, Y.-K. et al. (3 more authors) (2022) MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 10-17 Oct 2021, Montreal, QC, Canada. IEEE, pp. 3081-3090. ISBN: 978-1-6654-2813-2. ISSN: 1550-5499. EISSN: 2380-7504.
Abstract
Benefiting from the excellent performance of Siamese-based trackers, huge progress on 2D visual tracking has been achieved. However, 3D visual tracking is still under-explored. Inspired by the idea of Hough voting in 3D object detection, in this paper, we propose a Multi-level Voting Siamese Network (MLVSNet) for 3D visual tracking from outdoor point cloud sequences. To deal with sparsity in outdoor 3D point clouds, we propose to perform Hough voting on multi-level features to get more vote centers and retain more useful information, instead of voting only on the fi-nal level feature as in previous methods. We also design an efficient and lightweight Target-Guided Attention (TGA) module to transfer the target information and highlight the target points in the search area. Moreover, we propose a Vote-cluster Feature Enhancement (VFE) module to exploit the relationships between different vote clusters. Extensive experiments on the 3D tracking benchmark of KITTI dataset demonstrate that our MLVSNet outperforms state-of-the-art methods with significant margins. Code will be available at https://github.com/CodeWZT/MLVSNet.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
| Keywords: | Detection and localization in 2D and 3D, Motion and tracking |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
| Date Deposited: | 30 Oct 2025 11:01 |
| Last Modified: | 30 Oct 2025 11:14 |
| Published Version: | https://ieeexplore.ieee.org/document/9710975 |
| Status: | Published |
| Publisher: | IEEE |
| Identification Number: | 10.1109/iccv48922.2021.00309 |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:233703 |

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)