DepthTrack: Cluster Meets BEV for Multi-Camera Multi-Target 3D Tracking
  • Tran, Tai Huu-Phuong
  • Tran, Duong Nguyen-Ngoc
  • Huynh, Ngoc Doan-Minh
  • Tran, Chi Dai
  • Pham, Long Hoang
  • ... Jeont, Jae Wook
  • 외 7명
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Object perception using multi-view cameras is essential for intelligent systems operating in indoor environments such as warehouses, retail stores, and hospitals. Traditional multi-target multi-camera (MTMC) detection and tracking approaches typically depend on 2D object detection and single-view multi-object tracking (MOT), without properly exploiting 3D spatial information. While point cloud-based 3D detection models offer high accuracy, their adoption is hindered by expensive deployment requirements. Besides, most existing MTMC tracking methods are designed for daytime scenarios, neglecting the challenge posed by lowlight conditions. In this paper, we introduce DepthTrack, a novel MTMC tracking framework for all-day tracking without requiring high-precision 3D sensors to produce 3D bounding boxes. At the core of DepthTrack is our Tracklet-Cluster Mapping (TCM) strategy, which seamlessly integrates 3D object clusters with bird's-eye view (BEV) tracklets to achieve robust 3D tracking. Experiments conducted on the AICity'25 dataset validate the strong generalization of DepthTrack across diverse lighting conditions. In the 2025 AI City Challenge Track 1, our team secured the second position with an accuracy (HOTA) of 63.1396. The code will be released at https://github.com/SKKUAutoLab/AIC25_Track_01

키워드

3d object trackingbirds-eye view trackingmulti-camera 3d perceptionmulti-target multi-camera trackingmulti-view trackingre-identification
제목
DepthTrack: Cluster Meets BEV for Multi-Camera Multi-Target 3D Tracking
저자
Tran, Tai Huu-PhuongTran, Duong Nguyen-NgocHuynh, Ngoc Doan-MinhTran, Chi DaiPham, Long HoangHo, Quoc Pham-NamNguyen, Huy-HungVu, Duong KhacJeon, Hyung-MinJeon, Hyung-JoonPhan, Son HongKhanh, Trinh Le BaJeont, Jae Wook
DOI
10.1109/ICCVW69036.2025.00558
발행일
2025
유형
Proceedings Paper
저널명
Proceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025
페이지
5348 ~ 5357