arXiv | IEEE Xplore | website
This repository is the official implementation of the paper:
3D Multi-Object Tracking Using Graph Neural Networks with Cross-Edge Modality Attention
Martin Büchner and Abhinav Valada.
IEEE Robotics and Automation Letters (RA-L), Vol. 7, Issue 4, pp. 9707-9714, 2022
If you find our work useful, please consider citing our paper:
@article{buchner20223d,
title={3D Multi-Object Tracking Using Graph Neural Networks With Cross-Edge Modality Attention},
author={B{\"u}chner, Martin and Valada, Abhinav},
journal={IEEE Robotics and Automation Letters},
volume={7},
number={4},
pages={9707--9714},
year={2022},
publisher={IEEE}
}
Online 3D multi-object tracking (MOT) has witnessed significant research interest in recent years, largely driven by demand from the autonomous systems community. However, 3D offline MOT is relatively less explored. Labeling 3D trajectory scene data at a large scale while not relying on high-cost human experts is still an open research question. In this work, we propose Batch3DMOT, which follows the tracking-by-detection paradigm and represents real-world scenes as directed, acyclic, and category-disjoint tracking graphs that are attributed using various modalities such as camera, LiDAR, and radar. We present a multi-modal graph neural network that uses a cross-edge attention mechanism to mitigate modality intermittence, which translates into sparsity in the graph domain. Additionally, we present attention-weighted convolutions over frame-wise k-NN neighborhoods as a suitable means of allowing information exchange across disconnected graph components. We evaluate our approach using various sensor modalities and model configurations on the challenging nuScenes and KITTI datasets. Extensive experiments demonstrate that our proposed approach yields an overall improvement of 2.8% in the AMOTA score on nuScenes, thereby setting a new benchmark for 3D tracking methods, and successfully enhances false-positive filtering.
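As a rough illustration of the cross-edge modality attention idea, the sketch below fuses per-edge features from several sensors while masking out modalities that are missing on a given edge. It is a minimal stand-in written for this README, not the model from the paper; all names, layer choices, and dimensions are assumptions.

```python
import torch
import torch.nn as nn

class CrossEdgeModalityAttention(nn.Module):
    """Illustrative sketch: fuse per-edge modality features, skipping missing ones."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Linear(dim, 1)   # scalar relevance score per modality feature
        self.out = nn.Linear(dim, dim)

    def forward(self, edge_feats, valid_mask):
        # edge_feats: (E, M, dim) features of M modalities per edge
        # valid_mask: (E, M) True where a modality is observed on that edge
        logits = self.score(edge_feats).squeeze(-1)              # (E, M)
        logits = logits.masked_fill(~valid_mask, float('-inf'))  # ignore missing modalities
        attn = torch.nan_to_num(torch.softmax(logits, dim=-1))   # all-missing edges -> zero weights
        fused = (attn.unsqueeze(-1) * edge_feats).sum(dim=1)     # (E, dim) weighted sum
        return self.out(fused)

# Toy usage: 4 edges, 3 modalities (e.g. camera/LiDAR/radar), 32-dim features.
feats = torch.randn(4, 3, 32)
mask = torch.tensor([[1, 1, 1], [1, 0, 1], [1, 1, 0], [0, 0, 0]], dtype=torch.bool)
print(CrossEdgeModalityAttention(32)(feats, mask).shape)  # torch.Size([4, 32])
```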
- Download the nuScenes dataset from here.
- Download the Megvii and CenterPoint detections. You may use `src/utils/concat_jsons.py` to obtain mini-split results (a minimal illustration of the concatenation follows below).
- Define the relevant paths in `*_config.yaml`.
- The `tmp` folder holds the preprocessed graph data, while the `data` folder holds the raw nuScenes dataset.
- Adjust the package paths to match your local setup.
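For reference, concatenating per-split detection files essentially means merging their results dictionaries. The snippet below is only a sketch assuming the standard nuScenes submission layout (`{"meta": ..., "results": {sample_token: [...]}}`); the file names are placeholders, and `src/utils/concat_jsons.py` remains the script to actually use.

```python
# Minimal sketch of concatenating nuScenes-style detection result files.
import json

def concat_results(paths, out_path):
    merged = {"meta": None, "results": {}}
    for p in paths:
        with open(p) as f:
            sub = json.load(f)
        merged["meta"] = merged["meta"] or sub.get("meta", {})
        merged["results"].update(sub["results"])   # sample tokens are disjoint across splits
    with open(out_path, "w") as f:
        json.dump(merged, f)

# concat_results(["detections_train.json", "detections_val.json"], "detections_mini.json")
```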
- Generate the 2D image annotations by running
python nuscenes/scripts/export_2d_annotations_as_json.py --dataroot=/path/to/nuscdata --version=v1.0-trainval
and place the resulting file under the nuScenes data directory (a quick sanity check is sketched after this step).
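A quick sanity check on the exported annotations can be as simple as loading the JSON and counting entries. The snippet assumes the devkit's default output file name (`image_annotations.json`); adjust the path if your devkit version writes elsewhere.

```python
# Hedged sanity check: the output file name is assumed to be the devkit default.
import json

with open('/path/to/nuscdata/image_annotations.json') as f:
    anns = json.load(f)
print(f"{len(anns)} 2D boxes exported")   # one record per projected box
```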
- Generate metadata and GT for feature encoder training:
python batch_3dmot/preprocessing/preprocess_img.py --config cl_config.yaml
python batch_3dmot/preprocessing/preprocess_lidar.py --config cl_config.yaml
python batch_3dmot/preprocessing/preprocess_radar.py --config cl_config.yaml
- Train the feature encoders (an illustrative point-cloud encoder is sketched below):
python batch_3dmot/preprocessing/train_resnet_ae.py --config cl_config.yaml
python batch_3dmot/preprocessing/train_pointnet.py --config cl_config.yaml
python batch_3dmot/preprocessing/train_radarnet.py --config cl_config.yaml
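To give an idea of what these encoders do, the sketch below shows a PointNet-style point-cloud encoder of the general kind trained by `train_pointnet.py`. It is an illustration only; the layer sizes and input dimensionality (x, y, z, intensity) are assumptions rather than the repository's actual architecture.

```python
import torch
import torch.nn as nn

class PointEncoder(nn.Module):
    """Illustrative sketch: encode a box-cropped point cloud into a fixed-size feature."""
    def __init__(self, in_dim=4, feat_dim=128):     # assumed inputs: x, y, z, intensity
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Conv1d(in_dim, 64, 1), nn.ReLU(),
            nn.Conv1d(64, 128, 1), nn.ReLU(),
            nn.Conv1d(128, feat_dim, 1),
        )

    def forward(self, pts):                          # pts: (B, N, in_dim)
        h = self.mlp(pts.transpose(1, 2))            # shared per-point MLP -> (B, feat_dim, N)
        return h.max(dim=2).values                   # symmetric max-pool over points

print(PointEncoder()(torch.randn(2, 256, 4)).shape)  # torch.Size([2, 128])
```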
- Construct the disjoint, directed tracking graphs, either with multi-modal features or with poses only (see the sketch below for the basic graph connectivity):
python batch_3dmot/preprocessing/construct_detection_graphs_disjoint_parallel.py --config cl_config.yaml
python batch_3dmot/preprocessing/construct_detection_graphs_disjoint_parallel_only_poses.py --config pose_config.yaml
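Conceptually, graph construction connects each detection to same-category detections in the next few frames via directed edges. The sketch below shows only this basic connectivity under an assumed maximum frame gap; the repository's scripts additionally attach pose and per-modality features to nodes and edges.

```python
# Rough sketch: directed, category-disjoint edges between detections across frames.
import itertools
import torch

def build_edges(frames, categories, max_frame_gap=3):   # max_frame_gap is an assumed parameter
    edges = []
    for i, j in itertools.permutations(range(len(frames)), 2):
        same_cat = categories[i] == categories[j]        # category-disjoint graphs
        gap = frames[j] - frames[i]
        if same_cat and 0 < gap <= max_frame_gap:        # directed edge forward in time
            edges.append((i, j))
    return torch.tensor(edges, dtype=torch.long).t()     # (2, E) edge_index

frames     = [0, 0, 1, 1, 2]
categories = ["car", "pedestrian", "car", "pedestrian", "car"]
print(build_edges(frames, categories))
```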
- Train Batch3DMOT (poses-only or using modalities):
python batch_3dmot/train_poses_only.py --config pose_config.yaml
python batch_3dmot/wandb_train.py --config cl_config.yaml
- Perform inference using the trained model (poses only, pose + camera, or additional modalities):
python batch_3dmot/predict_detections_poses.py --config pose_config.yaml
python batch_3dmot/predict_detections_img.py --config cl_config.yaml
python batch_3dmot/predict_detections.py --config cl_config.yaml
- Evaluate the produced tracking results (see the devkit-based sketch below):
python batch_3dmot/eval/eval_nuscenes.py --config *_config.yaml
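Alternatively, tracking results in the standard nuScenes submission format can be scored directly with the nuScenes devkit, roughly as sketched below. Paths, version, and split are placeholders, and argument names may vary slightly across devkit versions.

```python
# Sketch of scoring a tracking submission with the nuScenes devkit.
from nuscenes.eval.common.config import config_factory
from nuscenes.eval.tracking.evaluate import TrackingEval

cfg = config_factory('tracking_nips_2019')
TrackingEval(config=cfg,
             result_path='/path/to/tracking_results.json',
             eval_set='val',
             output_dir='/path/to/eval_output',
             nusc_version='v1.0-trainval',
             nusc_dataroot='/path/to/nuscdata').main(render_curves=False)
```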
For academic usage, the code is released under the GPLv3 license. For any commercial purpose, please contact the authors.