- 07-09-2023: The paper is available on arXiv now
- 28-08-2023: The pretrained tracker model is released
- 17-08-2023: The SMAT tracker training and inference code is released
- 14-08-2023: The paper is accepted at WACV2024
Install the dependency packages using the environment file `smat_pyenv.yml`.
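Assuming the environment file is a conda specification (this is an assumption; use whichever tool `smat_pyenv.yml` was written for), a minimal setup would be:

```
conda env create -f smat_pyenv.yml
conda activate smat   # "smat" is a placeholder; use the environment name defined inside smat_pyenv.yml
```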
Generate the relevant files:

```
python tracking/create_default_local_file.py --workspace_dir . --data_dir ./data --save_dir ./output
```

After running this command, modify the dataset paths by editing the following files (an illustrative example follows):

```
lib/train/admin/local.py      # paths for training
lib/test/evaluation/local.py  # paths for testing
```
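The edits amount to pointing a few path attributes at your local dataset copies. The snippet below is a hypothetical excerpt of `lib/train/admin/local.py`; the class and attribute names are assumptions, so check the file generated by `create_default_local_file.py` for the exact ones:

```python
# Hypothetical excerpt of lib/train/admin/local.py -- names are assumptions,
# verify against the file generated by create_default_local_file.py.
class EnvironmentSettings:
    def __init__(self):
        self.workspace_dir = './output'              # checkpoints and logs go here
        self.got10k_dir = './data/got10k/train'      # GOT-10k training split
        self.lasot_dir = './data/lasot'              # LaSOT sequences
        self.trackingnet_dir = './data/trackingnet'  # TrackingNet chunks
```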
- Set the path of the training datasets in `lib/train/admin/local.py`
- Place the pretrained backbone model under the `pretrained_models/` folder
- For data preparation, please refer to this
- Uncomment lines 63, 67, and 71 in the `base_backbone.py` file. Long story short: the code is optimized for high inference speed, hence some intermediate feature maps are pre-computed during testing. However, these pre-computations are not feasible during training (see the sketch after this list).
- Run:
  ```
  python tracking/train.py --script mobilevitv2_track --config mobilevitv2_256_128x1_ep300 --save_dir ./output --mode single
  ```
- The training logs will be saved under the `output/logs/` folder
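For context on the uncommenting step above: the text says some intermediate feature maps are pre-computed during testing, which suggests a cache-at-initialization, reuse-per-frame pattern that cannot apply during training, where inputs change every iteration. The sketch below is a generic illustration of that pattern only; the class and method names are invented for illustration and are not the repository's actual API:

```python
# Illustrative sketch of test-time feature caching -- not the actual SMAT code.
import torch

class CachingBackboneSketch:
    def __init__(self, backbone):
        self.backbone = backbone
        self.cached_template_feat = None  # filled once at test time

    def initialize(self, template_img):
        # Test time: pre-compute the template feature map a single time.
        with torch.no_grad():
            self.cached_template_feat = self.backbone(template_img)

    def forward_train(self, template_img, search_img):
        # Training: inputs differ in every batch, so both branches are recomputed.
        return self.backbone(template_img), self.backbone(search_img)

    def forward_test(self, search_img):
        # Inference: reuse the cached template feature, run only the search branch.
        return self.cached_template_feat, self.backbone(search_img)
```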
The pretrained tracker model can be found here
- Update the test dataset paths in `lib/test/evaluation/local.py`
- Place the pretrained tracker model under the `output/checkpoints/` folder
- Run:
  ```
  python tracking/test.py --tracker_name mobilevitv2_track --tracker_param mobilevitv2_256_128x1_ep300 --dataset got10k_test --inference_mode pytorch
  ```
  where `--dataset` is one of `got10k_test`, `trackingnet`, or `lasot`, and `--inference_mode` is one of `pytorch`, `onnx`, `openvino`, or `tensorrtfp32` (an example is shown after this list)
- Change the `DEVICE` variable between `cuda` and `cpu` in the `--tracker_param` file for GPU-based and CPU-based inference, respectively
- The raw results will be stored under the `output/test/` folder
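For instance, evaluating on LaSOT with the ONNX backend (both values taken from the option lists above):

```
python tracking/test.py --tracker_name mobilevitv2_track --tracker_param mobilevitv2_256_128x1_ep300 --dataset lasot --inference_mode onnx
```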
To evaluate the tracker on a sample video, run:

```
python tracking/video_demo.py --tracker_name mobilevitv2_track --tracker_param mobilevitv2_256_128x1_ep300 --videofile *path-to-video-file* --optional_box *bounding-box-annotation*
```
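A hypothetical example (the video file name and the bounding-box values are made-up placeholders, and the expected box format should be checked in `tracking/video_demo.py`):

```
# hypothetical values for the placeholders above
python tracking/video_demo.py --tracker_name mobilevitv2_track --tracker_param mobilevitv2_256_128x1_ep300 --videofile ./my_video.mp4 --optional_box 325 170 120 140
```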
- We use the Separable Self-Attention Transformer implementation and the pretrained MobileViTv2 backbone from ml-cvnets. Thank you!
- Our training code is built upon OSTrack and PyTracking
- To generate the evaluation metrics for the different datasets (except the server-based GOT-10k and TrackingNet), we use the pysot-toolkit
If our work is useful for your research, please consider citing:

```
@InProceedings{Gopal2024Sep,
  author    = "Goutam Yelluru Gopal and Maria Amer",
  title     = "Separable Self and Mixed Attention Transformers for Efficient Object Tracking",
  booktitle = "IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)",
  year      = "2024",
  pages     = "8",
  month     = "Jan. 4-8",
  address   = "Waikoloa, Hawaii",
}
```