Git Product home page Git Product logo

siamse-rgbt's Introduction

Scale Equivariance Improves Siamese Tracking

PWC PWC PWC PWC

Ivan Sosnovik*, Artem Moskalev*, and Arnold Smeulders Scale Equivariance Improves Siamese Tracking, WACV 2021.

Introduction

Siamese trackers turn tracking into similarity estimation between a template and the candidate regions in the frame. Mathematically, one of the key ingredients of success of the similarity function is translation equivariance. Non-translation-equivariant architectures induce a positional bias during training, which hinders accurate localization. In real-life scenarios, however, the target undergos more transformations than just translation, e.g. rotation and scaling. In this work we focus on the later and demonstrate that extending Siamese tracking with built-in scale-equivariance improves tracking quality.




Results


Models OTB-2013 OTB-2015 VOT2016 VOT2017
SiamFC+ 0.67 0.64 0.30 0.23
SE-SiamFC 0.68 0.66 0.36 0.27

Raw results and models are available here

Environment: The code is developed with Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz GPU: NVIDIA Titan RTX,

Requirements

To train models and to run evaluations, you need the following dependencies

yacs
scipy
shapely
opencv-python
numpy
pytorch
torchvision
pyyaml

You will need to install python API for MATLAB engine to run the VOT benchmarks.

Quick start

  1. Download the weights for model initialization from here. We used SESiamFCResNet22_pretrained.pth. You can also run the script transfer_weights.py to generate your own weights.
  2. Download the preprocessed training and testing datasets. Use the instructions provided here
  3. Adjust the training config in configs/train.yaml according to your tasks.
  4. Run the training script
CUDA_VISIBLE_DEVICES=0 python train_siamfc.py --cfg configs/train.yaml
  1. To run the tracker from a snapshot, simply do the following
CUDA_VISIBLE_DEVICES=0 python test_siamfc.py \
    --checkpoint snapshot/SESiamFCResNet22/checkpoint_otb.pth \
    --dataset OTB2015 \
    --dataset_root ~/datasets/ \
    --cfg configs/test.yaml 

  1. When the output is generated, you are able to evaluate the performance of the tracker. Use lib/core/eval_otb.py or lib/core/eval_vot.py. For example,
python lib/core/eval_otb.py \
    --dataset OTB2015 \
    --dataset_root ~/datasets \
    --result_path results \
    --tracker_reg "SE*" \

Acknowledgements

The Robert Bosch GmbH is acknowledged for financial support.

Citation

If you found this work useful in your research, please consider citing

@InProceedings{Sosnovik_2021_WACV,
    author    = {Sosnovik, Ivan and Moskalev, Artem and Smeulders, Arnold W.M.},
    title     = {Scale Equivariance Improves Siamese Tracking},
    booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month     = {January},
    year      = {2021},
    pages     = {2765-2774}
}

License

Licensed under an MIT license.

siamse-rgbt's People

Contributors

isosnovik avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.