Git Product home page Git Product logo

anchor-diff-vos's Introduction

Anchor diffusion VOS

This repository contains code for the paper

Anchor Diffusion for Unsupervised Video Object Segmentation
Zhao Yang*, Qiang Wang*, Luca Bertinetto, Weiming Hu, Song Bai, Philip H.S. Torr
ICCV 2019 | PDF | BibTex

Setup

Code tested for Ubuntu 16.04, Python 3.7, PyTorch 0.4.1, and CUDA 9.2.

  • Clone the repository and change to the new directory.
git clone https://github.com/yz93/anchor-diff-VOS-internal.git && cd anchor-diff-VOS
  • Save the working directory to an environment variable for reference.
export AnchorDiff=$PWD
  • Set up a new conda environment.
    • For installing PyTorch 0.4.1 with different versions of CUDA, see here.
conda create -n anchordiff python=3.7 pytorch=0.4.1 cuda92 -c pytorch
source activate anchordiff
pip install -r requirements.txt

Data preparation

  • Download the data set
cd $AnchorDiff
wget https://data.vision.ee.ethz.ch/csergi/share/davis/DAVIS-2017-trainval-480p.zip
unzip DAVIS-2017-trainval-480p.zip -d data
cd $AnchorDiff
unzip snapshots.zip -d snapshots
  • (If you do not intend to apply instance pruning described in the paper, feel free to skip this.) Download the detection results that we have computed using ExtremeNet, and generate the pruning masks.
cd $AnchorDiff
wget www.robots.ox.ac.uk/~yz/detection.zip
unzip detection.zip
python detection_filter.py

Evaluation on DAVIS 2016

  • Examples for evaluating mean IoU on the validation set with options,
    • save-mask (default 'True') for saving the predicted masks,
    • ms-mirror (default 'False') for multiple-scale and mirrored input (slow),
    • inst-prune (default 'False') for instance pruning,
    • model (default 'ad') specifying models in Table 1 of the paper,
    • eval-sal (default 'False') for computing saliency measures, MAE and F-score.
cd $AnchorDiff
python eval.py
python eval.py --ms-mirror True --inst-prune True --eval-sal True

License

The MIT License.

anchor-diff-vos's People

Contributors

yz93 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

anchor-diff-vos's Issues

403 Forbidden in pretrained model website

I encounter "You don't have permission to access this resource." when I visit the pretrained weights' website.
Could you please help me get the pretrained model? Thanks.

Do not understand "unsupervised"

hello, after reading your paper, I don't understand how to train the model.
In experiment section, "binary cross-entropy loss" is mentioned, it seems the network should be trained with pixel-wise supervised label like segmentation.
Could you show the "unsupervised process" with detail? Thank you.

predicted masks

Hi Zhao Yang & Qiang Wang,

Thank you for your impressive work. I evaluated the model by following your instruction in readme. However the masks (png files) in pred_masks were all black. Why did it get these black masks while the mean J was about 0.8? Thank you!
image

can not get the same test result

I use pretrained model to inference: python eval.py --ms-mirror True --inst-prune True --eval-sal True

and use matlab code to evalate, the result is as follows:
cosnet adnet mmadnet

J mean 0.805 0.817 0.803
J recall 0.931 0.909 0.900
J decay 0.044 0.022 0.021

F mean 0.795 0.805 0.793
F recall 0.895 0.851 0.847
F decay 0.050 0.006 0.004

T (GT 0.088) 0.184 0.225 0.228

adnet is from your provided files. mmadnet is from me.
I am not sure what is wrong. Is it some parameters wrong?

Confused about how to optimize

Hi, I am confused about how to optimize.
1, what's the loss function? in this paper, it seems not to be mentioned. is it just BCE?
2, what's the ground truth?
Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.