Light

yz93 / anchor-diff-vos Goto Github PK

View Code? Open in Web Editor NEW

116.0 5.0 14.0 26 KB

Anchor Diffusion for Unsupervised Video Object Segmentation

Python 76.66% C++ 11.62% Cuda 11.72%

video-object-segmentation video-saliency pytorch deep-learning iccv2019 computer-vision

anchor-diff-vos's Introduction

Anchor diffusion VOS

This repository contains code for the paper

Anchor Diffusion for Unsupervised Video Object Segmentation
Zhao Yang*, Qiang Wang*, Luca Bertinetto, Weiming Hu, Song Bai, Philip H.S. Torr
ICCV 2019 | PDF | BibTex

Setup

Code tested for Ubuntu 16.04, Python 3.7, PyTorch 0.4.1, and CUDA 9.2.

Clone the repository and change to the new directory.

git clone https://github.com/yz93/anchor-diff-VOS-internal.git && cd anchor-diff-VOS

Save the working directory to an environment variable for reference.

export AnchorDiff=$PWD

Set up a new conda environment.
- For installing PyTorch 0.4.1 with different versions of CUDA, see here.

conda create -n anchordiff python=3.7 pytorch=0.4.1 cuda92 -c pytorch
source activate anchordiff
pip install -r requirements.txt

Data preparation

Download the data set

cd $AnchorDiff
wget https://data.vision.ee.ethz.ch/csergi/share/davis/DAVIS-2017-trainval-480p.zip
unzip DAVIS-2017-trainval-480p.zip -d data

Download pre-trained weights (1.5G) to $AnchorDiff.

cd $AnchorDiff
unzip snapshots.zip -d snapshots

(If you do not intend to apply instance pruning described in the paper, feel free to skip this.) Download the detection results that we have computed using ExtremeNet, and generate the pruning masks.

cd $AnchorDiff
wget www.robots.ox.ac.uk/~yz/detection.zip
unzip detection.zip
python detection_filter.py

Evaluation on DAVIS 2016

Examples for evaluating mean IoU on the validation set with options,
- save-mask (default 'True') for saving the predicted masks,
- ms-mirror (default 'False') for multiple-scale and mirrored input (slow),
- inst-prune (default 'False') for instance pruning,
- model (default 'ad') specifying models in Table 1 of the paper,
- eval-sal (default 'False') for computing saliency measures, MAE and F-score.

cd $AnchorDiff
python eval.py
python eval.py --ms-mirror True --inst-prune True --eval-sal True

Use the benchmark tool to evaluate the saved masks under more metrics.
Pre-computed results

License

The MIT License.

anchor-diff-vos's People

Contributors

Stargazers

Watchers

Forkers

mzahran001 ml-lab ingeniousfrog castile yangshushuaige visionresearch chiukin chunyang-zhang lhyciomp grapesonwang rogeriouff27 sundarammuthu yahoo0742 cv-ip

anchor-diff-vos's Issues

All of the Pre-computed maps seems to be background maps.

Very thanks for sharing your code. But when I download the Pre-computed results, each map is the background map. Could you upload the Pre-computed results again? Thank you~

403 Forbidden in pretrained model website

I encounter "You don't have permission to access this resource." when I visit the pretrained weights' website.
Could you please help me get the pretrained model? Thanks.

where is the training code?

Thanks for your nice work, where is the training code?

Could you please share the training code?

Dear author:

could you please share me the training code? Thank you in advance...

Do not understand "unsupervised"

hello, after reading your paper, I don't understand how to train the model.
In experiment section, "binary cross-entropy loss" is mentioned, it seems the network should be trained with pixel-wise supervised label like segmentation.
Could you show the "unsupervised process" with detail? Thank you.

When will the code be available?

Thanks for your contribution, I just want to know when will the code be available?

predicted masks

Hi Zhao Yang & Qiang Wang,

Thank you for your impressive work. I evaluated the model by following your instruction in readme. However the masks (png files) in pred_masks were all black. Why did it get these black masks while the mean J was about 0.8? Thank you!

What would it take to make this work with multiple classes?

Just more similarity matrices, more memory? Do the concatenation approach instead of the multiplication?

can not get the same test result

I use pretrained model to inference: python eval.py --ms-mirror True --inst-prune True --eval-sal True

and use matlab code to evalate, the result is as follows:
cosnet adnet mmadnet

J mean 0.805 0.817 0.803
J recall 0.931 0.909 0.900
J decay 0.044 0.022 0.021

F mean 0.795 0.805 0.793
F recall 0.895 0.851 0.847
F decay 0.050 0.006 0.004

T (GT 0.088) 0.184 0.225 0.228

adnet is from your provided files. mmadnet is from me.
I am not sure what is wrong. Is it some parameters wrong?

trainnin code?

hello , can you push the training code to my E-mail, thanks!

[email protected]

Confused about how to optimize

Hi, I am confused about how to optimize.
1, what's the loss function? in this paper, it seems not to be mentioned. is it just BCE?
2, what's the ground truth?
Thank you!

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.