Git Product home page Git Product logo

3dc-seg's Introduction

3DC-Seg

This repository contains the official implementation for the paper:

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan*, Ali Athar*,Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe

ECCV 2020 | Paper | Video | Project Page

Required Packages

  • Python 3.7
  • PyTorch 1.4 or greater
  • Nvidia-apex: https://github.com/NVIDIA/apex
  • tensorboard, pycocotools and other packages listed in requirements.txt

Setup

  1. Clone the repository and append it to the PYTHONPATH variable:

    git clone https://github.com/sabarim/3DC-Seg.git
    cd 3DC-Seg
    export PYTHONPATH=$(pwd):$PYTHONPATH
  2. Create a folder named 'saved_models'

Checkpoint

  1. The trained checkpoint is available in the below given link:

    Target Dataset Datasets Required for Training Model Checkpoint
    DAVIS, FBMS, ViSal COCO, YouTubeVOS, DAVIS'17 link

Usage

Training:

  1. Run mkdir -p saved_models/csn/
  2. Download the pretrained backbone weights and place it in the folder created above.
  python main.py -c run_configs/<name>.yaml --num_workers <number of workers for dataloader> --task train

Inference:

Use the pre-trained checkpoint downloaded from our server along with the provided config files to reproduce the results from Table. 4 and Table. 5 of the paper. Please note that you'll have to use the official davis evaluation package adapted for DAVIS-16 as per the issue listed here if you wish to run an evaluation on DAVIS.

  1. DAVIS:
python main.py -c run_configs/bmvc_final.yaml --task infer --wts <path>/bmvc_final.pth

  1. DAVIS - Dense
python main.py -c run_configs/bmvc_final_dense.yaml --task infer --wts <path>/bmvc_final.pth

  1. FBMS:
python main.py -c run_configs/bmvc_fbms.yaml --task infer --wts <path>/bmvc_final.pth

  1. ViSal
python main.py -c run_configs/bmvc_visal.yaml --task infer --wts <path>/bmvc_final.pth

Pre-computed results

Pre-computed segmentation masks for different datasets can be downloaded from the below given links:

Target Dataset Results
DAVIS link
DAVIS - Dense link
FBMS link
ViSal link

3dc-seg's People

Contributors

sabarim avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.