Git Product home page Git Product logo

deeppbm's Introduction

DeepPBM: Deep Probabilistic Background Modeling

This code is the implementation of the following paper accepted to the ICPR2020 Workshop on Deep Learning for Pattern Recognition (DLPR20):

DeepPBM: Deep Probabilistic Background Model Estimation from Video Sequences (https://arxiv.org/pdf/1902.00820.pdf)

Authors: Amirreza Farnoosh, Behnaz Rezaei, and Sarah Ostadabbas Corresponding Author: [email protected]

Requirements

This code is tested on Python3.6, Pytorch 1.0 and CUDA 8.0 on Ubuntu 16.04. MATLAB R2016b.

Data preparation

The following dataset is used for experiments in the paper:

BMC2012 dataset:

@inproceedings{vacavant2012benchmark,
  title={A benchmark dataset for outdoor foreground/background extraction},
  author={Vacavant, Antoine and Chateau, Thierry and Wilhelm, Alexis and Lequi{\`e}vre, Laurent},
  booktitle={Asian Conference on Computer Vision},
  pages={291--300},
  year={2012},
  organization={Springer}
}

After downloading the dataset, you should run BMC2012DataLoader.py to preprocess dataset and get .npy files.

Training and Testing

You should run BetaVAE_BMC2012_Vid#.py files for training the network for each specicfic video of BMC2012 dataset, and generating background images for each frame.

Foreground mask generation

You should run MaskExtraction_BMC2012.m to generate binary foreground masks from generated background images from the previous steps.

Quantitative results

You should run processVideoFolder.m , and then confusionMatrixToVar.m to generate quantitative results.

Reference

@article{farnoosh2020deeppbm, title={DeepPBM: deep probabilistic background model estimation from video sequences}, author={Farnoosh, Amirreza and Rezaei, Behnaz and Ostadabbas, Sarah}, journal={The Third International Workshop on Deep Learning for Pattern Recognition (DLPR20), in conjunction with the 25th International Conference on Pattern Recognition (ICPR 2020)}, year={2020} }

For further inquiry please contact:

Sarah Ostadabbas, PhD Electrical & Computer Engineering Department Northeastern University, Boston, MA 02115 Office Phone: 617-373-4992 [email protected] Augmented Cognition Lab (ACLab) Webpage: http://www.northeastern.edu/ostadabbas/

deeppbm's People

Contributors

amirfarnoosh avatar ostadabbas avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

deeppbm's Issues

Nan elbo loss

Hello,

thank you for your great job.

I have been training video 1 but after some epoch elbo loss becomes β€œnan”. My configuration is default (epoch number, batch size, learning rate. My environment is in miniconda, python 3.6, pytorch 1.0. Os is ubuntu 20.04, gpu is gtx 1080ti

I have one more pc that has the same hardware. I replicate the same experiment and this time elbo loss stays constant and after the training the resulting images are garbage.

what would you suggest me to do to have right training configuration?

Thanks
Fatih

Missing Code for Mask Extraction

There seems to be some code missing in MaskExtraction_BMC2012.m. See here.
The missing code should probably read the background model files to a matrix. It would be nice if you could add the missing piece.

Is it possibile to generalize the input and the output size?

Hi thank you for this amazing repo. I was wondering if it is possibile to generalize the method to train with any height and width without using a huge amount of memory?

Plus how are these parameters setted for different frame sizes:

feature_row = 13
feature_col = 18

?
Is it possibile to trai the VAE with smaller but changing number of samples each x epoch?

Thank you.

Data Set

From where I can download this dataset, BMC 2012 ?Thanks.

Generalization feature of VAE

Hello,

thank you for your great work. What I see on the network is that it memorizes the background model of the scene and even if I change inference input, it produces the background that was trained. Is it possible to make the network train for different scenes and expect it produce bg of the scenes that network has never seen before?

thank you
Fatih

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.