crcnn-action's Introduction

Context-aware RCNNs: a Baseline for Action Detection in Videos

Source code for the following paper (arXiv link):

Context-aware RCNNs: a Baseline for Action Detection in Videos
Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu
in ECCV 2020

Our implementation is based on Video-long-term-feature-banks.

Prepare dataset

Please follow the LFB instructions to prepare the AVA dataset.

Prepare environment

Please follow the LFB instructions to set up the Caffe2 environment.

Download pre-trained weights

Please download the R50-I3D-NL weights and put them in the [code root]/pretrained_weights folder.
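As a minimal sketch of the step above, the folder can be created and checked from the shell. CODE_ROOT is a hypothetical variable standing in for [code root] (it defaults to the current directory here), and the download itself is still done by hand from the R50-I3D-NL link.

```shell
# Assumption: CODE_ROOT stands in for "[code root]"; it defaults to the current directory.
CODE_ROOT="${CODE_ROOT:-$PWD}"
WEIGHTS_DIR="$CODE_ROOT/pretrained_weights"

# Create the folder the README expects (no error if it already exists).
mkdir -p "$WEIGHTS_DIR"

# Report whether any weight file has been placed there yet.
if [ -z "$(ls -A "$WEIGHTS_DIR")" ]; then
  echo "No weights found in $WEIGHTS_DIR; download R50-I3D-NL and place it there."
else
  ls -l "$WEIGHTS_DIR"
fi
```

Running this from the repository root should leave an empty pretrained_weights folder ready for the downloaded file.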

Train a baseline model without scene feature and long-term feature

Run:

bash train_baseline.sh configs/avabox_r50_baseline_32x2_scale1_5.yaml

Train a model with scene feature

Run:

bash train_baseline.sh configs/avabox_r50_baseline_16x4_scale1_5_withScene.yaml

Train a model with scene feature and long-term feature

Stage 1. Train a baseline model that will be used to infer the LFB:

bash train_baseline.sh configs/avabox_r50_baseline_16x4_scale1_5.yaml

Stage 2. Train a model with scene feature and LFB:

bash train_lfb.sh configs/avabox_r50_lfb_win60_L3_16x4_withScene.yaml [path to baseline model weights from Stage 1]
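The two stages can be wrapped in small shell functions so that the second stage cannot be started without a weights path. This is only a sketch: run, stage1, stage2, and DRY_RUN are hypothetical helpers that are not part of the repository, and the location of the baseline weights depends on your own training output, so it must be passed in explicitly. By default the commands are printed rather than executed.

```shell
# Hypothetical helper: print each command unless DRY_RUN=0 is set.
DRY_RUN="${DRY_RUN:-1}"
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Stage 1: train the baseline model that is later used to infer the LFB.
stage1() {
  run bash train_baseline.sh configs/avabox_r50_baseline_16x4_scale1_5.yaml
}

# Stage 2: train with scene feature and LFB.
# Takes exactly one argument: the path to the baseline weights from Stage 1.
stage2() {
  if [ "$#" -ne 1 ] || [ -z "$1" ]; then
    echo "usage: stage2 <path to baseline model weights>" >&2
    return 1
  fi
  run bash train_lfb.sh configs/avabox_r50_lfb_win60_L3_16x4_withScene.yaml "$1"
}
```

After sourcing the functions, call stage1, locate the resulting weights file, then call stage2 with that path. Set DRY_RUN=0 to actually launch training.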

crcnn-action's Issues

Prepare JHMDB dataset

Hi,

Thanks for your open-source work.

Are there instructions for preparing the JHMDB dataset?
