p3d-ctn's Introduction

P3D-CTN

The P3D-CTN is a framework for spatio-temporal action detection. It integrates the frame-based two-dimensional convolutional module with the video-based P3D convolutional module.

Detection proceeds in two steps. First, the P3D-module generates tube proposals. Then the 2D-module produces box proposals based on those tube proposals.
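As a rough illustration of the data flow between the two steps, the sketch below assumes a tube proposal is a start frame plus one box per consecutive frame, and groups those boxes by frame index so that a frame-level module could consume them. The tube representation and the name `tubes_to_frame_boxes` are hypothetical and are not taken from this repository.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Assumed box layout: (x1, y1, x2, y2) in pixel coordinates.
Box = Tuple[float, float, float, float]
# Assumed tube layout: (start_frame, one box per consecutive frame).
Tube = Tuple[int, List[Box]]


def tubes_to_frame_boxes(tubes: List[Tube]) -> Dict[int, List[Box]]:
    """Group the boxes of every tube proposal by the frame they belong to."""
    per_frame: Dict[int, List[Box]] = defaultdict(list)
    for start_frame, boxes in tubes:
        for offset, box in enumerate(boxes):
            per_frame[start_frame + offset].append(box)
    return dict(per_frame)


if __name__ == "__main__":
    example_tubes = [
        (0, [(0, 0, 10, 10), (1, 1, 11, 11)]),  # covers frames 0 and 1
        (1, [(5, 5, 20, 20)]),                  # covers frame 1 only
    ]
    for frame, boxes in sorted(tubes_to_frame_boxes(example_tubes).items()):
        print(frame, boxes)
```

How the actual 2D-module consumes tube proposals may differ; this only shows one plausible way to hand per-frame boxes from one stage to the next.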

Installation

  1. Follow the standard Caffe installation instructions.

  2. Run the P3D-module setup.py to build the base environment:

     python P3D-module/setup.py
    

Datasets

Download the three benchmark datasets (JHMDB, UCF101, UCFSports).
Use the scripts in P3D-module/datasets to convert the data into the format used for training.

P3D-module

Training

P3D_cls_train.sh and P3D_loc_train.sh are used for training the P3D-module:

    sh P3D-module/P3D_cls_train.sh
    sh P3D-module/P3D_loc_train.sh

Testing

P3D_cls_eval.py and P3D_loc_eval.py are used for testing the P3D-module:

    python P3D-module/P3D_cls_eval.py
    python P3D-module/P3D_loc_eval.py

2D-module

Training

ll.sh (JHMDB), ll_101.sh (UCF101), and ll_sports.sh (UCFSports) are used for training the 2D-module:

    sh 2D-module/ll.sh
    sh 2D-module/ll_101.sh
    sh 2D-module/ll_sports.sh

Evaluating

jhmdb_eval.py, ucf101_eval.py, and ucfsports_eval.py in 2D-module/action_tools are used for evaluating the 2D-module (frame-AP, video-AP):

    python 2D-module/action_tools/jhmdb_eval.py --proto 2D-module/models/JHMDB/VGG_16/test_1.prototxt --net 2D-module/output/faster_rcnn_end2end/JHMDB_RGB_1_split_0/RGB_1_VGG_16_iter_70000.caffemodel --imdb JHMDB_RGB_1_split_0 --out 2D-module/action_results/jhmdb.pkl
    python 2D-module/action_tools/ucf101_eval.py --proto 2D-module/models/UCF101/VGG_16/test_1.prototxt --net 2D-module/output/faster_rcnn_end2end/UCF101_RGB_1_split_0/RGB_1_VGG_16_iter_100000.caffemodel --imdb UCF101_RGB_1_split_0 --out 2D-module/action_results/ucf101.pkl
    python 2D-module/action_tools/ucfsports_eval.py --proto 2D-module/models/UCFSports/VGG_16/test_1.prototxt --net 2D-module/output/faster_rcnn_end2end/UCFSports_RGB_1_split_0/RGB_1_VGG_16_iter_70000.caffemodel --imdb UCFSports_RGB_1_split_0 --out 2D-module/action_results/ucfsports.pkl
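Both frame-AP and video-AP rest on an overlap test between a detection and the ground truth. The sketch below shows the overlap measures commonly used for these metrics: box IoU for a single frame, and a spatio-temporal tube IoU computed as temporal overlap times mean spatial IoU over the shared frames. It assumes boxes are `(x1, y1, x2, y2)` with continuous coordinates and tubes are dictionaries mapping frame index to box. The function names are hypothetical, and the evaluation scripts in this repository may use different conventions (for example, a +1 pixel offset when computing areas).

```python
from typing import Dict, Tuple

Box = Tuple[float, float, float, float]  # assumed layout: (x1, y1, x2, y2)


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tube_iou(tube_a: Dict[int, Box], tube_b: Dict[int, Box]) -> float:
    """Spatio-temporal IoU: temporal IoU times mean box IoU on shared frames."""
    shared = set(tube_a) & set(tube_b)
    if not shared:
        return 0.0
    all_frames = set(tube_a) | set(tube_b)
    temporal = len(shared) / len(all_frames)
    spatial = sum(box_iou(tube_a[f], tube_b[f]) for f in shared) / len(shared)
    return temporal * spatial


if __name__ == "__main__":
    print(box_iou((0, 0, 10, 10), (5, 0, 15, 10)))
    box = (0, 0, 10, 10)
    print(tube_iou({0: box, 1: box}, {1: box, 2: box}))
```

A detection typically counts as a true positive when its class label matches and the overlap exceeds a chosen threshold; average precision is then computed from the ranked detections.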

p3d-ctn's People

Contributors

roundchuan

p3d-ctn's Issues

Running Error

Thank you for your code. I ran into some problems when running P3D-module/setup.py, which I suspect are caused by an unsuitable environment or mismatched dependencies.
Could you share the environment and dependencies you used to build the project?
