Git Product home page Git Product logo

ava-dataset-tool's Introduction

ava-dataset-tool

AVA Dataset

The AVA Dataset is a newly exciting dataset for action detection and localization.

I will put a few tools here that may be useful for future development.

Prerequisites

  1. FFMPEG
  2. OpenCV

Download

Following CVDF's latest instruction, we are able to download the AVA v2.1 train/val videos with their urls.

$ cd ava-dataset-tool/video/trainval
$ ./download.sh

Download the test videos with their urls.

$ cd ava-dataset-tool/video/test
$ ./download.sh

Download annotations

The trainval annotations can be downloaded by

$ cd ava-dataset-tool
$ wget https://s3.amazonaws.com/ava-dataset/annotations/ava_v2.1.zip
$ unzip ava_v2.1.zip

Training data visualization

Data preprocessing is important for network training. Previously I tried OpenCV, but it is challenging to extract the exact keyframe maybe due to different codec settings. Currently we extract the video clips and the keyframes using ffmpeg. We visualize the bboxes for each keyframe in a new folder preproc. Note that this script extracts keyframes, bboxes, and the 3-second video clips from training set.

$ cd ava-dataset-tool
$ python3 extract_keyframe.py

If you would like to extract video clips and keyframes from the validation set or test set, you may need slightly adjust the corresponding path in extract_keyframe.py.

Stay tuned..

ava-dataset-tool's People

Contributors

kevinlin311tw avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.