Git Product home page Git Product logo

win-fail-action-recognition's Introduction

Win-Fail-Action-Recognition (WACV Workshops, 2022)

Introduction

Current video/action understanding systems have demonstrated impressive performance on large recognition tasks. However, they might be limiting themselves to learning to recognize spatiotemporal patterns, rather than attempting to thoroughly understand the actions. To spur progress in the direction of a truer, deeper understanding of videos, we introduce the task of win-fail action recognition -- differentiating between successful and failed attempts at various activities. We introduce a first of its kind paired win-fail action understanding dataset with samples from the following domains: "General Stunts," "Internet Wins-Fails," "Trick Shots," and "Party Games." Unlike existing action recognition datasets, intra-class variation is high making the task challenging, yet feasible. We systematically analyze the characteristics of the win-fail task/dataset with prototypical action recognition networks and a novel video retrieval task. While current action recognition methods work well on our task/dataset, they still leave a large gap to achieve high performance. We hope to motivate more work towards the true understanding of actions/videos.

Dataset

Dataset can be downloaded from: https://drive.google.com/drive/folders/1TEuOeOJwN0ehCdNkI0TAPfqWOR4LHImH?usp=sharing. Please use 7-zip to uncompress the dataset.

Annotation format

  1. All the samples are pairwise -- win and the corresponding fail.
  2. 'g' stands for a good execution or win; while 'b' stands for a bad execution or failure.
  3. 'label_0', 'start_0', 'stop_0' belong to clip_0 (win or fail) of a paired sample. Likewise, 'label_1', 'start_1', 'stop_1' belong to clip_1 (opposite of clip_0) of a paired sample.
  4. Sample annotation explanation:

win-fail action recognition annotation sample

Please feel free to reach out to me if you face any problem or have any questions.

If you find this dataset useful, please consider citing:

@inproceedings{parmar2022win,
  title={Win-Fail Action Recognition},
  author={Parmar, Paritosh and Morris, Brendan},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={161--171},
  year={2022}
}

win-fail-action-recognition's People

Contributors

paritoshparmar avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

hadryan

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.