Git Product home page Git Product logo

v3c1-asr's Introduction

V3C Transcripts

The contents of this repository are superseded by a new dataset containing transcripts of the entire V3C collection, generated using whisper. You can find this new dataset on zenodo and an accompanying analysis of the language content of V3C on arXiv.

V3C1 ASR

This repository contains transcripts for the videos of the V3C1, the first shard of the Vimeo Creative Commons Collection. These transcripts have been generated using the public Google Cloud Speech-to-Text API set to use English. The results are stored in one file per video with the video id as file name. The files are encoded as JSON maps where the key refers to the shot number within the video and the value contains all the words spoken during this shot. For videos without any detected English speech, no file is present. All data is provided without any correctness guarantees. If you use the data provided in this repository, please cite the following corresponding publication:

Bibtex

@inproceedings{vitrivrvbs2019,
  title={Deep Learning-Based Concept Detection in vitrivr},
  author={Rossetto, Luca and Parian, Mahnaz Amiri and Gasser, Ralph and Giangreco, Ivan and Heller, Silvan and Schuldt, Heiko},
  booktitle={International Conference on Multimedia Modeling},
  pages={616--621},
  year={2019},
  organization={Springer}
}

v3c1-asr's People

Contributors

lucaro avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.