Git Product home page Git Product logo

hsd-dataset's Introduction

HSD: A hierarchical singing annotation dataset

This repository provides a singing annotation dataset that records vocal information in pop songs. It mainly labels pitch, duration, lyric, onset, and offset of each musical note. Meanwhile, all the information is recorded in a hierarchical structure.

Annotations

Two kind of annotations are offered: enhanced LRC and MIDI. The enhanced LRC annotations are recommended because the singing information is recorded in a hierarchical structure.

enhanced LRC

The enhanced LRC files are in the "enhanced_lrc" folder. Each line in an enhanced LRC file records the vocal information of a music phrase. Each line is in the format:

[phrase time tag]<onset time tag>lyric pitch duration{offset time tag}<onset time tag>lyric pitch duration{offset time tag}...<onset time tag>lyric pitch duration{offset time tag}

"read_enhanced_lyric.py" can be used to read the annotations.

MIDI

The annotation MIDIs are also provided in the "midi" folder.

Label Initialization

The midi labels are initialized by music notation and LRC files. The corrected musical notation and LRC files are in folder "notation" and "lrc".

"initialize_label.py" can be used to create coarse labels.

Manual Label Calibration

All the labels are calibrated via a manually process. Annotators correct the time tags in the LRC files to calibrate the whole song.

Meanwhile, we provide a method that directly converting the enhanced LRC files to MIDI files. "elrc2midi.py" can be used to execute this process.

Raw Audio

The youtube links of all the raw audio are recorded in "youtubeLinks.txt".

"download.py" can be used to get the raw audio.

hsd-dataset's People

Contributors

hirabarahyt avatar noyami2033 avatar

Stargazers

 avatar Carlneil Domkam avatar  avatar Gong Junmin avatar Yi-Hsuan Yang avatar Gerald Golka avatar

Watchers

Carlneil Domkam avatar  avatar

Forkers

cdomkam

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.