Git Product home page Git Product logo

3i4k's Introduction

3i4K

Intonation-aided intention identification for Korean

Requirements

fastText, Keras (TensorFlow), Numpy, Librosa
Currently available for python 3.5 and upper version is in implementation

Word Vector

Pretrained 100dim fastText vector

  • Download this and unzip THE .BIN FILE in the NEW FOLDER named 'vectors'
  • This can be replaced with whatever model the user employs, but it requires an additional training.

Dataset

(19.06.06) We provide train & validation, and test set separately here, for an easier Keras-based implementation.

The (train+validation) : test ratio is 9 : 1, and train : validation ratio is also 9 : 1 (thus, in total, 0.81 : 0.09 : 0.1).

The renewed version of the corpus is uploaded along with the models. May not be changed unless severe defect is observed.

(18.12.28) Dataset under modification!!

We've found a few misclassified utterances and undergoing modification, thus the true-final version will be disclosed before Fabrary. Pilot implemenation of the system (e.g., as tutorial) is less involved with this problem, but do not cite this dataset as a benchmark until Fabrary. The notice will be available as soon as possible.

(18.11.22) A final version of dataset and the new model is uploaded!

The next version will incoporate much more utterances and will be treated as a separate dataset.

FCI: A seven-class text corpus for the classification of conversation-style and non-canonical Korean utterances

  • F: Fragments (nouns, noun phrases, incomplete sentences etc.) (FRs)
  • C: Clear-cut cases (statements, questions, commands, rhetorical questions, rhetorical commands) (CCs)
  • I: Intonation-dependent utterances (IUs)

Corpus composition


  • IAA: kappa = 0.85 for Corpus 1
  • Data for FCI module is labeled in 0-6, split in train:test with ratio 9:1.
  • Available in data folder.

Block diagram


System Description

  • Easy start: Demonstration.exe
 python3 3i4k_demo.py 

Intention Identification

  • Given only a text input, the system classifies the input into one the aforementioned 7 categories. Available in demo.
  • Text classification is also available in demo; a corpus (input: filename without '.txt') can be categorized into 7 classes.
  • Available by importing module
 from classify import pred_only_text('sentence_you_choose') 
 from classify import classify_document('filename_you_choose') 

Speech Intention Understanding

  • Available by importing module
 from classify import pred_audio_text('speechfilename_you_choose', 'sentence_you_choose') 

Annotation Guideline and Acknowledgement

The annotation guideline (in Korean) was elaborately constructed by Won Ik Cho, with the great help of Ha Eun Park and Dae Ho Kook. Also, the authors appreciate Jong In Kim, Kyu Hwan Lee, and Jio Chung from SNU Spoken Language Processing laboratory (SNU SLP) for providing the useful corpus for the analysis. We note that this work was supported by the Technology Innovation Program (10076583, Development of free-running speech recognition technologies for embedded robot system) funded By the Ministry of Trade, Industry & Energy (MOTIE, Korea).

Citation

For the utilization of the word vector dictionary, cite the following:

@article{cho2018real,
	title={Real-time Automatic Word Segmentation for User-generated Text},
	author={Cho, Won Ik and Cheon, Sung Jun and Kang, Woo Hyun and Kim, Ji Won and Kim, Nam Soo},
	journal={arXiv preprint arXiv:1810.13113},
	year={2018}
}

For the utilization of the annotation guideline or the dataset, cite the following:

@article{cho2018speech,
	title={Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency},
	author={Cho, Won Ik and Lee, Hyeon Seung and Yoon, Ji Won and Kim, Seok Min and Kim, Nam Soo},
	journal={arXiv preprint arXiv:1811.04231},
	year={2018}
}

YouTube demo (non-audio-input version; for past submission)

https://youtu.be/OlvLlH8JgmM

3i4k's People

Contributors

warnikchow avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.