Git Product home page Git Product logo

data_oral_abaza_corpus's Introduction

Spoken corpus of Abaza Data Repository

DOI

This repository is the place where the data from the Spoken corpus of Abaza is curated. This repository also provides an alternative way to access corpus data locally. The data is stored in data_oral_abaza_corpus.csv with 4094 rows and 13 columns:

  • filename
  • time_start
  • time_end
  • speaker
  • sentence_id
  • text
  • translation
  • word_forms
  • morphonology
  • gloss
  • language
  • dataset_creator
  • dataset_provider

About corpus

The corpus contains oral texts of the Tapanta dialect of the Abaza language. Recording were made during a joint HSE University / RSUH expeditions to the village of Inzhich-Chukun in the Abazinsky district of the Karachay-Cherkess Republic in 2017-2019. Text analysis and glossing was done by the participants in the research and study group “Aspects of Abaza Grammar” and the RSF grant # 17-18-01184 “Communicative organization of natural discourse in spoken and signed languages.” The search function entered a closed testing regime in December 2019.

How to cite the corpus and the data

If you use data from the Spoken corpus of Abaza in your research, please cite as follows:

Anastasia Panova, Anna Sorokina, Peter Arkadiev, Elena Sokur. Spoken corpus of Abaza. Moscow: School of Linguistics, HSE University; Linguistic Convergence Laboratory, HSE University. (Available online at: http://lingconlab.ru/spoken_abaza/, accessed on ...)

You may contact with questions about the Corpus data or leave an issue in this repository:

[email protected] (Anastasia Panova)

You may contact with questions about the search platform or leave an issue in its own repository:

[email protected] (Elena Sokur)

data_oral_abaza_corpus's People

Contributors

agricolamz avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.