Git Product home page Git Product logo

cmcqa's Introduction

CMCQA

A large Chinese Medical CQA

CMCQA is a huge conversational question-and-answer data set for the Chinese medical field. It is collected from the Chinese medical conversational question answering website ChunYu, and has medical conversational materials in 45 departments, such as andrology, stormotologry, gynaecology and obstetrics. Specifically, CMCQA has 1.3 million complete sessions or 19.83 million statements or 0.65 billion tokens. At the same time, we further open source all data to promote the development of related fields of conversational question answering in the medical field.

CMCQA是**医学领域一个庞大的会话问答数据集。它来自**医学对话问答网站春雨,在男科、耳科、妇产科等45个科室拥有医学对话材料。具体而言,CMCQA拥有130万个完整会话或1983万条语句或6.5亿个令牌,总容量2.84GB。同时,我们进一步开放所有数据源,促进医学领域对话式问答相关领域的发展。

You can find our data in Google drive

你可以从百度网盘中下载数据集

引用:

@misc{CMCQA,
  title={A large Chinese Medical CQA},
  author={Yixuan Weng},
  howpublished={\url{https://github.com/WENGSYX/CMCQA}},
  year={2022}
}

cmcqa's People

Contributors

wengsyx avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.