PriMock57

This repository contains the data and annotations described in the two papers listed under Citing below.

The dataset consists of 57 mock medical primary care consultations held over 5 days by 7 Babylon clinicians and 57 Babylon employees acting as patients, using case cards with presenting complaints, symptoms, medical & general history etc. The data in this repository includes:

  1. Audio recordings of the consultations (audio folder);
  2. Manual utterance-level transcriptions of the recordings (transcripts folder);
  3. Consultation notes written by the consulting clinicians (notes folder);
  4. Human evaluation annotations & data (human_eval_data folder).

The scripts folder includes data transformation scripts (utterance extraction, transcript collation, etc.).

More detailed descriptions are in each folder's README.md file.

How to clone

Due to their size, the audio files are stored using Git Large File Storage (https://git-lfs.github.com/). To clone the repository:

  1. Install Git LFS using the link above. For Mac, you can use Homebrew: brew install git-lfs
  2. Set up Git LFS for your user account: git lfs install
  3. You can now clone this repository: git clone https://github.com/babylonhealth/primock57.git
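
The steps above can be sketched as a small shell function. This is a minimal sketch, not part of the repository: the function name clone_primock57 and the optional target-directory argument are illustrative assumptions, and it expects Git LFS to have been installed already (step 1). Only the git lfs install and git clone commands come from the instructions above.

```shell
#!/bin/sh
# Sketch of the clone steps, wrapped in a function so nothing runs on load.
# The function name and the optional target-directory argument are hypothetical.
clone_primock57() {
    target_dir="${1:-primock57}"

    # Step 1 must already be done: stop early if Git LFS is not available.
    if ! git lfs version >/dev/null 2>&1; then
        echo "Git LFS not found; install it first (on Mac: brew install git-lfs)." >&2
        return 1
    fi

    # Step 2: set up Git LFS for the current user account.
    git lfs install || return 1

    # Step 3: clone the repository; the audio files are fetched through LFS.
    git clone https://github.com/babylonhealth/primock57.git "$target_dir"
}

# Usage:
#   clone_primock57            # clones into ./primock57
#   clone_primock57 my_copy    # clones into ./my_copy
```

Checking for Git LFS before cloning avoids ending up with a working tree in which the audio files are only small LFS pointer files.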

Citing

@inproceedings{korfiatis2022primock57,
  title={PriMock57: A Dataset Of Primary Care Mock Consultations},
  note={In press},
  author={Papadopoulos Korfiatis, Alex and Moramarco, Francesco and Sarac, Radmila and Savkov, Aleksandar},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics},
  year={2022}
}
@inproceedings{moramarco2022human,
  title={Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation},
  note={In press},
  author={Moramarco, Francesco and Papadopoulos Korfiatis, Alex and Perera, Mark and Juric, Damir and Flann, Jack and Reiter, Ehud and Belz, Anya and Savkov, Aleksandar},
  booktitle={Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics},
  year={2022}
}
