Git Product home page Git Product logo

particle-health-sandbox-environment's Introduction

Particle Health Sandbox

This GitHub Repository is published as a part of the Synthetic Health Data Challenge. This includes a Synthea synthetic patient population generator that outputs point-in-time CCDA files with in-document provider notes, as well as stores the original Synthea CCDA, FHIR, and Note files for reference. A CSV for Validation results as well as population demographics will also be output to the resulting folder.

Please visit https://www.particlehealth.com/ and sign up for a free account to interact with pre-loaded patient populations generated from this solution via our API and User Interface!

Requirements:

  • brew install jenv
  • os.system('brew install jenv')
  • os.system('jenv local 1.8')

Open Source Tools and Libraries Used:

Tools and Languages:

  • Python 3.8.3
  • jenv 1.8
  • Synthea (latest release)
  • HealthIT.gov’s open source CCDA 2.0 scorecard API (For Validation)

Python Libraries:

  • import os
  • import argparse
  • import sys
  • from difflib import get_close_matches
  • import re
  • import shutil
  • from shutil import copyfile
  • from datetime import date
  • import pandas as pd
  • import json
  • import requests
  • import urllib3
  • import jinja2
  • from jinja2 import Environment
  • from jinja2 import FileSystemLoader
  • from lxml import etree
  • from io import StringIO, BytesIO
  • import csv
  • import string

Usage:

Ensure you are working in the 'synthea-master' directory

Run:

python Synthea_Runner.py --condition {condition_of_interest} --num-patients {population}

Entry options for condition_of_interest include: ['covid19', 'diabetes', 'lung cancer', 'opioid addiction']

Recommended entry options for population are any numerical value 1 - 5000

Example:

  python Synthea_Runner.py --condition 'diabetes' --num-patients 50

Output:

A folder names with the condition specified, as well as the date of generation will be output

  • Example output folder: 'diabetes_output_2021-07-07'

The following sub-directories and files will be contained:

  1. generator_output: (main)
  • contains output point-in-time CCDA files with in-document provider notes
  1. synthea_ccda:
  • contains original synthea CCDA documents for reference
  1. synthea_fhir:
  • contains original synthea FHIR documents for reference
  1. synthea_notes:
  • contains original synthea txt note files for reference
  1. output_directory.csv:
  • CSV file containing demographics of the population created
  1. validation_results.csv:
  • CSV file containing the validation results for each point-in-time document generated (from HealthIT.gov’s open source CCDA 2.0 scorecard API)

particle-health-sandbox-environment's People

Contributors

pbannister123 avatar

Stargazers

Wai avatar

Watchers

Garrett Montague avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.