Open Healthcare Data Hack

Code produced at the Open Healthcare Data Hack, 13-14 December 2014.

The Python script preprocesses the MIMIC II PhysioNet dataset. Each patient is represented as one row of a CSV file with the following features:

  • Survival - OUTPUT VARIABLE (0: survived, or 1: not survived)
  • Age (years)
  • Gender (0: female, or 1: male)
  • Height (cm)
  • ICUType (1: Coronary Care Unit, 2: Cardiac Surgery Recovery Unit, 3: Medical ICU, or 4: Surgical ICU)
  • Weight (kg)

The dataset also contains 36 temporal attributes, each of which may be measured several times over a 48-hour period:

  • Albumin (g/dL)
  • ALP [Alkaline phosphatase (IU/L)]
  • ALT [Alanine transaminase (IU/L)]
  • AST [Aspartate transaminase (IU/L)]
  • Bilirubin (mg/dL)
  • BUN [Blood urea nitrogen (mg/dL)]
  • Cholesterol (mg/dL)
  • Creatinine [Serum creatinine (mg/dL)]
  • DiasABP [Invasive diastolic arterial blood pressure (mmHg)]
  • FiO2 [Fractional inspired O2 (0-1)]
  • GCS [Glasgow Coma Score (3-15)]
  • Glucose [Serum glucose (mg/dL)]
  • HCO3 [Serum bicarbonate (mmol/L)]
  • HCT [Hematocrit (%)]
  • HR [Heart rate (bpm)]
  • K [Serum potassium (mEq/L)]
  • Lactate (mmol/L)
  • Mg [Serum magnesium (mmol/L)]
  • MAP [Invasive mean arterial blood pressure (mmHg)]
  • MechVent [Mechanical ventilation respiration (0:false, or 1:true)]
  • Na [Serum sodium (mEq/L)]
  • NIDiasABP [Non-invasive diastolic arterial blood pressure (mmHg)]
  • NIMAP [Non-invasive mean arterial blood pressure (mmHg)]
  • NISysABP [Non-invasive systolic arterial blood pressure (mmHg)]
  • PaCO2 [Partial pressure of arterial CO2 (mmHg)]
  • PaO2 [Partial pressure of arterial O2 (mmHg)]
  • pH [Arterial pH (0-14)]
  • Platelets (cells/nL)
  • RespRate [Respiration rate (bpm)]
  • SaO2 [O2 saturation in hemoglobin (%)]
  • SysABP [Invasive systolic arterial blood pressure (mmHg)]
  • Temp [Temperature (°C)]
  • TropI [Troponin-I (μg/L)]
  • TropT [Troponin-T (μg/L)]
  • Urine [Urine output (mL)]
  • WBC [White blood cell count (cells/nL)]
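One way to hold these measurements before deriving features is a time-ordered series per attribute. The sketch below is illustrative only: the `(hours, attribute, value)` record layout and the `group_by_attribute` helper are assumptions made for this example, not the format or code used by the original script.

```python
from collections import defaultdict


def group_by_attribute(records):
    """Group (hours, attribute, value) records into per-attribute series.

    The record layout is hypothetical and chosen only for illustration.
    """
    series = defaultdict(list)
    for hours, attribute, value in records:
        if 0.0 <= hours <= 48.0:  # keep only the 48-hour window
            series[attribute].append((hours, value))
    for points in series.values():
        points.sort()  # order each series by measurement time
    return dict(series)


# Made-up records for a single patient.
records = [
    (2.0, "HR", 84.0),
    (0.5, "HR", 80.0),
    (1.0, "Temp", 37.2),
]
series = group_by_attribute(records)
print(series["HR"])
```

Attributes with no measurements simply do not appear in the result, so downstream code has to decide how to represent missing values.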

For EACH of these attributes, the following features are derived:

  • Mean
  • Standard deviation
  • Averaged gradient (mean of the slopes of the lines connecting each pair of consecutive measurements)
  • Standard deviation of gradients (standard deviation of the slopes of the lines connecting each pair of consecutive measurements)
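The four derived features can be sketched as follows. This is a minimal illustration rather than the original script: the function names, the choice of population standard deviation, skipping pairs recorded at the same time, and returning `None` when a feature cannot be computed are all assumptions.

```python
from statistics import mean, pstdev


def consecutive_slopes(times, values):
    """Slopes of the lines joining each pair of consecutive measurements."""
    slopes = []
    for i in range(len(values) - 1):
        dt = times[i + 1] - times[i]
        if dt != 0:  # assumption: skip pairs recorded at the same time
            slopes.append((values[i + 1] - values[i]) / dt)
    return slopes


def derive_features(times, values):
    """Return the four derived features for one attribute of one patient.

    Assumptions (not taken from the original script): population standard
    deviation is used, and a feature is None when it cannot be computed.
    """
    slopes = consecutive_slopes(times, values)
    return {
        "mean": mean(values) if values else None,
        "std": pstdev(values) if values else None,
        "mean_gradient": mean(slopes) if slopes else None,
        "std_gradient": pstdev(slopes) if slopes else None,
    }


# Made-up heart-rate readings: times in hours, values in bpm.
features = derive_features([0.0, 1.0, 3.0], [80.0, 84.0, 80.0])
print(features)
```

Here the two slopes are 4 bpm/hour and -2 bpm/hour, so the averaged gradient is 1 and the standard deviation of gradients is 3. An attribute measured only once has a mean and a standard deviation of zero, but no gradient features.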

Contributors

  • bojanvujatovic
