Git Product home page Git Product logo

covid19-prediction's Introduction

Interpretable Machine Learning for COVID-19 Diagnosis Through Clinical Variables

The main goal is to propose an efficient and also transparent/interpretable ML solution for the diagnosis of suspicious COVID-19 cases, based on some clinical variables.

Secondary goal will be to eliminate least important features while keeping the preditive power of the solution.

Classification algorithms utilized in the solution:

  • Random Forest
  • Logistic Regression
  • Support Vector Machine

Interpretation algorithms:

  • SHAP
  • Regression weights

Original dataset comes from a Kaggle Competition held by Einstein Data4u. It can be found at: https://www.kaggle.com/dataset/e626783d4672f182e7870b1bbe75fae66bdfb232289da0a61f08c2ceb01cab01/tasks?taskId=645

Specific steps adopted on this novel:

  • Remove the least amount of rows and columns in order to eliminate "holes" of missing values in the dataset.
  • Execute a Grid Search with Cross Validation to tune models hyperparameters and obtain preliminare results.
  • Execute a Recursive Feature Elimination (RFE) with Cross Validation. Test models after RFE-CV without tuning hyper parameters.
  • Choose variables to eliminate and execute another Grid Search with remaining variables.
  • Visualize feature weights with feature_importance of linear models and SHAP values of non-linear models.

About the original paper

Link: https://doi.org/10.48011/asba.v2i1.1590

Abstract:

This work proposes an interpretable machine learning approach to diagnose suspected COVID-19 cases based on clinical variables. Results obtained for the proposed models have F-2 measure superior to 0.80 and accuracy superior to 0.85. Interpretation of the linear model feature importance brought insights about the most relevant features. Shapley Additive Explanations were used in the non-linear models. They were able to show the difference between positive and negative patients as well as offer a global interpretability sense of the models.

If you enjoy this work, please cite as :

@article{thimoteo_vellasco_amaral_figueiredo_yokoyama_marques_2020, title={Interpretable Machine Learning for COVID-19 Diagnosis Through Clinical Variables}, DOI={10.48011/asba.v2i1.1590}, journal={Anais do Congresso Brasileiro de Automática 2020}, author={Thimoteo, Lucas M. and Vellasco, Marley M. and Amaral, Jorge M. Do and Figueiredo, Karla and Yokoyama, Cátia Lie and Marques, Erito}, year={2020}, month={Dec}}

covid19-prediction's People

Contributors

jamaral68 avatar lucasthim avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.