Git Product home page Git Product logo

local_control_prediction's Introduction

Local-Control-Prediction

Team

Anish Mukherjee, Young Zeng

Background

We worked on this project during our internship at the University of California, San Francisco. The main task for this project was to predict local control from three different types of data: image, clinical(tabular), and radiomic(tabular) data.

Local control: cancer growth after stereotactic radiosurgery(binary target)

Dataset

Image data 8461 3D MRI images with brain tumors and their masks.
Clinical data 8407 tableau dataset with clinical features.
Radiomics data 8407 tableau dataset with radiomics features.


Methodology

We fit CNN models on the 3D MRI scans, XGBoost, and Random Forest models on the tabular data.

The imaging data was around ~ 240GB, and the model took 2-3 hours to train per epoch. The solution was to load the images onto RAM and use PyTrotchs distributed multi GPU training, which sped up execution by approximately 90%

Since the data size was large, we had memory constraints. To overcome this we decided to convert each pixel value from float32 to float16 and cropped out the borders of the image, reducing the size of the image by 70% facilatating the loading of the images onto RAM

Results

Image Models

We tried different architectures of Convolutional Neural Network model such as UNet, ResNet 50.

Architecture Best Accuracy Best Area under ROC
ResNet50 0.8412 0.8359

Models for tabular data

Since both clinical and radiomics data are tabular, it would be easier for us to model them together. We used gradient boosting model from sci-kit learn, xgboost, and random forest models.

Clinical Data

Architecture Best Accuracy Best Area under ROC
XGBoost 0.8347 0.7932
Random Forest 0.7714 0.7214

Radiomic Data

Architecture Best Accuracy Best Area under ROC
XGBoost 0.553 0.591
Random Forest 0.521 0.572

Clinical + Radiomic Data

Architecture Best Accuracy Best Area under ROC
XGBoost 0.73 0.705
Random Forest 0.692 0.683

Conclusion

The imaging model performed best on the validation data, for the next steps we could train a neural network feeding in both the clinical and the imaging data, with the imaging features extracted from the last layer of the ResNet

local_control_prediction's People

Watchers

Anish Mukherjee avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.