Git Product home page Git Product logo

fastmri-plus's Introduction

fastMRI+: Clinical pathology annotations for the fastMRI dataset

The fastMRI dataset is a publicly available MRI raw (k-space) dataset. It has been used widely to train machine learning models for image reconstruction and has been used in reconstruction challenges.

This repo includes clinical pathology annotations for this dataset. The entire knee dataset and approximately 1000 brain datasets have been labeled. The goal of providing these labels is to enable developers of image reconstruction models and algorithms to evaluate the performance of the developed techniques with a focus on the sections or regions that could contain clinical pathology.

Limitations

Each image has labeled by a single radiologist and without the benefit of looking at other views and angles of the same subject, and should therefore be considered in that context. Specifically, the labels should not be considered clinical ground truth or an exhaustive list of all lesions but rather an indicatition of where a pathology could be present.

Obtaining fastMRI raw data and images

The fastMRI raw data and reference images can be obtained from fastmri.org. You will be able to download and use the data for academic purposes after signing the data sharing agreement. If you are looking for automation for downloading the dataset and training fastMRI models, please see the InnerEye Deep Learning Toolkit.

Labeling procedure and generating DICOM images from fastMRI data

In order to label the data, DICOM files were generated from the fastMRI dataset, and we are providing a fastmri_to_dicom.py to document the procedure. This script can be used like this:

python fastmri_to_dicom.py --filename fastmridatafile.h5

Note: In the process of converting the images to DICOM, the pixel arrays were flipped (up/down) to provide a view that was closer to DICOM orientation and assist with labeling. This should be taken into consideration when using the labels.

The labeling was performed by experienced radiologists using MD.ai.

Working with the annotations

The Annotations folder contains a label file for each of the knee (knee.csv and brain (brain.csv datasets. The files contain one line for each annotation (bounding box) that was labeled by the radiologists. Datasets with no findings (no annotations) are not represented in the label files, however, you can see which files were reviewed in the brain_file_list.csv and knee_file_list.csv. If a dataset (a fastMRI file) is listed in the file lists but not in the label files, it means that it has been reviewed, but there were no findings.

The repo contains an example jupyter notebook, which illustrates how to read the labels and overlay them onto the image pixels.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

fastmri-plus's People

Contributors

hansenms avatar microsoftopensource avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

fastmri-plus's Issues

About the application of this data set in target detection

Hello, I have tried to convert the fast MRI Brain data set into PNG images in COCO format (I'm sure I flipped the images to align them with labels), and there are more than 3000 valid data.Then I called the open source target detection model for training, but the actual result was not ideal, the mAP could not exceed 0.1. I did some parameter experiments, but the result was not improved so far, so I began to doubt the quality and noise of the data set, and I hope to get confirmation from you

Label error in knee dataset?

Dear fastmri+ plus,
Thank you so much for contributing to the society, its super helpful, I just wanna point out that there is some error in one bounding box: the height of the bounding box is zero, FYI
file1002097,27,No,190,201,18,0,Cartilage - Partial Thickness loss/defect

Thank you!!

Looks like data lacks direction information

Dear fastmri-plus,
Thank you for such a large amount of annotation.
I tried to use these data but ran into some problems.
I found that the converted DICOM data does not have the correct view in ITK-SNAP.
Are these h5 data in the FAST-MRI (and the plus version) without direction and origin information?

Best Wishes.
Zixu.

h5 and DICOM relationship

Hi, is there any relationship between .h5 and .dcm files available in fastMRI dataset? I'd like to get DICOMs with all planes instead of Coronal plane, but I don't find any relationship between .h5 and .dcm names.

Best regards.
Alberto.

About the application of this data set in target detection

Hello, I have tried to convert the fast MRI Brain data set into PNG images in COCO format (I'm sure I flipped the images to align them with labels), and there are more than 3000 valid data.Then I called the open source target detection model for training, but the actual result was not ideal, the mAP could not exceed 0.1. I did some parameter experiments, but the result was not improved so far, so I began to doubt the quality and noise of the data set, and I hope to get confirmation from you

There are some errors in the label

Hello, we found some problems in the label of brain data, as shown in the picture below (the source of the picture is the 7th slice of file_brain_AXFLAIR_200_6002493 and the 8th slice of file_brain_AXT1POST_201_6002812 respectively), may I ask if it is a labeling error?And how are the criteria for annotation defined?
file_brain_AXFLAIR_200_6002493_007
file_brain_AXT1POST_201_6002812_008

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.