Git Product home page Git Product logo

Comments (2)

rcuocolo avatar rcuocolo commented on August 23, 2024

Hello, thank you for your interest in the repository.
Regarding your questions:

  1. As stated in the readme file, currently I would suggest converting the PROSTATEx DICOM files into NIFTI and using the NIFTI masks. This is the process we actually followed and I then transposed the NIFTI-coordinate masks to the original DICOM space. As can be seen in the other open issue, this gave some problems with the DWI/ADC masks as the array appears to be still partially "flipped". Using dcm2niix (using the patient name, sequence name and number options) the output files should be identical to those we used and there would be no geometry issues. NIFTI and NRRD should be convertible from one format to another (for example using 3D Slicer, which reads both) but I do not see a particularly good reason to prefer one over the other.
  2. The class labels are available both in terms of clinically significant (GGG 2 or higher) lesions or not on the whole dataset as well as specific GGG labels for the PROSTATEx2 dataset. You can simply pair the csv files to the extracted radiomic database using patient names and lesion number as the indices. I usually employ the pandas package for similar tasks. Whether you choose to use the entire PROSTATEx dataset or only PROSTATEx2, I would suggest keeping the GGG 2 or higher threshold to dichotomize the classes. There is some debate on classifying GGG 2 lesions as "intermediate" compared to 3 or higher, but I would not use this approach without good domain knowledge in the team (this choice has to be appropriately supported by the study aim and you should be ready to address this point during a hypothetical peer review process).
  3. Among currently available software for radiomic data extraction, I prefer PyRadiomics. It is fairly simple to use. You can find details of the parameters I usually employ in prostate MRI described in literature (e.g., doi: 10.1007/s00330-021-07856-3, open access, contains the parameters file in the supplementary materials). For the analysis in and of itself, there is a wide range of tools, from R (e.g., caret), Python (e.g., scikit-learn) tools or GUI-based software (e.g., WEKA, KNIME, RapidMiner). I usually employ scikit-learn or WEKA with some R (mainly for statistics or metrics).

from prostatex_masks.

rcuocolo avatar rcuocolo commented on August 23, 2024

I would also suggest reading the pinned Issue #5 as it contains some insights on the dataset that could be useful for you.

from prostatex_masks.

Related Issues (19)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.