
Enhancing Brain Tumor Segmentation in Multi-modal MRI Images

This repository contains the code implementation of my Master's dissertation project at Brunel University London. The project develops methods for automated segmentation of brain tumors from multimodal MRI scans by integrating attention mechanisms, multi-scale deep supervision, multi-objective loss functions, and Swin Transformers into the traditional CNN-based U-Net architecture. Key challenges addressed include localizing indistinct tumor boundaries, handling class imbalance in medical datasets, and efficiently capturing global context.

This README provides an overview of the code and the models developed for this project. For a comprehensive understanding of the methodologies, theory, and detailed analysis, refer to the Code Overview section at the end of this file and to the comments in the code itself. If you need further help or would like to take this study forward, I would be pleased to support you.

Dataset

We use the Brain Tumor Segmentation (BraTS) 2023 dataset, which provides T1, T1-contrast-enhanced, T2, and FLAIR MRI scans with expert annotations for 1251 cases.


Model Architectures

  1. AR2B-UNet: A 3D U-Net model enhanced with attention blocks. This architecture is designed to improve segmentation accuracy by focusing on the relevant features in the MRI scans.

*(Figure: AR2B-UNet architecture)*

  2. AR2B-DeepSup-UNet: An extension of the base AR2B-UNet model enhanced with deep supervision, which facilitates training deeper networks by mitigating the vanishing-gradient problem and improving feature learning at multiple scales.

*(Figure: AR2B-DeepSup-UNet architecture)*

  3. Swin-AR2B-DeepSup-UNet: This model integrates Swin Transformers into the U-Net encoder. The Swin Transformer is designed to capture global context more effectively, which is crucial for accurate segmentation of complex medical images such as MRI scans.

*(Figure: Swin-AR2B-DeepSup-UNet architecture)*

Key Architectural Blocks

  1. Convolutional and feature refinement blocks to extract hierarchical features from the input.
  2. Attention blocks to focus on the relevant features before passing them to the decoder (a minimal sketch follows this list).
  3. A Swin Transformer block for capturing global context.
  4. Max pooling and transpose convolution layers for downsampling and upsampling the feature maps, respectively.
  5. 3D convolution layers for interpolation.
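
To make the role of the attention blocks concrete, below is a minimal sketch of a 3D additive attention gate in the style of Attention U-Net (Oktay et al.). The layer names and sizes are my own illustration, not the exact block from models.py:

```python
import torch.nn as nn

class AttentionGate3D(nn.Module):
    """Additive attention gate for 3D feature maps (sketch).

    Gates the encoder skip connection `x` with the coarser decoder
    signal `g`, suppressing spatially irrelevant features.
    """

    def __init__(self, in_channels, gating_channels, inter_channels):
        super().__init__()
        self.theta_x = nn.Conv3d(in_channels, inter_channels, kernel_size=1)
        self.phi_g = nn.Conv3d(gating_channels, inter_channels, kernel_size=1)
        self.psi = nn.Conv3d(inter_channels, 1, kernel_size=1)
        self.relu = nn.ReLU(inplace=True)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x, g):
        # x and g are assumed to share spatial size here; in practice g is
        # usually upsampled (or x strided) before the addition.
        att = self.relu(self.theta_x(x) + self.phi_g(g))
        att = self.sigmoid(self.psi(att))  # (B, 1, D, H, W), values in [0, 1]
        return x * att                     # re-weighted skip connection
```

The gate learns a per-voxel weight from the decoder's gating signal and multiplies it into the skip connection, so only spatially relevant encoder features reach the decoder.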

Evaluation

The models were trained and validated rigorously, with a focus on the benefits of attention, deep supervision, and the multi-objective loss. The performance of the three models (AR2B, AR2B-DeepSup, and Swin-AR2B-DeepSup) was evaluated on several key metrics, including accuracy, loss, Dice coefficient, sensitivity, specificity, and precision.

These metrics were computed for each tumor class during both training and validation phases. The highest mean Dice Coefficient achieved during these phases was used to select the best epoch for each model.
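
For reference, the Dice coefficient used throughout is the standard overlap measure, Dice = 2|P ∩ T| / (|P| + |T|), computed per tumor class. A minimal PyTorch sketch of such a metric (illustrative, not taken verbatim from losses.py):

```python
import torch

def dice_coefficient(pred, target, eps=1e-6):
    """Per-class Dice: 2|P ∩ T| / (|P| + |T|), averaged over the batch.

    pred, target: binary masks of shape (B, D, H, W) for a single class.
    eps guards against division by zero for empty masks.
    """
    pred = pred.float().flatten(1)
    target = target.float().flatten(1)
    intersection = (pred * target).sum(dim=1)
    denom = pred.sum(dim=1) + target.sum(dim=1)
    return ((2 * intersection + eps) / (denom + eps)).mean()
```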

Performance

  1. AR2B Model: Achieved the lowest training loss (0.145) but displayed occasional spikes, indicating potential training instability. It reached its maximum validation Dice of 0.691 at epoch 350, after which the validation Dice decreased and the training loss increased, suggesting overfitting.
  2. AR2B-DeepSup Model: Trained more stably, with fewer spikes, and reached a minimum training loss of 0.330. During validation, this model surpassed the others, recording a mean Dice score of 0.795 at epoch 900.
  3. Swin-AR2B-DeepSup Model: Reached a minimum training loss of 0.533 and a mean validation Dice score of 0.757 at epoch 950.

*(Figure: training and validation Dice curves)*

The table below compares model performance per tumor class (1), (2), (3) and the class mean:

| Model | Dice (1) | Dice (2) | Dice (3) | Dice (mean) | Sensitivity (1) | Sensitivity (2) | Sensitivity (3) | Sensitivity (mean) | Specificity (1) | Specificity (2) | Specificity (3) | Specificity (mean) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| AR2B | 0.650 | 0.653 | 0.768 | 0.691 | 0.582 | 0.842 | 0.824 | 0.749 | 0.999 | 0.972 | 0.995 | 0.989 |
| AR2B-DeepSup | **0.770** | 0.770 | 0.843 | 0.795 | **0.800** | 0.843 | 0.911 | 0.851 | 0.997 | 0.987 | 0.998 | 0.994 |
| Swin-AR2B-DeepSup | 0.729 | 0.736 | 0.807 | 0.757 | 0.685 | 0.827 | 0.896 | 0.802 | 0.933 | 0.999 | 0.988 | 0.973 |
| nnU-Netv2 | 0.756 | 0.861 | 0.850 | 0.822 | 0.763 | 0.893 | 0.926 | 0.873 | 1.000 | 0.999 | 1.000 | 1.000 |

The AR2B-DeepSup model achieved the highest overall performance among our developed models across all evaluated metrics, and even surpassed nnU-Netv2 for tumour class 1 (Dice 0.770 vs. 0.756).

Qualitative Results

Accurate Predictions by All Models:

In the best cases, all models showed high overlap with the ground-truth segmentation, demonstrating that each can segment accurately under favourable conditions.

*(Figure: a case segmented well by all three models)*

Superior Performance of AR2B-DeepSup:

This model most closely matched the true tumor shape and outperformed the other two models in capturing intricate tumor morphology, especially for tumor class 1, the NCR (necrotic core) region.

*(Figure: AR2B-DeepSup prediction vs. ground truth)*

Better Background Identification by Swin-AR2B-DeepSup:

In some cases, this model outperformed the others by better identifying the background region and more accurately predicting tumor boundaries.

*(Figure: Swin-AR2B-DeepSup prediction vs. ground truth)*

Challenges in Poorly-Segmented Cases:

All models struggled in some cases, especially those with low contrast, either failing to detect tumors or making false-positive predictions, indicating room for improvement on challenging cases.

*(Figure: a poorly segmented, low-contrast case)*

Contributions

This work contributes to the field of medical image analysis by demonstrating the effective integration of contemporary techniques in CNN architectures for brain tumor segmentation. It highlights the potential and limitations of these approaches, providing a foundation for future research and development in automated medical image segmentation.

Code Overview

This section provides an overview of the source code for my dissertation project, "Enhancing Brain Tumor Segmentation in Multimodal MRI Scans." The attached folder contains the project code, a sampled dataset, and related resources. Below, you'll find detailed information about the contents of the src folder, the structure of the data folder, and the saved_models folder.

Contents of the src Folder

Data Download and Preprocessing

  1. 00_data_download.py: Downloads the BraTS 2023 training dataset using the synapseclient library and extracts it to the ../input folder. Edit the script with your own username and password from the Synapse website before running it (a minimal download sketch follows this list).

  2. 01_create_directories.py: Creates the necessary directory structure for both the nnUNet framework and our custom models.

  3. 02_data_description.ipynb: Provides a comprehensive description of the dataset, checking for missing files, analyzing segmentation labels, and visualizing original and cropped modalities.

  4. 02_data_nnunet_split.py: Splits the input dataset into training and validation sets in the format required for nnUNetv2.

  5. 03_data_nnunet_to_npy.py: Preprocesses and converts the training and test dataset, previously split for nnUNet, into a suitable format for our custom models.
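
For orientation, downloading from Synapse with synapseclient generally follows the pattern below. The entity ID is a placeholder (the real one is set inside 00_data_download.py), and the credentials must be your own:

```python
import synapseclient

# Log in with your own Synapse credentials (or an auth token).
syn = synapseclient.Synapse()
syn.login("your_username", "your_password")

# "syn00000000" is a placeholder entity ID; the actual BraTS 2023 ID
# is configured in 00_data_download.py.
entity = syn.get("syn00000000", downloadLocation="../input")
print("Downloaded to:", entity.path)
```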

Resources

  1. custom_data_generator.py: Implements a custom dataloader in PyTorch to efficiently feed data in batches to our custom models.

  2. losses.py: Contains implementations of various loss functions and evaluation metrics for model training and evaluation (a combined-loss sketch follows this list).

  3. models.py: Implements key architectural blocks and model architectures for AR2B, AR2B-DeepSup, and Swin-AR2B-DeepSup.

  4. generate_predictions.py: Generates predictions from our custom models and provides corresponding ground truth data for evaluation.

  5. nnunet_evaluation.py: Evaluates the predictions made by nnUNetv2, utilizing predefined evaluation metrics.

  6. write_options.py: Implements a DualStream module to direct the output simultaneously to the screen and an output file, aiding in better management of results and logs.

  7. plots.ipynb: Provides code to plot and annotate model performances from our evaluation and experiments.

  8. qualitative_visualisation.ipynb: Visualizes the predictions of the models against the ground truth.
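
Several of the training scripts below refer to a "combined loss with class weighting". As a rough sketch, such an objective typically adds a class-weighted cross-entropy to a soft Dice term; the exact formulation is in losses.py and may differ from this illustration:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CombinedLoss(nn.Module):
    """Sketch of a weighted cross-entropy + soft Dice objective."""

    def __init__(self, class_weights, ce_weight=1.0, dice_weight=1.0, eps=1e-6):
        super().__init__()
        self.ce = nn.CrossEntropyLoss(weight=class_weights)
        self.ce_weight, self.dice_weight, self.eps = ce_weight, dice_weight, eps

    def forward(self, logits, target):
        # logits: (B, C, D, H, W); target: (B, D, H, W) integer labels.
        ce_loss = self.ce(logits, target)
        probs = F.softmax(logits, dim=1)
        one_hot = F.one_hot(target, probs.shape[1]).permute(0, 4, 1, 2, 3).float()
        dims = (0, 2, 3, 4)  # reduce over batch and spatial dims, per class
        intersection = (probs * one_hot).sum(dims)
        denom = probs.sum(dims) + one_hot.sum(dims)
        dice = ((2 * intersection + self.eps) / (denom + self.eps)).mean()
        return self.ce_weight * ce_loss + self.dice_weight * (1.0 - dice)
```

The weighted cross-entropy counteracts the imbalance between the background and the small tumor regions, while the Dice term directly optimizes the overlap metric reported in the Evaluation section.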

Model Training

  1. 17_training_base_parallel.py: Trains the base model AR2B using combined loss with class weighting.

  2. 08_training_deep_parallel.py: Trains the deep supervision model AR2B-DeepSup using combined loss with class weighting (a deep-supervision loss sketch follows this list).

  3. 09_training_swin_parallel.py: Trains the Swin transformer model Swin-AR2B-DeepSup, utilizing combined loss with class weighting.
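
To make the deep-supervision part concrete: the decoder emits auxiliary segmentation outputs at several scales, and their losses are combined with decaying weights. The sketch below shows this common pattern; it is an assumption on my part, not necessarily the exact scheme used in 08_training_deep_parallel.py:

```python
import torch.nn.functional as F

def deep_supervision_loss(outputs, target, criterion, weights=(1.0, 0.5, 0.25)):
    """Combine the loss over multi-scale auxiliary outputs.

    outputs:   list of logits, one per supervised decoder scale,
               each of shape (B, C, d_i, h_i, w_i).
    target:    (B, D, H, W) integer labels at full resolution.
    criterion: any per-output loss, e.g. a combined Dice + CE loss.
    """
    total = 0.0
    for out, w in zip(outputs, weights):
        scaled = target
        if out.shape[2:] != target.shape[1:]:
            # Downsample labels with nearest-neighbour to match this scale.
            scaled = F.interpolate(target[:, None].float(),
                                   size=out.shape[2:],
                                   mode="nearest").squeeze(1).long()
        total = total + w * criterion(out, scaled)
    return total
```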

Experiments

  1. 07_training_base_parallel_cross_entropy.py: Trains the base model AR2B using cross-entropy loss with class weighting as an alternative approach.

  2. 28_training_deep_parallel_learning_rate.py: Conducts experiments with different learning rates for training AR2B-DeepSup.

  3. 38_training_deep_parallel_equal_weights.py: Trains the deep supervision model AR2B-DeepSup using combined loss with equal weights for the background and tumor classes.

Folder Structure in the data Folder

The data folder is structured as follows:

* data
    * training
        * images
        * masks
        * nnUNet_raw
            * Dataset001_BRATS
                * imagesTr
                * labelsTr
            * nnUNet_preprocessed
            * nnUNet_results
    * test
        * images
        * masks
        * nnUNet_raw
            * Dataset001_BRATS
                * imagesTs
                * labelsTs
        * nnUNet_results

This structure keeps the training and test data organized for both the custom models and nnU-Netv2.

Contents of the data Folder

The data folder contains a total of 30 training records, including both images and masks, along with 8 records designated for validation and testing. These samples are preprocessed and formatted for training and evaluating the models.

To download the raw dataset or if the dataset file is missing, please run the 00_data_download.py script with your credentials and follow the data preprocessing scripts sequentially.

Additional Details Regarding the Attached Zip File

Contents of the saved_models Folder

The saved_models folder contains best model checkpoints from our training:

  • 17_best_model_base.pt: The best AR2B model trained on Combined Loss.
  • 08_best_model_deep.pt: The best AR2B-DeepSup model trained on Combined Loss.
  • 09_best_model_swin.pt: The best Swin-AR2B-DeepSup model trained on Combined Loss.

These saved models can be used for inference or further experimentation.
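
A minimal loading sketch for inference, assuming the checkpoints store a plain state_dict and that models.py exposes the model class under this name (both are assumptions; check models.py for the actual names):

```python
import torch
from models import AR2B_DeepSup  # hypothetical class name; see models.py

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = AR2B_DeepSup().to(device)

checkpoint = torch.load("saved_models/08_best_model_deep.pt",
                        map_location=device)
# If the file stores a full model object rather than a state_dict,
# use the loaded object directly instead.
model.load_state_dict(checkpoint)
model.eval()
```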

Contents of the predictions Folder

In the predictions folder, we have saved ground truths and model predictions from our three models for 2 randomly selected cases. You can visualize them using the qualitative_visualisation.ipynb notebook.

Thank you for using our project code!
