Git Product home page Git Product logo

debeshjha / transrupnet Goto Github PK

View Code? Open in Web Editor NEW
4.0 2.0 0.0 847 KB

TransRUPNet for Improved Out-of-Distribution Generalization in Polyp Segmentation (IEEE EMBC)

Home Page: https://arxiv.org/pdf/2306.02176.pdf

Python 100.00%
colonoscopy encoder-decoder-architecture gastroenterology medical-image-segmentation polyp-segmentation pvt segmentation-models transformer-architecture unet-pytorch

transrupnet's Introduction

TransRUPNet for Improved Out-of-Distribution Generalization in Polyp Segmentation

Overview

We develop a novel real-time deep learning-based architecture, TransRUPNet, that is based on a Transformer and residual upsampling network for colorectal polyp segmentation to improve OOD generalization. The proposed architecture, TransRUPNet, is an encoder-decoder network that consists of three encoder blocks, three decoder blocks, and some additional upsampling blocks at the end of the network. With the image size of $256\times256$, the proposed method achieves an excellent real-time operation speed of 47.07 frames per second with an average mean dice coefficient score of 0.7786 and mean Intersection over Union of 0.7210 on the out-of-distribution polyp datasets. The results on the publicly available PolypGen dataset (OOD dataset in our case) suggest that TransRUPNet can give real-time feedback while retaining high accuracy for in-distribution datasets. Furthermore, we demonstrate the generalizability of the proposed method by showing that it significantly improves performance on OOD datasets compared to the existing methods.

Architecture

Key features

Encoder-Decoder Structure: It consists of three encoder blocks, three decoder blocks, and additional upsampling blocks.

Use of Pyramid Vision Transformer (PVT): The network begins with a PVT as a pretrained encoder, which helps in extracting various feature maps.

Feature Map Processing: The extracted feature maps are reduced and passed through up blocks and decoder blocks, involving bilinear upsampling and residual blocks for robust representation learning.

Output Generation: The outputs from the up blocks are concatenated into a single feature map, followed by a residual block, 1x1 convolution, and a sigmoid activation to generate the final segmentation mask.

Performance Metrics: TransRUPNet demonstrated impressive real-time operation speed and accuracy, with significant performance improvements on OOD datasets compared to existing methods.

Datasets:

The following datasets are used in this experiment:

  1. Kvasir-SEG (https://datasets.simula.no/kvasir-seg/)
  2. [Polypgen] (https://drive.google.com/drive/u/1/folders/16uL9n84SrMt7IiQFzTUQNaJ9TbHJ8DhW)
  3. [BKAI-IGH] (https://paperswithcode.com/dataset/bkai-igh-neopolyp-small)/li>

Results

Citation

Please cite our paper if you find the work useful:

@article{jha2023transrupnet,
  title={TransRUPNet for Improved Out-of-Distribution Generalization in Polyp Segmentation},
  author={Jha, Debesh and Tomar, Nikhil Kumar and Bagci, Ulas},
  journal={arXiv preprint arXiv:2306.02176},
  year={2023}
}

Contact

please contact [email protected] for any further questions.

transrupnet's People

Contributors

debeshjha avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.