Git Product home page Git Product logo

openseg.pytorch's Introduction

openseg.pytorch

Updates

News!!! @ 2019/08/09.

We would like to support various backbones such as MobileNetV2, ResNet-101, Wide-ResNet-38, HRNetV2-48.

Update @ 2019/07/31.

We have released the paper ISA, which is very easy to use and implement while being much more efficient than the conventional self-attention based OCNet or DANet.

Update @ 2019/07/23.

We (HRNet + OCR w/ ASP) achieve Rank#1 on the leaderboard of Cityscapes (with a single model) on 3 of 4 metrics.

Update @ 2019/06/19.

We achieve 83.3116%+ on the leaderboard of Cityscapes test set based on single model HRNetV2 + OCR. Cityscapes leaderboard

We achieve 56.02% on the leaderboard of ADE20K test set based on single model ResNet101 + OCR without any bells or whistles. ADE20K leaderboard

Update @ 2019/05/27.

We achieve SOTA on 6 different semantic segmentation benchmarks including: Cityscapes, ADE20K, LIP, Pascal-Context, Pascal-VOC, COCO-Stuff. We provide the source code for our approach on all the six benchmarks. More benchmarks will be supported latter. We will consider release all the check-points and training log for the below experiments.

82.0%+/83.0%+ on the test set of Cityscapes with only Train-Fine + Val-Fine datasets/Coarse datasets.

45.5%+ on the val set of ADE20K.

56.5%+ on the val set of LIP.

56.0%+ on the val set of Pascal-Context.

81.0%+ on the val set of Pascal-VOC with ss test. (DeepLabv3+ is 80.02% with only train-aug)

40.5%+ on the val set of COCO-Stuff-10K.

Performances with openseg.pytorch

  • Cityscapes (testing with single scale whole image)
Methods Backbone Train. mIOU Val. mIOU Test. mIOU BS Iters
FCN MobileNetV2 - - - - -
FCN 3x3-ResNet101 - - - 8 4W
FCN Wide-ResNet38 - - - 8 4W
FCN HRNetV2-48 - - - 8 10W
OCNet MobileNetV2 - - - - -
OCNet 3x3-ResNet101 - - - 8 4W
OCNet Wide-ResNet38 - - - 16 2W
OCNet HRNetV2-48 - - - 8 10W
ISA MobileNetV2 - - - - -
ISA 3x3-ResNet101 - - - 8 4W
ISA Wide-ResNet38 - - - 16 2W
ISA HRNetV2-48 - - - 8 10W
OCR MobileNetV2 - - - - -
OCR 3x3-ResNet101 - - - 8 4W
OCR Wide-ResNet38 - - - 16 2W
OCR HRNetV2-48 - - - 8 10W
  • ADE20K (testing with single scale whole image)
Methods Backbone Val. mIOU PixelACC BS Iters
FCN 3x3-ResNet101 - - 16 15W
FCN Wide-ResNet38 - - 16 15W
FCN HRNetV2-48 - - 16 15W
OCNet 3x3-ResNet101 - - 16 15W
OCNet Wide-ResNet38 - - 16 15W
OCNet HRNetV2-48 - - 16 15W
ISA 3x3-ResNet101 - - 16 15W
ISA Wide-ResNet38 - - 16 15W
ISA HRNetV2-48 - - 16 15W
OCR 3x3-ResNet101 - - 16 15W
OCR Wide-ResNet38 - - 16 15W
OCR HRNetV2-48 - - 16 15W
  • LIP (testing with single scale whole image + left-right flip)
Methods Backbone Val. mIOU PixelACC BS Iters
FCN 3x3-ResNet101 - - 32 10W
FCN Wide-ResNet38 - - 32 10W
FCN HRNetV2-48 - - 32 10W
OCNet 3x3-ResNet101 - - 32 10W
OCNet Wide-ResNet38 - - 32 10W
OCNet HRNetV2-48 - - 32 10W
ISA 3x3-ResNet101 - - 32 10W
ISA Wide-ResNet38 - - 32 10W
ISA HRNetV2-48 - - 32 10W
OCR 3x3-ResNet101 - - 32 10W
OCR Wide-ResNet38 - - 32 10W
OCR HRNetV2-48 - - 32 10W
  • Pascal-VOC (testing with single scale whole image)
Methods Backbone Val. mIOU PixelACC BS Iters
FCN 3x3-ResNet101 - - 16 6W
FCN Wide-ResNet38 - - 16 6W
FCN HRNetV2-48 - - 16 6W
OCNet 3x3-ResNet101 - - 16 6W
OCNet Wide-ResNet38 - - 16 6W
OCNet HRNetV2-48 - - 16 6W
ISA 3x3-ResNet101 - - 16 6W
ISA Wide-ResNet38 - - 16 6W
ISA HRNetV2-48 - - 16 6W
OCR 3x3-ResNet101 - - 16 6W
OCR Wide-ResNet38 - - 16 6W
OCR HRNetV2-48 - - 16 6W
  • Pascal-Context (testing with single scale whole image)
Methods Backbone Val. mIOU PixelACC BS Iters
FCN 3x3-ResNet101 - - 16 3W
FCN Wide-ResNet38 - - 16 3W
FCN HRNetV2-48 - - 16 3W
OCNet 3x3-ResNet101 - - 16 3W
OCNet Wide-ResNet38 - - 16 3W
OCNet HRNetV2-48 - - 16 3W
ISA 3x3-ResNet101 - - 16 3W
ISA Wide-ResNet38 - - 16 3W
ISA HRNetV2-48 - - 16 3W
OCR 3x3-ResNet101 - - 16 3W
OCR Wide-ResNet38 - - 16 3W
OCR HRNetV2-48 - - 16 3W
  • COCO-Stuff-10K (testing with single scale whole image)
Methods Backbone Val. mIOU PixelACC BS Iters
FCN 3x3-ResNet101 - - 16 6W
FCN Wide-ResNet38 - - 16 6W
FCN HRNetV2-48 - - 16 6W
OCNet 3x3-ResNet101 - - 16 6W
OCNet Wide-ResNet38 - - 16 6W
OCNet HRNetV2-48 - - 16 6W
ISA 3x3-ResNet101 - - 16 6W
ISA Wide-ResNet38 - - 16 6W
ISA HRNetV2-48 - - 16 6W
OCR 3x3-ResNet101 - - 16 6W
OCR Wide-ResNet38 - - 16 6W
OCR HRNetV2-48 - - 16 6W

Citation

Please consider citing our work if you find it helps you,

@article{yuan2018ocnet,
  title={Ocnet: Object context network for scene parsing},
  author={Yuan Yuhui and Wang Jingdong},
  journal={arXiv preprint arXiv:1809.00916},
  year={2018}
}

@article{huang2019isa,
  title={Interlaced Sparse Self-Attention for Semantic Segmentation},
  author={Huang Lang and Yuan Yuhui and Guo Jianyuan and Zhang Chao and Chen Xilin and Wang Jingdong},
  journal={arXiv preprint arXiv:1907.12273},
  year={2019}
}

Acknowledgment

This project is developed based on the segbox.pytorch and the author of segbox.pytorch donnyyou retains all the copyright of the reproduced Deeplabv3, PSPNet related code.

openseg.pytorch's People

Contributors

pkurainbow avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.