This repo try to implement state-of-art fast semantic segmentation models on road scene dataset(CityScape, Camvid).
This repo aims to do experiments and verify the idea of fast semantic segmentation and this repo also provide some fast models.
- ICNet:ICnet for real-time semantic segmentation on high-resolution images.
- DF-Net: Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search.
- Bi-Seg: Bilateral segmentation network for real-time semantic segmentation.
- DFA-Net: Deep feature aggregation for real-time semantic segmentation.
- ESP-Net: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
- SwiftNet: In defense of pre-trained imagenet architectures for real-time semantic segmentation of road-driving images.
- Real-Time Semantic Segmentation via Multiply Spatial Fusion Network
- Fast-SCNN: Fast Semantic Segmentation Network
- use train_distribute.py for training
- use prediction_test_different_size.py for prediction with different size input.
- use eval.py for evaluation on validation dataset.
- You can download [cityscapes] dataset (https://www.cityscapes-dataset.com/) from here. Note: please download leftImg8bit_trainvaltest.zip(11GB) and gtFine_trainvaltest(241MB).
- You can download camvid dataset from here.
- use syn-bn(apex).
- use batch-size >=8.
- use deep supervised loss for easier optimation.
- use large crop size during training.
- longer training time for small models(60,000 interaction or more).
- use Mapillary data for pretraining for boosting performance.
- Deeply based resnet runs slowly than torch pretrained resnet but with higher accuracy.
(a) test image (b) ground truth (c) predicted result
This project is released under the Apache 2.0 license.