neptune-ai / open-solution-mapping-challenge

Open solution to the Mapping Challenge :earth_americas:

Home Page: https://www.crowdai.org/challenges/mapping-challenge

License: MIT License

Python 55.84% Jupyter Notebook 44.10% Makefile 0.07%
data-science machine-learning deep-learning kaggle python satellite-imagery data-science-learning lightgbm unet unet-image-segmentation

open-solution-mapping-challenge's Introduction

Open Solution to the Mapping Challenge Competition


Note

Unfortunately, we can no longer provide support for this repo. Hopefully it still works, but if it doesn't, we cannot really help.

More competitions 🎇

Check our collection of public projects 🎁, where you can find multiple Kaggle competitions with code, experiments, and outputs.

Poster 🌍

A poster that summarizes our project is available here.

Intro

Open solution to the CrowdAI Mapping Challenge competition.

  1. Check the live preview of our work on the public projects page: Mapping Challenge 📈.
  2. Source code and issues are publicly available.

Results

0.943 Average Precision 🚀

0.954 Average Recall 🚀

No cherry-picking here, I promise 😉. The results exceeded our expectations. The output from the network is so good that not a lot of morphological shenanigans are needed. Happy days :)

Average Precision and Average Recall were calculated on stage 1 data using pycocotools. Check this blog post for an explanation of average precision.

Disclaimer

In this open source solution you will find references to neptune.ai. It is a platform that is free for community users, and we use it daily to keep track of our experiments. Please note that using neptune.ai is not necessary to proceed with this solution. You may run it as a plain Python script 😉.

Reproduce it!

Check REPRODUCE_RESULTS

Solution write-up

Pipeline diagram

Preprocessing

โœ”๏ธ What Worked

  • Overlay binary masks for each image are produced (code 💻).
  • Distances to the two closest objects are calculated, creating the distance map that is used for weighting (code 💻).
  • Size masks for each image are produced (code 💻).
  • Dropped small masks on the edges (code 💻).
  • We load training and validation data in batches: using torch.utils.data.Dataset and torch.utils.data.DataLoader makes it easy and clean (code 💻; see the sketch after this list).
  • Only some basic augmentations (due to speed constraints) from the imgaug package are applied to images (code 💻).
  • Images are resized before being fed to the network. Surprisingly, this worked better than cropping (code 💻 and config 📑).
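
A minimal sketch of this batching setup, with an illustrative dataset class and dummy data (not the project's actual code):

import numpy as np
import torch
from torch.utils.data import Dataset, DataLoader

class ImageMaskDataset(Dataset):
    """Illustrative dataset pairing images with their overlay masks."""
    def __init__(self, images, masks, transform=None):
        self.images, self.masks, self.transform = images, masks, transform

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        image, mask = self.images[idx], self.masks[idx]
        if self.transform is not None:  # e.g. imgaug augmentations
            image, mask = self.transform(image, mask)
        # CHW float image in [0, 1] and an integer mask, as PyTorch losses expect
        return (torch.from_numpy(image).permute(2, 0, 1).float() / 255.0,
                torch.from_numpy(mask).long())

# Dummy data standing in for the real images and masks
images = [np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8) for _ in range(8)]
masks = [np.random.randint(0, 2, (256, 256), dtype=np.uint8) for _ in range(8)]
loader = DataLoader(ImageMaskDataset(images, masks), batch_size=4, shuffle=True)

for batch_images, batch_masks in loader:
    print(batch_images.shape, batch_masks.shape)  # [4, 3, 256, 256] and [4, 256, 256]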

โœ–๏ธ What didn't Work

  • Ground truth masks are prepared by first eroding each mask to make them non-overlapping, and only then calculating the distances (code 💻).
  • Dilated small objects to increase the signal (code 💻).
  • Network is fed with random crops (code 💻 and config 📑).

🤔 What could have worked but we haven't tried it

Network

โœ”๏ธ What Worked

  • U-Net with ResNet34, ResNet101, and ResNet152 as encoders, where ResNet101 gave us the best results. This approach is explained in the TernausNetV2 paper (our code 💻 and config 📑). Also take a look at our parametrizable implementation of the U-Net.

โœ–๏ธ What didn't Work

  • Network architecture based on dilated convolutions described in this paper.

🤔 What could have worked but we haven't tried it

  • U-Net with contextual blocks explained in this paper.

Loss function

โœ”๏ธ What Worked

  • Distance weighted cross entropy explained in the famous U-Net paper (our code 💻 and config 📑).
  • Using a linear combination of soft dice and distance weighted cross entropy (code 💻 and config 📑; see the sketch after this list).
  • Adding a component weighted by building size (smaller buildings have greater weight) to the weighted cross entropy, penalizing misclassification of pixels belonging to small objects (code 💻).
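
The project's actual loss lives in the linked code; below is a compact, binary-case sketch of the combination described above (function names are illustrative; targets and weights are float tensors of the same shape as logits):

import torch
import torch.nn.functional as F

def soft_dice_loss(logits, targets, eps=1e-7):
    """Soft dice on foreground probabilities (binary case for brevity)."""
    probs = torch.sigmoid(logits)
    intersection = (probs * targets).sum()
    return 1.0 - (2.0 * intersection + eps) / (probs.sum() + targets.sum() + eps)

def weighted_bce_loss(logits, targets, weights):
    """Cross entropy with a per-pixel weight map (distance and/or size weights)."""
    bce = F.binary_cross_entropy_with_logits(logits, targets, reduction='none')
    return (weights * bce).mean()

def combined_loss(logits, targets, weights, dice_weight=0.5):
    """Linear combination; dice_weight is raised late in training (see Training)."""
    return (weighted_bce_loss(logits, targets, weights)
            + dice_weight * soft_dice_loss(logits, targets))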

Weights visualization

For both weight maps: the darker the color, the higher the value.

  • distance weights: high values correspond to pixels between buildings.
  • size weights: high values denote small buildings (the smaller the building, the darker the color). Note that no-building areas are fixed to black.

Training

โœ”๏ธ What Worked

  • Use pretrained models!
  • Our multistage training procedure:
    1. train on a 50,000-example subset of the dataset with lr=0.0001 and dice_weight=0.5
    2. train on the full dataset with lr=0.0001 and dice_weight=0.5
    3. train with a smaller lr=0.00001 and dice_weight=0.5
    4. increase the dice weight to dice_weight=5.0 to make results smoother
  • Multi-GPU training
  • Use very simple augmentations

The entire configuration can be tweaked from the config file 📑.
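
A hypothetical sketch of that four-stage schedule; train is a stand-in for one full training run, and the real loop and parameters live in the config and the linked code (stage 4 only changes the dice weight, so the learning rate is assumed unchanged):

def train(model, data, lr, dice_weight):
    """Stand-in for one full training run with the given hyperparameters."""
    print(f'training on {data}: lr={lr}, dice_weight={dice_weight}')

model = None  # placeholder for the pretrained U-Net
stages = [
    dict(data='50k subset', lr=1e-4, dice_weight=0.5),  # 1. warm up on a subset
    dict(data='full',       lr=1e-4, dice_weight=0.5),  # 2. move to the full dataset
    dict(data='full',       lr=1e-5, dice_weight=0.5),  # 3. lower the learning rate
    dict(data='full',       lr=1e-5, dice_weight=5.0),  # 4. raise the dice weight to smooth results
]
for stage in stages:
    train(model, **stage)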

🤔 What could have worked but we haven't tried it

  • Set different learning rates to different layers.
  • Use cyclic optimizers.
  • Use warm start optimizers.

Postprocessing

โœ”๏ธ What Worked

  • Test time augmentation (TTA). Make predictions on image rotations (90, 180, 270 degrees) and flips (up-down, left-right) and take the geometric mean of the predictions (code 💻 and config 📑; see the sketch after this list).
  • Simple morphological operations. At the beginning we used erosion followed by labeling and per-label dilation, with structuring elements chosen by cross-validation. As the models got better, erosion was removed and a very small dilation was the only operation still showing improvements (code 💻).
  • Scoring objects. In the beginning we simply used a score of 1.0 for every object, which was a huge mistake. Changing that to the average probability over the object region improved results. What improved scores even more was weighting those probabilities by the object size (code 💻).
  • Second-level model. We tried LightGBM and Random Forest trained on U-Net outputs and features calculated during postprocessing.
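
A sketch of the rotation/flip TTA with a geometric mean; predict stands in for any function mapping an image to a same-shaped probability map, and the project's actual implementation is in the linked code:

import numpy as np

def tta_predict(predict, image):
    """Average predictions over 90/180/270-degree rotations and both flips,
    undoing each transform before taking the geometric mean."""
    probs = [predict(image)]
    for k in (1, 2, 3):                               # 90, 180, 270 degrees
        probs.append(np.rot90(predict(np.rot90(image, k)), -k))
    for flip in (np.flipud, np.fliplr):               # up-down, left-right
        probs.append(flip(predict(flip(image))))      # flips are self-inverse
    probs = np.clip(np.stack(probs), 1e-6, 1.0)       # avoid log(0)
    return np.exp(np.log(probs).mean(axis=0))         # geometric mean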

โœ–๏ธ What didn't Work

  • Test time augmentations using colors (config 📑).
  • Inference on reflection-padded images was not the way to go. What worked better (but not for the very best models) was replication padding, where the border pixel value is replicated for all the padded regions (code 💻).
  • Conditional Random Fields. It was so slow that we didn't check it for the best models (code 💻).

🤔 What could have worked but we haven't tried it

  • Ensembling
  • Recurrent neural networks for postprocessing (instead of our current approach)

Model Weights

Model weights for the winning solution are available here.

You can use those weights and run the pipeline as explained in REPRODUCE_RESULTS.

User support

There are several ways to seek help:

  1. CrowdAI discussion.
  2. You can submit an issue directly in this repo.
  3. Join us on Gitter.

Contributing

  1. Check CONTRIBUTING for more information.
  2. Browse the issues to see if there is something you would like to contribute to.

open-solution-mapping-challenge's People

Contributors

apyskir, gitter-badger, jakubczakon, kamil-kaczmarek, kant, spmohanty, taraspiotr


open-solution-mapping-challenge's Issues

improve model saving

  • Add the epoch number to the saved model name (currently just model.torch).
  • Save the model as, e.g., model_epoch123.model.
  • Remove the previously saved model (the one with the smaller epoch number).
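
A minimal sketch of the requested behaviour; the function name and paths below are hypothetical, following the naming proposed in this issue:

import os
import torch

def save_checkpoint(model, epoch, checkpoint_dir):
    """Save the model with the epoch number in its name, then drop the
    checkpoint from the previous epoch."""
    path = os.path.join(checkpoint_dir, f'model_epoch{epoch}.model')
    torch.save(model.state_dict(), path)
    previous = os.path.join(checkpoint_dir, f'model_epoch{epoch - 1}.model')
    if os.path.exists(previous):
        os.remove(previous)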

prepare metadata and masks

General requirements:

  • generate metadata file
  • in main.py option to prepare meta and masks (for train)

For the training purposes:

  • prepare overlayed masks

Note that masks for evaluation will be prepared on the fly in the loader from a single JSON file.

Models transform method that returns generators

Right now each steps.pytorch.Model instance returns a dictionary of lists of outputs from the network. This causes problems when working with larger datasets. I think it should return a generator instead.
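
A sketch of the proposed change; the function below is illustrative and not the actual steps.pytorch API:

import torch

def transform_generator(model, loader):
    """Yield per-batch network outputs instead of accumulating them in lists,
    so predictions for a large dataset never sit in memory all at once."""
    model.eval()
    with torch.no_grad():
        for batch in loader:
            yield model(batch).cpu()

# Downstream steps can then consume the outputs lazily:
# for outputs in transform_generator(model, loader):
#     postprocess(outputs)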

train models on small buildings only

  • take only masks with objects smaller than 32^2 pixels
  • check if we can get reasonable results on small buildings only
  • if successful, train a small-buildings specialist

Investigate if the validation set has any rotated images

Using random rotation in augmentation may cause trouble because:

  • if pictures are taken at the same time of day (and presumably within a short period of the year), shadows will be more or less the same
  • the orientation of buildings/roads may be more or less the same throughout the dataset

Actions:

  • go through train/valid/test examples and check if the constant-angle hypothesis is correct
  • drop rotation from augmentations or use only very small rotations

evaluation in chunks

Evaluation is currently not possible on the entire validation set. There is, however, an option to generate predictions in chunks. A similar option for evaluation would help.
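
A hypothetical sketch of such a chunked evaluation; predict_chunk and score stand in for the existing prediction and metric code, and note that averaging per-chunk metrics only approximates the metric computed globally:

def evaluate_in_chunks(predict_chunk, score, dataset, chunk_size=1000):
    """Evaluate chunk by chunk so all predictions never have to be in memory."""
    weighted_scores, total = 0.0, 0
    for start in range(0, len(dataset), chunk_size):
        chunk = dataset[start:start + chunk_size]
        predictions = predict_chunk(chunk)   # reuse the chunked prediction path
        weighted_scores += score(predictions, chunk) * len(chunk)
        total += len(chunk)
    return weighted_scores / total           # size-weighted average over chunks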

Prepare masks for multiclass case

Currently the function overlay_masks_from_annotations in preparation.py assumes the single-class case. It has to be modified and adapted to the multiclass case. Some tuning of the loaders may be necessary.

Experiment with loss parameters

Loss params like:

  • BCE weight
  • DICE weight
  • distance weighted loss params
  • size weighted loss params
  • weight schedule

need to be investigated.

Calculate normalization constants step

  • Currently the normalization constants mean=0, std=1 are hard-coded; they should be calculated on the training set and passed to the loaders.
  • Substitute mean and std with values from pretrained PyTorch models (i.e. ResNet).

Mean and std different from pretrained PyTorch models

It seems that in pipeline_config.py:

MEAN = [0., 0., 0.]
STD = [1., 1., 1.]

But the PyTorch pretrained models use:

transforms.Normalize(mean=[0.485, 0.456, 0.406],
                     std=[0.229, 0.224, 0.225])

I think this may cause suboptimal results.
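
A sketch of computing the constants from the training images; the ImageNet values quoted above are the usual alternative when using pretrained encoders:

import numpy as np

def channel_mean_std(images):
    """Per-channel mean and std over a list of HWC uint8 images scaled to [0, 1].
    For large datasets, accumulate running sums instead of concatenating."""
    pixels = np.concatenate([img.reshape(-1, 3) / 255.0 for img in images])
    return pixels.mean(axis=0), pixels.std(axis=0)

# mean, std = channel_mean_std(train_images)  # then pass these to the loaders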

Adapt creating submission for multiclass case

The current submission generator assumes that predictions is a list of single-channel images (one per test image), where pixels with values 1, 2, 3, ... correspond to building instances 1, 2, 3, ...

The new submission generator has to be adjusted to handle output from MulticlassLabeler (to be prepared in issue #23).

Use HDF5 format to store images

One of the options to speed up the data loaders is to:

  • transform your images and target masks into one large HDF5 file
  • refactor/add pytorch Datasets that read from the HDF5 file
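
A sketch of both steps with h5py; the dataset names and class below are illustrative:

import h5py
import numpy as np
import torch
from torch.utils.data import Dataset

def write_hdf5(path, images, masks):
    """Pack all images and target masks into one HDF5 file."""
    with h5py.File(path, 'w') as f:
        f.create_dataset('images', data=np.stack(images), compression='gzip')
        f.create_dataset('masks', data=np.stack(masks), compression='gzip')

class HDF5Dataset(Dataset):
    """Read samples directly from the HDF5 file."""
    def __init__(self, path):
        self.path, self.file = path, None

    def __len__(self):
        with h5py.File(self.path, 'r') as f:
            return len(f['images'])

    def __getitem__(self, idx):
        if self.file is None:  # open lazily so each DataLoader worker gets its own handle
            self.file = h5py.File(self.path, 'r')
        return (torch.from_numpy(self.file['images'][idx]),
                torch.from_numpy(self.file['masks'][idx]))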

weighted cross entropy loss function

Build a training procedure that implements the following:

  • erode masks (so that all masks are separated and don't touch),
  • train a U-Net with a weighted cross entropy loss function. The idea is described in this kaggle post. While training, add more weight in the loss function to pixels that we would assign to the touching-contours category,
  • make predictions,
  • dilate predictions.

Weights are calculated as in the U-Net paper (page 5, eq. 2).
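
A compact sketch of those border weights, w(x) = w0 * exp(-(d1(x) + d2(x))^2 / (2 * sigma^2)); the class-balancing term w_c from the paper is omitted for brevity, and the repo's actual implementation is in the linked code:

import numpy as np
from scipy.ndimage import distance_transform_edt

def unet_border_weights(instance_masks, w0=10.0, sigma=5.0):
    """d1 and d2 are per-pixel distances to the two nearest (eroded,
    non-touching) instances; instance_masks is a list of binary HW arrays."""
    distances = np.stack([distance_transform_edt(np.logical_not(m))
                          for m in instance_masks])
    distances.sort(axis=0)  # per pixel: distances in ascending order
    if len(instance_masks) > 1:
        d1, d2 = distances[0], distances[1]
    else:
        d1 = d2 = distances[0]
    return w0 * np.exp(-((d1 + d2) ** 2) / (2 * sigma ** 2))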

crop central part of a U-Net prediction

U-Net performs poorly on edges. The effect is described in this blog post.

The idea is to crop the central part of the prediction. Of course, this must be combined with sliding-window predictions over a reflection-padded version of the original image.
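
A hypothetical sketch of the idea; for brevity it assumes a single-channel image whose sides are multiples of the effective step, and predict stands in for the network:

import numpy as np

def sliding_window_predict(predict, image, tile=256, crop=32):
    """Reflection-pad the image, run the network on overlapping tiles, and
    keep only the central part of each tile's prediction, where the U-Net
    is most reliable."""
    step = tile - 2 * crop
    assert image.shape[0] % step == 0 and image.shape[1] % step == 0
    padded = np.pad(image, crop, mode='reflect')
    output = np.zeros_like(image, dtype=np.float32)
    for y in range(0, image.shape[0], step):
        for x in range(0, image.shape[1], step):
            pred = predict(padded[y:y + tile, x:x + tile])  # tile-sized probability map
            output[y:y + step, x:x + step] = pred[crop:crop + step, crop:crop + step]
    return output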

Mosaic padding

Implement mosaic-padding-based inference to tackle the troublesome edge regions in U-Nets.
