rvmaretto / deepgeo



DeepGeo: Deep Learning for Earth Observation data ToolBox

License: GNU General Public License v3.0

Jupyter Notebook 99.43% Python 0.57% Shell 0.01%

machine-learning deep-learning remote-sensing geospatial-intelligence geospatial-data neural-networks


deepgeo's Issues

Implement a notebook for data augmentation

Implement a notebook to plot the results of data augmentation.

Implement features to profile the networks

In the current version, there is a clear bottleneck in the CPU integration of the network weights. Implement profiling options to analyse the execution on each device (CPUs and GPUs).

Implement an intuitive way to define different default parameters for different networks

Implement first Deep network

Implement a first, simpler CNN, such as VGG or a similar architecture.

Save geotiff files from the samples

Samples are currently saved only as PNG files. Save them as GeoTIFF files so that they can be visualized in GIS software.

Fix parallelism in Data Augmentation

Data augmentation operations are not working on multiple GPUs, and rotation operations are not even running on GPUs, only on one CPU. Fix it.

Implement unittests for the SampleGenerator class

Implement unittests for the SampleGenerator class.

Verify results of EVI

The synthetic band generated by the computeEVI function seems to be wrong. Verify the results and the formula.
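As a reference point for that check, here is a minimal sketch of the commonly used EVI formulation, EVI = G * (NIR - Red) / (NIR + C1 * Red - C2 * Blue + L), with G = 2.5, C1 = 6, C2 = 7.5 and L = 1. The function name `compute_evi` and its signature are hypothetical and are not the repository's computeEVI. The constants assume surface reflectance scaled to [0, 1], so bands stored as scaled integers would need rescaling first; that is one thing worth checking when the output looks wrong.

```python
import numpy as np


def compute_evi(nir, red, blue, gain=2.5, c1=6.0, c2=7.5, l=1.0):
    """Enhanced Vegetation Index for reflectance bands scaled to [0, 1].

    Hypothetical helper for comparison purposes only.
    """
    nir, red, blue = (np.asarray(b, dtype=np.float64) for b in (nir, red, blue))
    denominator = nir + c1 * red - c2 * blue + l
    # Pixels with a zero denominator are returned as NaN instead of inf.
    with np.errstate(divide="ignore", invalid="ignore"):
        evi = gain * (nir - red) / denominator
    return np.where(denominator == 0, np.nan, evi)
```

Comparing a handful of pixels computed this way against the saved synthetic band should show quickly whether the discrepancy comes from the formula or from the input scaling.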

Create top folder "deepleeo" inside "src"

Inside the src folder, create a new folder "deepleeo", which will be the top folder of the package. Verify the structure. Should the main __init__.py be inside this folder?

After this, include the code coverage in the Travis script:

nosetests --with-coverage --cover-erase --cover-package=deepleeo --cover-html

Implement other vegetation indices

According to the following image, implement other vegetation indices in the preprocessor class.

Fix shift issue in write_pred_chips method

Chips are being saved with some shift.

Implement function to mosaic (merge) different scenes

Verify if the Rasterizer is working with a base raster with more than three bands

The Rasterizer class must work with any number of bands. Verify the behavior when the base raster has more than 3 bands. Fix if necessary.

Implement strategies for chip generation in the ChipGenerator

Implement the following strategies:

Randomly
Randomly per class
Sequential
Sequential with overlap
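Three of these strategies can be sketched as generators of chip positions. This is only an illustration under stated assumptions: the function names are hypothetical, positions are upper-left (row, col) corners, chips that would cross the image border are skipped, and the "randomly per class" strategy is not covered because it needs the label raster.

```python
import random


def sequential_windows(height, width, chip_size, overlap=0):
    """Yield upper-left (row, col) corners, scanning the image row by row.

    With overlap=0 the chips tile the image; a positive overlap makes
    consecutive chips share that many pixels.
    """
    if not 0 <= overlap < chip_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chip_size")
    stride = chip_size - overlap
    for row in range(0, height - chip_size + 1, stride):
        for col in range(0, width - chip_size + 1, stride):
            yield row, col


def random_windows(height, width, chip_size, n_chips, seed=None):
    """Return n_chips random upper-left corners that fit inside the image."""
    rng = random.Random(seed)
    return [
        (rng.randint(0, height - chip_size), rng.randint(0, width - chip_size))
        for _ in range(n_chips)
    ]
```

Keeping the strategies as functions that return positions only would let the ChipGenerator apply the same cropping code regardless of the strategy chosen.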

Implement U-Net with early fusion option

Implement the U-Net with the possibility of early fusion, stacking images BEFORE the encoder.
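The stacking step itself might look like the sketch below, assuming channels-last arrays of shape (height, width, bands) for each date. The function name `early_fusion_stack` is hypothetical, and the network that would consume the stacked array is not shown.

```python
import numpy as np


def early_fusion_stack(images):
    """Concatenate co-registered images along the channel axis.

    images: sequence of arrays shaped (H, W, B_i); returns (H, W, sum(B_i)).
    """
    images = [np.asarray(img) for img in images]
    spatial_shapes = {img.shape[:2] for img in images}
    if len(spatial_shapes) != 1:
        raise ValueError("all images must share the same height and width")
    return np.concatenate(images, axis=-1)
```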

Implement a function to compute the difference between two maps

This function must generate a new map with false negatives, false positives and pixels correctly classified. It must plot it with matplotlib or seaborn.
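A minimal sketch of the map computation, assuming a binary problem in which one label value is treated as the positive class. The function name and the output codes (0, 1, 2) are illustrative choices, not part of the repository. Plotting is left out; the resulting array could be passed to matplotlib's `imshow` with a three-color colormap.

```python
import numpy as np

CORRECT, FALSE_POSITIVE, FALSE_NEGATIVE = 0, 1, 2


def difference_map(reference, predicted, positive=1):
    """Return a map coded as CORRECT, FALSE_POSITIVE or FALSE_NEGATIVE."""
    reference = np.asarray(reference)
    predicted = np.asarray(predicted)
    if reference.shape != predicted.shape:
        raise ValueError("reference and predicted maps must have the same shape")
    ref_pos = reference == positive
    pred_pos = predicted == positive
    diff = np.full(reference.shape, CORRECT, dtype=np.uint8)
    diff[pred_pos & ~ref_pos] = FALSE_POSITIVE   # predicted positive, actually negative
    diff[~pred_pos & ref_pos] = FALSE_NEGATIVE   # predicted negative, actually positive
    return diff
```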

Rename src/deepgeo/utils to src/deepgeo/common

Rename module utils to common.

Implement method to evaluate final classification

This method must compare the final classification map with the ground truth, using the metrics computed in quality_metrics.compute_quality_metrics.

Verify results of raster saved with NDVI and EVI

When opened in TerraView, the saved rasters are all black. Verify whether they were saved correctly.

Change the name of datasetGen module

It should be something like data_manager or a similar name.

Another solution is to move the data augmentation to utils module.

Implement a notebook to rasterize and plot the labels data and reference image

Implement a Jupyter notebook to rasterize and plot a reference shapefile and a reference Landsat image. It will work as a visual test for the rasterizer.

Implement a function to print the summary of the model

Implement a function to print a summary of the model, similar to Keras's model.summary().

https://stackoverflow.com/questions/46560313/is-there-an-easy-way-to-get-something-like-keras-model-summary-in-tensorflow

Create file with plot functions

This file must contain the functions to plot original image, labeled image, samples and data augmentation.

Plot new metrics in tensorboard

Also plot cross-entropy and F1-score in TensorBoard. Merge them with the accuracy in the same "group".

Implement validation method in ModelBuilder

This method should make predictions on some chips and compute metrics such as F1-score, IoU, overall accuracy, precision, recall, and the confusion matrix.

This method should also save these results in a folder ./validation inside the training model directory.
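The metrics listed above can all be derived from a confusion matrix. The sketch below shows one way to do that with NumPy, assuming integer class labels in the range 0 to n_classes - 1. The function names are hypothetical and are unrelated to the repository's quality_metrics module. Saving the results to ./validation is not shown.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are reference classes, columns are predicted classes."""
    y_true = np.asarray(y_true).ravel().astype(np.int64)
    y_pred = np.asarray(y_pred).ravel().astype(np.int64)
    index = y_true * n_classes + y_pred
    counts = np.bincount(index, minlength=n_classes ** 2)
    return counts.reshape(n_classes, n_classes)


def metrics_from_confusion(cm):
    """Per-class precision, recall, F1 and IoU, plus overall accuracy."""
    cm = np.asarray(cm, dtype=np.float64)
    tp = np.diag(cm)
    fp = cm.sum(axis=0) - tp
    fn = cm.sum(axis=1) - tp
    # Classes absent from both maps produce 0/0 and are reported as NaN.
    with np.errstate(divide="ignore", invalid="ignore"):
        precision = tp / (tp + fp)
        recall = tp / (tp + fn)
        f1 = 2 * tp / (2 * tp + fp + fn)
        iou = tp / (tp + fp + fn)
    return {
        "overall_accuracy": tp.sum() / cm.sum(),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "iou": iou,
    }
```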

Implement PCA in preprocessor

Implement a predefined function to compute PCA in the preprocessor. It must receive the number of components to keep as a parameter. To compute it, try to use the following scikit-learn class:

http://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html
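To illustrate the reshaping involved, the sketch below applies PCA across the band axis of an image shaped (height, width, bands). Note that it uses NumPy's SVD rather than the scikit-learn class linked above, purely to keep the example dependency-light; the function name `pca_bands` is hypothetical, and it assumes the image has at least as many pixels as bands.

```python
import numpy as np


def pca_bands(image, n_components):
    """Project the bands of an (H, W, B) image onto its first principal axes."""
    image = np.asarray(image, dtype=np.float64)
    height, width, n_bands = image.shape
    if not 1 <= n_components <= n_bands:
        raise ValueError("n_components must be between 1 and the number of bands")
    # Each pixel becomes one sample, each band one feature.
    pixels = image.reshape(-1, n_bands)
    centered = pixels - pixels.mean(axis=0)
    # Rows of vt are the principal axes, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    projected = centered @ vt[:n_components].T
    return projected.reshape(height, width, n_components)
```

With scikit-learn, the same flattening and reshaping would apply, with `PCA(n_components=...).fit_transform(pixels)` replacing the centering and SVD steps.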

In the rasterizer, allow to select the classes of interest

Allow the user to select the classes of interest in the rasterizer. The selected classes are kept as "interest classes", and all remaining classes are merged into a single "non-class" that has the same value in the rasterized data.
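On an already rasterized label array, this relabeling could be sketched as follows. The function name and the default non-class value of 0 are assumptions; the value chosen for the non-class must not collide with any class of interest.

```python
import numpy as np


def select_classes(labels, classes_of_interest, non_class_value=0):
    """Keep the classes of interest and map every other label to one value."""
    labels = np.asarray(labels)
    if non_class_value in classes_of_interest:
        raise ValueError("non_class_value collides with a class of interest")
    keep = np.isin(labels, classes_of_interest)
    return np.where(keep, labels, non_class_value)
```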

Try to compute Mixture model in the preprocessor class

The mixture model needs some samples of vegetation, soil and shadow. Verify whether there is a way to automate this process. There are some methodologies to compute it according to spectral libraries. Verify.

Implement a method to save a shape file with the extent of the samples

To make it easier to visualize the spatial distribution of the samples, instead of saving several geotiffs, save a shape file with the extent of all the samples.

Implement preprocessor class

The class would be responsible for allowing the user to extract synthetic data or indices (NDVI, EVI, etc.) into a synthetic band to compose the dataset. The API must provide some predefined functions, like NDVI and EVI, but also allow the user to pass a customized function as a parameter. Thus, the system must be able to compute a synthetic band based on this function (or these functions), and it must allow producing more than one band.

This class must also be able to remove classes that the user is not interested in, and to crop the base raster.
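One possible shape for the "predefined plus custom functions" idea is sketched below. Everything here is illustrative: the function names, the channels-last (H, W, B) layout, and the default band indices for red and near-infrared are assumptions, not the repository's API.

```python
import numpy as np


def ndvi(image, red_index=0, nir_index=1):
    """Predefined index: (NIR - Red) / (NIR + Red), with 0 where the sum is 0."""
    red = image[..., red_index]
    nir = image[..., nir_index]
    total = nir + red
    safe_total = np.where(total == 0, 1.0, total)
    return np.where(total == 0, 0.0, (nir - red) / safe_total)


def add_synthetic_bands(image, functions):
    """Append one band per callable; each callable maps (H, W, B) to (H, W)."""
    image = np.asarray(image, dtype=np.float64)
    new_bands = []
    for function in functions:
        band = np.asarray(function(image), dtype=np.float64)
        if band.shape != image.shape[:2]:
            raise ValueError("each function must return an (H, W) array")
        new_bands.append(band[..., np.newaxis])
    return np.concatenate([image] + new_bands, axis=-1)
```

A user-defined index can then be passed alongside the predefined one, for example `add_synthetic_bands(image, [ndvi, lambda img: img[..., 0] * 2])`.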

Add AUC to the TensorBoard metrics

Implement the AUC metric using the function tf.metrics.auc.

reference: https://www.tensorflow.org/api_docs/python/tf/metrics/auc

Implement DatasetGenerator class

This class must be able to generate chips for a list of images and shapefiles using the chipGenerator with a given strategy and produce a single dataset.

Finish the implementation of the SampleGenerator class

Test different chip (patch) sizes

Verify the impact of the patch size on the classification accuracy. Try sizes from 64 to 256. Does it depend on the size of the targets?

Normalize images in the preprocessor class

To improve DNN performance, the data should be normalized, usually to the range -1 to 1 with the mean centered at 0. Verify this in the literature and implement the normalization.
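Two common per-band options are sketched below, assuming a channels-last (H, W, B) array; the function names are hypothetical. They are not equivalent: min-max scaling guarantees the [-1, 1] range but not a zero mean, while standardization guarantees a zero mean but not a bounded range, so the literature check should decide which one fits.

```python
import numpy as np


def scale_to_unit_range(image):
    """Per-band min-max rescaling to [-1, 1]."""
    image = np.asarray(image, dtype=np.float64)
    band_min = image.min(axis=(0, 1), keepdims=True)
    band_max = image.max(axis=(0, 1), keepdims=True)
    # Constant bands get a span of 1 to avoid dividing by zero.
    span = np.where(band_max > band_min, band_max - band_min, 1.0)
    return 2.0 * (image - band_min) / span - 1.0


def standardize(image):
    """Per-band zero mean and unit standard deviation."""
    image = np.asarray(image, dtype=np.float64)
    mean = image.mean(axis=(0, 1), keepdims=True)
    std = image.std(axis=(0, 1), keepdims=True)
    return (image - mean) / np.where(std > 0, std, 1.0)
```

In practice the statistics would be computed on the training data and reused for validation and prediction, rather than recomputed per image.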

Implement unit tests for data augmentation functions

The tests could use synthetic data, such as a 3x3 matrix or something similar.
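A sketch of what such tests might look like with a 3x3 matrix is shown below. Since the repository's augmentation functions are not shown here, NumPy's `np.rot90` and `np.fliplr` stand in for them; the real tests would call the package's own functions and compare against the same hand-computed expectations.

```python
import unittest

import numpy as np


class TestAugmentationOnSmallMatrix(unittest.TestCase):
    def setUp(self):
        # [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
        self.matrix = np.arange(1, 10).reshape(3, 3)

    def test_rotate_90_counterclockwise(self):
        expected = np.array([[3, 6, 9], [2, 5, 8], [1, 4, 7]])
        np.testing.assert_array_equal(np.rot90(self.matrix), expected)

    def test_flip_left_right(self):
        expected = np.array([[3, 2, 1], [6, 5, 4], [9, 8, 7]])
        np.testing.assert_array_equal(np.fliplr(self.matrix), expected)

    def test_four_rotations_restore_input(self):
        np.testing.assert_array_equal(np.rot90(self.matrix, k=4), self.matrix)
```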

Review method geofunctions.load_image

Is the method geofunctions.load_image really necessary?
Is it necessary to convert the data to float32?
Is it necessary to mask it?

Review this method and refactor it if necessary.

Remove prints from methods body

Several of the implemented methods have print statements in their bodies. Remove them.

Search for APIs for Data Augmentation

Keras seems to have some data augmentation functionality. Verify whether other packages offer it as well. It is better to use them instead of implementing it. Verify whether TensorFlow provides some of this functionality.

Crop the files in data directory to generate smaller test data

Generate smaller test data to use in the unit tests.

Refactoring - Change plot chips module to use plot_img_rgb or plot_labels

The contrast is not working in the plot_chips method. Try to use plot_img_rgb or plot_labels in it, depending on the number of channels passed as a parameter or on the other parameters (classes, etc.).

Implement strategy to deal with samples containing "no data" values

Verify whether samples containing "no data" values can confuse the network. If so, implement a strategy to avoid patches containing "no data" values.
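The simplest such strategy is to discard any chip that contains the "no data" value, sketched below. The function names are hypothetical, and the "no data" value is assumed to be known (for example, read from the raster metadata); NaN is handled separately because NaN never compares equal to itself.

```python
import numpy as np


def has_no_data(chip, no_data_value):
    """Return True if any pixel of the chip equals the no-data value."""
    chip = np.asarray(chip)
    if isinstance(no_data_value, float) and np.isnan(no_data_value):
        return bool(np.isnan(chip).any())
    return bool((chip == no_data_value).any())


def drop_no_data_chips(chips, no_data_value):
    """Keep only the chips that are free of no-data pixels."""
    return [chip for chip in chips if not has_no_data(chip, no_data_value)]
```

A softer variant would keep chips whose fraction of "no data" pixels stays below a threshold, which may matter near scene borders.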

Change unit tests to use cropped data

Unit tests are currently using larger data. Change them to use cropped data to fix the failing build on Travis.

Implement U-Net with Late Fusion

Implement a U-Net version with the time fusion between the encoder and the decoder.

Implement a method to get a band from a numpy array

Implement a method or a class to retrieve a raster band from a given numpy array, or even a class that encapsulates the numpy array. It would make it easier for the user to deal with the raster. The class can also take the path to the input raster as a parameter.
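One way the encapsulating class could look is sketched below. The class name `RasterArray`, its attributes, and the channels-last (H, W, B) layout are all hypothetical; loading the array from the path is not shown, and the path is only stored.

```python
import numpy as np


class RasterArray:
    """Thin wrapper around an (H, W, B) array with optional band names."""

    def __init__(self, data, band_names=None, path=None):
        self.data = np.asarray(data)
        if self.data.ndim != 3:
            raise ValueError("data must have shape (height, width, bands)")
        n_bands = self.data.shape[-1]
        if band_names is None:
            band_names = [str(i) for i in range(n_bands)]
        if len(band_names) != n_bands:
            raise ValueError("band_names must have one entry per band")
        self.band_names = list(band_names)
        self.path = path  # where the raster came from, if known

    def get_band(self, band):
        """Return one band as an (H, W) array, by integer index or by name."""
        index = self.band_names.index(band) if isinstance(band, str) else band
        return self.data[..., index]
```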