mdbloice / augmentor Goto Github PK

View Code? Open in Web Editor NEW

5.0K 125.0 863.0 1.54 MB

Image augmentation library in Python for machine learning.

Home Page: https://augmentor.readthedocs.io/en/stable

License: MIT License

Python 100.00%

augmentation machine-learning deep-learning neural-networks

augmentor's People

Contributors

Stargazers

Watchers

Forkers

lradacher jonny911 yananjian gitter-badger bfour joestrummer82 python3pkg dabauxi s1mona mwessley diego999 stevenlol jdc08161063 wubizhi ericustc realzheng benjamesbabala clcarwin fuxicv yemika dreadlord1984 chelovekhe xy0806 labimage starstylesky ml-lab tangyoubao boozyguo braveapple hellodrx stefanopini xhwxd yanghaha11514 mave5 chaichou juventi tsingjinyun tanshen kli-casia bauerzhou gzzgz zxdeepdiver ghzhangnj zshanwei johndpope shannonyu joeyzhyp tangal0203 remyyang zhunzhong07 qianwenjun0801 lxwithgod misha-zhukov juneoh alesolano vijayendra-g rlnx embeddedsamurai gninnur stnk20 rosssong nhu2000 visualead yangpan011 inkimage danache xinario zbxzc35 nguyenhongchau donnyyou marvis yiliangnie warmstar1986 zhangleiedu prabhuiitdhn clustersdata valeman two222 mapillary garrylau backyes laodar gpnuelvis qaavi ronkg li-lai go2carter gachiemchiep linhanxiao mdraw h7474 cvtower firur maludaga zouliangyu mtcld matt-deboer ella0102 gauthiermartin hyperdrive

augmentor's Issues

Possible to run multiple pipeline at the same time

I was wondering if it was possible to run multiple pipeline at the same time

Here is what im trying to achieve, we are trying to create multiple pipeline for each of our classes and then trigger random augmentation on each of them now we run on job after the other but is it possible to run like 2 or 3 at once ?

Adding black_and_white to the pipeline should force a lossless format (Not JPEG)

Currently, even if you specify a black and white operation but don't change the default picture format (.jpeg) the resulting picture contains values besides 0 and 255 due to compression.

To be clear black_and_white is not malfunctioning, changing to a lossless format (.png for example) when creating the pipeline fixes the problem.

apply_current_pipeline fails trying to save the file

It looks like output_directory is not available for apply_current_pipeline.

from Augmentor import Pipeline

aug = Pipeline(source_directory='./',
               output_directory='out')

aug.apply_current_pipeline('pic.png', save_to_disk=True)

Traceback (most recent call last):
  File "bug.py", line 6, in <module>
    aug.apply_current_pipeline('pic.png', save_to_disk=True)
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 268, in apply_current_pipeline
    return self._execute(AugmentorImage(os.path.abspath(image_path), None), save_to_disk)
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 205, in _execute
    image.save(os.path.join(augmentor_image.output_directory, file_name), self.save_format)
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/posixpath.py", line 78, in join
    a = os.fspath(a)
TypeError: expected str, bytes or os.PathLike object, not NoneType

module 'Augmentor' has no attribute 'Pipeline'

Thanks for the wonderful library. But I am having an issue here. First time it worked alright but now as soon as I import Augmentor in Python I get the message

builtins.AttributeError: module 'Augmentor' has no attribute 'Pipeline'

Is there anything I can do about that?

tif files are not loaded

from Augmentor import Pipeline
from skimage.io import imread

aug = Pipeline(source_directory='./',
               output_directory='out')

print(imread('sample.tif'))
aug.apply_current_pipeline('sample.tif')

Let's try to read a file using scikit-image to be sure it is valid tiff file, and see how Augmentor fails:

Initialised with 1 image(s) found in selected directory.
Output directory set to ./out.
[[[3366 2681 1454 4441]
  [3248 2588 1490 4039]
  [3422 2731 1579 4285]
  ...,
  [3659 3072 1881 7845]
  [3733 3154 1954 8042]
  [3751 3110 1889 7561]]

 [[3357 2647 1488 4480]
  [3161 2559 1437 4037]
  [3400 2719 1584 4146]
  ...,
  [3645 3124 1882 7944]
  [3642 3137 1811 8230]
  [3690 3155 1925 7592]]

 [[3409 2690 1481 4475]
  [3343 2637 1539 4038]
  [3444 2764 1626 4112]
  ...,
  [3798 3186 1921 7724]
  [3863 3266 2034 8210]
  [3662 3080 1836 7586]]

 ...,
 [[3670 3012 1836 5635]
  [3654 3005 1810 5861]
  [3545 2963 1774 5925]
  ...,
  [3567 2858 1712 6473]
  [3706 2971 1852 7023]
  [3742 3049 1853 7311]]

 [[3677 3031 1837 5715]
  [3599 3015 1815 5769]
  [3593 2994 1817 5706]
  ...,
  [3620 2938 1757 7244]
  [3702 3052 1834 7525]
  [3696 3098 1857 7699]]

 [[3540 2968 1778 5665]
  [3572 2987 1818 5662]
  [3549 2952 1805 5710]
  ...,
  [3611 3023 1809 7788]
  [3657 3114 1822 7792]
  [3717 3168 1896 7859]]]
Traceback (most recent call last):
  File "bug.py", line 8, in <module>
    aug.apply_current_pipeline('sample.tif')
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 268, in apply_current_pipeline
    return self._execute(AugmentorImage(os.path.abspath(image_path), None), save_to_disk)
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 189, in _execute
    image = Image.open(augmentor_image.image_path)
  File "/Users/Arseny/.pyenv/versions/3.6.0/lib/python3.6/site-packages/PIL/Image.py", line 2452, in open
    % (filename if filename else fp))
OSError: cannot identify image file '/Users/Arseny/dev/kaggle/amzn/sample.tif'

Constructing Augmentor.Pipeline with a file path pointing to a directory that contains other directories results in the augmentor not finding any images

When I had a directory with other directories within it and images, the Augmentor.Pipeline() finds 0 images.

how to install this by anaconda

how to use conda install this lib
my default python interpreter is anaconda, if i use pip install this lib I can't use it
I use pycharm(backend by anaconda as interpreter)

Grayscale class

I've been trying to add the grayscale option to the pipeline, as said in the documents, but when I do it shows the following error.
>>> p.greyscale(probability=1.0) Traceback (most recent call last): File "<stdin>", line 1, in <module> AttributeError: 'Pipeline' object has no attribute 'greyscale'

I'm using Python 2.7.12.

Perform image operations in-memory

It should not be default that images need to be stored to disk. Instead the operations should be performed in-memory such that operations can be chained easily

This comparison statement seems not right. The warning message is shown even if the image format is one of the two formats.

Augmentor/Augmentor/Pipeline.py

Line 337 in 06ac333

if image_data_format != "channels_first" or "channels_last":

Summary functions.

Both ImageSource and Pipeline object require functions, named summary that output details regarding that object's state. For the ImageSource class that is a run down of the image sources that are currently loaded. The Pipeline's summary should output the operations that have been added so far,

Also, the ImageSource and Pipeline's summary function should output tables in Markdown valid syntax. This would mean that if the package is being used in Jupyter, a nicely formatted table will render when summary is called. Also, they would look fine in a REPL interpreter.

It should output valid Markdown such as the following output from an ImageSource object:

| Summary           |                    |
|:-----------------:|--------------------|
| Image count       | 120                |
| Dimensions        | 480x480 (120)      |
| Aspect ratio(s)   | 1:1                |
| File type(s)      | 1                  |
| File extension(s) | BMP                |
| File format(s)    | Grayscale          |

Example rendered output for an ImageSource object:

Summary
Image count	120
Dimensions	480x480 (120)
Aspect ratio(s)	1:1
File type(s)	1
File extension(s)	BMP
File format(s)	Grayscale

Note that the idea is that summary provides an overview of the interesting properties of the data source. For example it could be the case that all images are of different size, in that situation it wouldn't be very informative to print every size there is. However, if it is still the case that all images are of the same aspect ratio than that information is interesting and worth printing.

Define a class label / class integer map - content of image

I am new to this and excuse the question if it is too silly.
I am trying to construct a different set of images (rather than with mnist as basis) and have created a set of basic images.
In the integer_labels assigning part, it is not clear how the int:label to be assigned. the content of my image is not an integer but an image. Is it okay if the following syntax is used:
integer_labels = {'a':0, 'b':1, 'c':2} where 'a' is the content of the image?
in the given example of MNIST, it is '0', '1' etc. which makes it straight forward. thank you for the support. BTW, I have stored the images as .png files (0.png, 1.png etc), if that is a clue to what is going wrong
Error: Image.fromarray(x_array) - TypeError: Cannot handle this data type

Rotate to the same angle?

The description of the function "rotate" says: "If you wish to rotate the images by an exact number of degrees, set both :attr:max_left_rotation and :attr:max_right_rotation to the same value."
I am confused by this. If I set the attr:max_left_rotation and :attr:max_right_rotation to the same value, then which angle does it rotate? To left or to right?

How to change the output filename and output fileformat

@Evizero @jonny911 @YananJian

Zoom-out functionality available but turned off?

First thing - thanks for this great repo!

The input for the zoom operation is limited to min_factor>1.
I actually needed to use it for a "zoom out" functionality and simply disabled the input check - then everything worked as expected.
Should this input check even be there? I know for most users adding a black margin for the "zoom out" effect is unwanted, but they can simply only use min_factor>1 - no need to forbid it.
What do you think?

Reproducible issue: black image (all pixels are zero) after rotate

from skimage import io
import Augmentor.Operations as ops
from PIL import Image
import numpy as np
from matplotlib import pyplot as plt

sample = io.imread('train.jpg')
rotate = ops.RotateRange(probability=1, max_left_rotation=50, max_right_rotation=10)

for i in range(6):
  img_ro = np.array(rotate.perform_operation(Image.fromarray(sample)))
  plt.imshow(img_ro)
  plt.show()

Sometimes, you can get a black image after rotate the image.

Image processing gets slower and slower as time goes on

Feeding Pipeline with numpy array

The library looks pretty good. I would like to ask about a feature.

In addition to reading from directory and writing to directory, Is it possible to add a feature that can read from numpy array and returns augmented numpy array images?

skew(probability=0.6,skew_type="CORNER",magnitude=4)

@S1mona @jonny911 @Evizero

When i use skew(probability=0.6,skew_type="CORNER",magnitude=4) getting error of TypeError: skew() got an unexpected keyword argument 'skew_type'

Pipeline.image_generator() causes out of range index

This line causes out of range index:

Augmentor/Augmentor/Pipeline.py

Line 300 in eae7394

im_index = random.randint(0, len(self.augmentor_images))

It should be:
im_index = random.randint(0, len(self.augmentor_images-1))
Or clearer way:
img= random.choice(self.augmentor_images)

Create 'Read the Docs' documentation

Read the Docs is a really nice way to provide documentation to users. The Julia version of the documentation can be seen here:

http://augmentorjl.readthedocs.io/en/latest/index.html

We should do the same for the Python version for consistency.

Option for absolute output path

Hi,

currently, only a relative output path can be defined in Pipeline(). It would be helpful to be able to specify an absolute output path.

Cheers

Add support for semantic segmentation problems

It would be useful if the pipelines could be applied in parallel to both the image and label dataset for segmentation problems. It is possible to sync two pipelines and have them operate independently on the input images and output labels, but it would be useful to support both

Unit Tests for lossless image operations

Each lossless image operation should have a test in place that makes sure it does what it promises to do. Lossless in this context just means that there is no interpolation or extrapolation going on. For example flipping the x dimension is just a rearrangement of pixels (As a negative example rotating 3 degrees forces one to interpolate between neighbouring pixels in order to get good results).

Since testing these kind of lossless transformations is a lot easier than the others they deserve separate issues. In the Julia version I test those using an artificial 2x2 pixel which I then check manually. This approach seems to work pretty well. See: https://github.com/Evizero/Augmentor.jl/blob/master/test/tst_crop.jl

How to use the function of GaussianDistortion

Inside output folder an empty directory named "0" is created

While I processed a batch of multiple files in the output folder besides output files an empty directory with name "0" is created.
Same things does not happen if I have only 1 file to be processed inside my Pipeline.

This is a small reproducer

pipeline = Augmentor.Pipeline(source_directory="my_input",
                    output_directory="test_out", save_format="PNG")
        pipeline.random_distortion(probability=1, grid_width=3, grid_height=4, magnitude=3)
        pipeline.sample(100)

in this case folder 'my_input' contains multiple image files.

rotation changes the scale of the image

Just curious about the rotation operation. Shouldn't rotation preserve the scale of the image? I don't see the benefits of cropping out the largest rectangular area of the rotated image just to get rid of the black padding. In my opinion, scale and rotation should be isolated.

Also there're too many interfaces to rotation and sometimes they are confusing.

Anyway, awesome work.

PyPI structure compliancy.

Structure the project to make it compatible for upload to the Python Package Index (PyPI).

Object detection

Hey. Is data augmentation supported for object detection tasks?

histogram_equalisation method does not recover from IOError

While using histogram_equalisation method i got multiple error that go raised by PIL.

Here is the error returned by my terminal

Processing 000010.jpg: 81%|███████████████████████████████████████████████████████████████████████████████████████████████████▋ | 342/422 [00:29<00:06, 12.99 Samples/s]Output directory set to /home/spark/Desktop/Augmentation/images/Pont Jacques-Cartier/_histogram_equalisation.Traceback (most recent call last):
File "core/main.py", line 43, in
main = Main()
File "core/main.py", line 10, in init
self.main()
File "core/main.py", line 40, in main
augmentator.execute_all_single_image_augmentation()
File "/home/spark/Projects/image-augmentation-tool/core/augmentator.py", line 457, in execute_all_single_image_augmentation
self._histogram_equalisation()
File "/home/spark/Projects/image-augmentation-tool/core/augmentator.py", line 229, in _histogram_equalisation
pipeline.sample(count)
File "/home/spark/.virtualenvs/image-augmentation-tool/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 254, in sample
self._execute(augmentor_image)
File "/home/spark/.virtualenvs/image-augmentation-tool/lib/python3.6/site-packages/Augmentor/Pipeline.py", line 192, in _execute
image = operation.perform_operation(image)
File "/home/spark/.virtualenvs/image-augmentation-tool/lib/python3.6/site-packages/Augmentor/Operations.py", line 123, in perform_operation
return ImageOps.equalize(image)
File "/home/spark/.virtualenvs/image-augmentation-tool/lib/python3.6/site-packages/PIL/ImageOps.py", line 248, in equalize
return _lut(image, lut)
File "/home/spark/.virtualenvs/image-augmentation-tool/lib/python3.6/site-packages/PIL/ImageOps.py", line 57, in _lut
raise IOError("not supported for this image mode")
OSError: not supported for this image mode

param name out of sync with readme & code?

Hi,

Thank you for the library.

I was trying to run with my own sample found that, it seems there is some changes not getting reflected in readme?

Eg., as of now readme & docs site both says

`
p.rotate(probability=0.7, max_left=10, max_right=10)
p.zoom(probability=0.5, min_scale=1.1, max_scale=1.5)

`
but i found out that the arguments have a different name (based on autogenerated documentation)

p.rotate(probability=0.7, max_left_rotation=10, max_right_rotation=10) p.zoom(probability=0.5, min_factor=1.1, max_factor=1.5)

Iterate over mini-batches of image source

It should be possible in a convenient way to use an image source as an iterator to iterate over the whole dataset one image at a time or in mini-batches of a specified size

Adding operations on keypoints/landmarks?

This tool is awesome, especially for the elastic distortion function. Is is possible to add operations on given keypoints/landmarks, which means the inputs and outputs are images and corresponding points? This is helpful for alignment problems.

Error writing processed data when multi-procesed

Processing fer2013_6_35656.png: 100%|█████████▉| 18872/18888 [03:42<00:00, 86.62 Samples/s]Error writing 6_sadness__bc322a31-1668-412c-9314-4ea527d9666c.JPEG.
Error writing 6_sadness__e97e2502-e259-4a05-ac95-d9f1c67c1e90.JPEG.
Error writing 6_sadness__4739eb49-99fa-4e54-92db-77c770248bf6.JPEG.
Error writing 6_sadness__8754065a-56fd-494f-9d51-719d71552011.JPEG.
Error writing 6_sadness__0e20f205-97fc-4d14-9c51-044033f8c404.JPEG.
Error writing 6_sadness__0e14d4ed-93e6-4132-a20c-e6f6199a0e42.JPEG.
Error writing 6_sadness__a68996d9-c0a6-4079-999f-1a3536cf6edb.JPEG.
Error writing 6_sadness__fa9383c6-a94f-4466-afc0-359efe320a62.JPEG.
Error writing 6_sadness__8f877a1d-d65e-4d13-97ce-e92f03612860.JPEG.
Error writing 6_sadness__30e297ef-2996-45fb-bf42-e458b1b2ed21.JPEG.

Hi, I run the pipeline in three loops, applying different operation strategy for each one, then I got writing error for the last two loops. The code follows:

for f in glob.glob(root_dir):
    if os.path.isdir(f):
        folders.append(os.path.abspath(f))
        
print("Folders (classes) found: %s " % [os.path.split(x)[1] for x in folders])

pipelines = {}
for folder in folders:
    print("Folder %s:" % (folder))
    pipelines[os.path.split(folder)[1]] = (Augmentor.Pipeline(folder))
    print("\n----------------------------\n")

for p in pipelines.values():
    p.flip_left_right(probability=1)
    p.sample(len(p.augmentor_images))
    
for p in pipelines.values():
#    p.flip_left_right(probability=1)
    p.rotate(probability=1,max_left_rotation=15,max_right_rotation=15)
    p.sample(len(p.augmentor_images)*6)
    
for p in pipelines.values():
    p.flip_left_right(probability=1)
    p.rotate(probability=1,max_left_rotation=15,max_right_rotation=15)
    p.sample(len(p.augmentor_images)*6)
    print("Class %s has down processing %s samples." % (p.augmentor_images[0].class_label, len(p.augmentor_images)))

(edited by @Evizero for readability)

Randomizing the images used by the pipeline

Is there a way to randomize the images the pipeline processes? For me it always takes the images one by one in the listdir() list... Kind of a problem when I perform the same program several times.

Unable to save out images after augmenting

Hello,

I installed Augmentor successfully using pip. After that I was able to write a short script to augment images and ran it without any issue.

Today, when I tried to run the script on a different set of images, I got the error :
Error writing f8aee8ea-418c-4742-a977-5896b2b160d5, .
Change save_format to PNG?
You can change the save format using the set_save_format(save_format) function.

I tried using set_save_format("auto") and the problem still persists. Can you please help?

How to pass 2 images through the same pipeline (with same decisions)

I have paired images (grayscale and colored ones). How can I use Augmentator to feed pairs and get output pairs that have passed for the same augmentations decisions?

How to make this work with multiple masks?

For any given image, there may be multiple masks (for used in instance segmentation). How does the ground_truth function work with this?

augment only a single, or set of images

I was wondering if it is possible to use Augmentor to augment a single image or augment multiple images in exactly the same manner (e.g. multiple np.array as input, not filename)? if you have image + Masks, it may be necessary to augment masks the exact same way.

so something like:

i1 = cv2.imread (f1)
i2 = cv2.imread (f2)

x = []
x.append (i1)
x.append (i2)
print(np.array(x).shape)

rotate = ops.RotateRange(probability=0, max_left_rotation=50, max_right_rotation=10)

img_ro = np.array(rotate.perform_operation(Image.fromarray(x)))
for ii in img_ro:
  plt.imshow(ii) # each image in x is rotated in the same way
  plt.show()

How to crop a rotated image without losing content?

When using the rotation function the automatic crop removes some of the content of the original image.

Here is an example:

Is there a way to avoid this?

I looked at the source code and found no way to pass parameters that control the crop mode but maybe there is some other way?

Can I achieve this by defining a new operation? It feels like overkill to do so.

Pipelining multiple image with a probability of 1.0 to trigger each function

This is actually more a question then an issue.

I was wondering if their was an existing feature that grants us the ability to trigger each augmentor function on a set of images with a percentage of 100% in a way that every image in a dataset for exemple get generated for each available function of augmentor ... if not is their an easy way you could think of for doing this.

Use Augmentor to predict a image, but don't know how to pre-process test image

Hi,
Now, I am using Augmentor to get a generator and use Keras fit_generator function to train my model.
After I trained, I am facing a problem.
I don't know how Augmentor pre-processing the train set. But as a result it normalized the input image value between 0 and 1. So I don't know how to pre-process my test set so that it can be predict from the trained model.
I tried to devide my test set by 255.. but the result is wrong...
Can you give me any help? Or just show how you reappear your image processed by Augmentor. Then I will know how to go on.
Thank you !

Using Augmentor with TensorFlow

How can i use Augmentor on the fly with tensorflow? There's an example with Keras. Can you provide an example with tensor flow too?

Something wrong occured when applying function ' ground_truth() function'

When I was using your demo code to augment original and mask images identically, I found I got different number of augmented mask and images! I can't figure out why this happened. This meant the mask and image were NOT AUGMENTED IDENTICALLY, were they? Could you please help me to deal with this question, thanks.

How to pass an image path as parameter and only preprocess it?

It seems Augmentor needs a work directory as the parameter and it will preprocess all files in it.
However, we always need to preprocess a specific image and use the image path as the parameter.
So how to use Augmentor in this situation?
Thank you.

Images for README.md

Keras Generator with RGB images

Hello,
I'm trying to create a keras generator using the Augmentor library but I'm running into an issue with the Pil library. My training data is in a numpy array of form [num_samples, width, height, num_channels] where num_channels =3. I tried running the jupyter notebook Augmentor_Keras_Array_Data.ipynb using the mnist dataset and it worked fine, but with the 3 layer images once I try to visualize a sample using the X, y = next(generator) function I get the following errors:

File "C:/Users/amand/Documents/Masters Thesis/Cow Injury Project/cow_injury_CNN/augment_dataset.py", line 61, in
X, y = next(generator)

File "C:\Anaconda3\envs\ConvNeuralNet\lib\site-packages\Augmentor\Pipeline.py", line 489, in keras_generator_from_array
numpy_array = self._execute_with_array(np.reshape(images[random_image_index], (w, h, l)))

File "C:\Anaconda3\envs\ConvNeuralNet\lib\site-packages\Augmentor\Pipeline.py", line 226, in _execute_with_array
pil_image = [Image.fromarray(image)]

File "C:\Anaconda3\envs\ConvNeuralNet\lib\site-packages\PIL\Image.py", line 2426, in fromarray
raise TypeError("Cannot handle this data type")

TypeError: Cannot handle this data type

Is this function designed to work with RGB images?

Resolution for output images

Marcus, Thanks for sharing your nice augmentation tool. I really like it.

I am wondering how to keep output images to be the same resolution as input images after finishing augmentation. I do see that there is a loss of resolution for augmented images. Please let me know where I could make a change for the code in order to keep the same resolutions. Thank you for your advice!

Best regards,
Peter

How to preserve the name of the files?

When a file image110.png gets passed into the pipeline, out comes out something like parent_dir_png_original_.... where ... is a hash. Is there a way to know which image augmented image is related to the original image?

not an issue - potential break through in 3d Point cloud scanning inference

I watched this video this morning on 3d point cloud + ARKit
https://www.youtube.com/watch?v=kupq1C41XcU&feature=youtu.be
and it seems like a trained Augmentor model could help bridge the inference here in conjunction with trained model. Not sure if you agree, or if this is the correct repo for this - perhaps it is a new project.

I guess as a feature request / potential enhancement for Augmentor to solve / (unless you can think of something better or maybe it already does this)
we need a way to guess (train a model) the transformation necessary to go from one transformation to the other.
eg. given a view is transformed from
A -> B
// what was the transformation??? then from this glue / and some kalman filters - retrofit the point cloud.

Step 2 could be to isolate this to bounding box. just thinking out loud here.

this is the code from video above
https://github.com/johndpope/ARKitExperiments