I use train_ssd7 to train 67 pictures of udacity dataset provided by you, but it seems

train_ssd7 can not converge with small dataset (ssd_keras issue, closed)

pierluigiferrari commented on May 24, 2024

train_ssd7 can not converge with small dataset

from ssd_keras.

Comments (4)

mikeszabi commented on May 24, 2024

I am trying to use ssd_7 to detect power pylons, which have pretty large aspect ratios (height/width). Small datasets (~200 images) would not converge for me either. I wonder whether it is because of the unusual aspect ratios or because of the lack of enough training images...
My aspect ratios are:
aspect_ratios = [2.5, 3.5, 4.5, 5.5]

luckyuho commented on May 24, 2024

Hello, pierluigiferrari.
I have tried ssd7 with about 2000 images. It runs, but the predictions are bad.
I still cannot figure out why we need so much data just to predict our own training data, so I assume it may be caused by the difficult environment and the variety of objects.
Therefore, I built an environment myself in Unity, like this.

I want all cars and people to be of the same type, so the only differences would be color and scale.
What I am trying to do is recognize roads, people, and cars, like below.

I want to try an easy dataset first to learn how the network works, but I do not know how much data is enough. All I know is the more the better ==".
So how many images do you think ssd7 needs to work in this environment?
Thank you!!^^

pierluigiferrari commented on May 24, 2024

I've just tried to overfit a small fraction of the Udacity dataset (120 images), and it overfits quite nicely:

I cannot reproduce the issue you're having. That being said, I did just make a change to the loss function in a recent update. Maybe pull the latest commit and try again with the updated version.

You can also use the new random_sample option of BatchGenerator's parse_csv(). It allows you to randomly sample a fraction of a dataset. In my overfitting experiment I used 0.005 of the original Udacity dataset.
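
To illustrate what sampling a fraction of a dataset means here, the following is a minimal standalone sketch. It is not the implementation behind `parse_csv()`; the function name `sample_fraction`, the tuple layout of the labels, and the toy data are all assumptions made for the example. It samples whole images, so every box of a chosen image is kept.

```python
import random
from collections import defaultdict

def sample_fraction(labels, fraction, seed=0):
    """Keep all boxes belonging to a randomly chosen fraction of the images.

    labels: list of (image_name, xmin, xmax, ymin, ymax, class_id) tuples.
            This layout is hypothetical, not necessarily the ssd_keras CSV format.
    """
    # Group the box annotations by the image they belong to.
    boxes_per_image = defaultdict(list)
    for row in labels:
        boxes_per_image[row[0]].append(row)

    image_names = sorted(boxes_per_image)
    n_keep = max(1, round(fraction * len(image_names)))

    # A seeded generator makes the subset reproducible between runs.
    rng = random.Random(seed)
    chosen = rng.sample(image_names, n_keep)

    return [row for name in sorted(chosen) for row in boxes_per_image[name]]

# Toy data (made up): 200 images with 2 boxes each.
labels = [(f"img_{i:04d}.jpg", 10, 50, 20, 60, 1)
          for i in range(200) for _ in range(2)]
subset = sample_fraction(labels, 0.05)
print(len({row[0] for row in subset}), "images,", len(subset), "boxes")
```

Sampling by image rather than by CSV row avoids ending up with images whose ground truth is only partially present, which would mislabel real objects as background.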

@mikeszabi a small number of training images is not what keeps it from converging. Fewer images make the weights converge faster (and overfit, of course).

Be careful with the aspect ratios though: using large aspect ratios without adjusting the scaling factors at the same time could lead to anchor boxes that are larger in one spatial dimension than the receptive field of their predictor layer. If you wanted to do this in a systematic way, you would really have to compute what your aspect ratios actually mean in terms of the sizes of the resulting anchor boxes in pixels and compare that to the receptive fields of the respective predictor layers. I know it sounds tedious to do that, but training an SSD is non-trivial in this respect.

I recommend reading and understanding the first part of the code of SSDBoxEncoder's generate_anchor_boxes() method. The widths and heights of the anchor boxes are computed as follows:

w = scale * size * sqrt(aspect_ratio)
h = scale * size / sqrt(aspect_ratio)

You might have to decrease the scaling factors in order to get anchor box sizes that make sense for the respective predictor layers. I believe it is likely that decreasing the scaling factors would help.
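
The check described above can be sketched in a few lines of Python. This is only an illustration of the two formulas, not code from ssd_keras: `size` is assumed to be the shorter image side in pixels, and the `scale` and `receptive_field` values are made-up placeholders that you would have to replace with the numbers for your own predictor layers. Note that the formulas imply `w / h == aspect_ratio`, so ratios above 1 give boxes that are wider than tall; if the library follows them as written, tall objects would presumably need ratios below 1.

```python
import math

def anchor_box_sizes(size, scale, aspect_ratios):
    """Return (aspect_ratio, width, height) in pixels, using the formulas above."""
    return [(ar,
             scale * size * math.sqrt(ar),   # w = scale * size * sqrt(aspect_ratio)
             scale * size / math.sqrt(ar))   # h = scale * size / sqrt(aspect_ratio)
            for ar in aspect_ratios]

def oversized(boxes, receptive_field):
    """Return the boxes whose longer side exceeds the receptive field (in pixels)."""
    return [(ar, w, h) for ar, w, h in boxes if max(w, h) > receptive_field]

size = 300             # assumption: shorter image side in pixels
scale = 0.3            # hypothetical scaling factor of one predictor layer
receptive_field = 200  # hypothetical receptive field of that layer, in pixels

boxes = anchor_box_sizes(size, scale, [2.5, 3.5, 4.5, 5.5])
for ar, w, h in boxes:
    print(f"aspect_ratio={ar}: w={w:.1f}px, h={h:.1f}px")

too_big = oversized(boxes, receptive_field)
print("larger than the receptive field:", [ar for ar, _, _ in too_big])
```

Lowering `scale` shrinks both dimensions proportionally, which is why decreasing the scaling factors is the natural knob to turn when an extreme aspect ratio pushes one side of the box past the receptive field.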

luckyuho commented on May 24, 2024

Oh, it's my fault!
And random sampling is really important when we only have a small dataset!! (my little experiment)
Thanks a lot for your help, pierluigiferrari!!!
