Git Product home page Git Product logo

Comments (16)

argman avatar argman commented on July 24, 2024

I have tried pva, it can get comparable result with resnet(need pretrain on imagenet).
IOU is just part of the loss, you should also see the regression part, during training you should set a balance parameter between the two losses. I do not recommend rotation augmentation as it may cause some un-agreement of text orientation and the start point of text polygon.

from east.

blankWorld avatar blankWorld commented on July 24, 2024

thanks! so you just use random scaling to augment data as training with res_net-50?
In my training I use Iou loss and cosine loss to regress rbox and class balancing softmax loss to predict the mask of shrinked text polygon, what is the regression part? whether you mean that in pva training you used other rbox regression loss besides Iou and cosine loss?

from east.

argman avatar argman commented on July 24, 2024

Do you use imagenet pretrained models?

from east.

blankWorld avatar blankWorld commented on July 24, 2024

yes I do

from east.

argman avatar argman commented on July 24, 2024

do you have a code repo? maybe some bugs..

from east.

argman avatar argman commented on July 24, 2024

you can check this code first, its easy to run

from east.

blankWorld avatar blankWorld commented on July 24, 2024

OK I go to check.. 5555 so sad

from east.

blankWorld avatar blankWorld commented on July 24, 2024

sorry to bother you again, will dice loss influence much in pva training? I implemented entirely base upon paper's method

from east.

argman avatar argman commented on July 24, 2024

In my experiment, dice loss perform much better..

from east.

zxDeepDiver avatar zxDeepDiver commented on July 24, 2024

@argma Hi,what do you mean by

some un-agreement of text orientation and the start point of text polygon.

As far as I know, the rotation augmentation is used in many recent papers on text detection. Is there any difference between EAST and the other algorithms on dealing with the rotation augmentation?

from east.

argman avatar argman commented on July 24, 2024

@zxDeepDiver , in east, there is upper, right, down and left side of a polygon, when rotated, this order can change, i think other methods encounter this problem too.

from east.

blankWorld avatar blankWorld commented on July 24, 2024

hi! argman, now my iou average training loss over one mini-batch can reduce to 0.18( the loss I used is 1.0 - Iou, 0,18 means that training Iou is 1 - 0.18) and my validation average Iou loss over all test sets which has ignored 'do not care' region reduces to 0.44.
could you share your training and validation loss in the last few iterations?

from east.

argman avatar argman commented on July 24, 2024

I do not test that, why do you test the final output ?

from east.

rmmal avatar rmmal commented on July 24, 2024

@argman do you have a tensorflow slim code for pvanet ?

from east.

bharatsubedi avatar bharatsubedi commented on July 24, 2024

do you have a tensorflow slim implementation code for pvanet?

from east.

hvags avatar hvags commented on July 24, 2024

I have tried pva, it can get comparable result with resnet(need pretrain on imagenet).

Could you please advice on how you did the pre training? More specifically:

  • How to isolate the backbone and train only this
  • Any suggested subset of Imagenet for training

Thanks.

from east.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.