Git Product home page Git Product logo

Comments (13)

MichalBusta avatar MichalBusta commented on August 28, 2024 1

from e2e-mlt.

MichalBusta avatar MichalBusta commented on August 28, 2024 1

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

@MichalBusta

from e2e-mlt.

MichalBusta avatar MichalBusta commented on August 28, 2024

print('bad image')

you can add:
import sys, traceback
traceback.print_exc(file=sys.stdout)
after 'except:'

to see what is going on

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

Thank you so much for your reply,it seems that in ocr_test_utils.py
1
this fuction needs four outputs but just unpack three,when i add one output
det_text, conf, dec_s, _=print_seq_ext()
the problem solves!
thanks you ~

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

besides, i have one question to ask for your help~
can i just use the ocr model to train a seperate word recognition model(just to achieve recognition task).if i can do like this,how large dataset should i have?(i just want to test on icdar2015 word recognition dataset )
thank you ! @MichalBusta

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

Could you please give me some advice?Thanks a lot~ @MichalBusta

from e2e-mlt.

MichalBusta avatar MichalBusta commented on August 28, 2024

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

hello,when i use the dataset from IC15、IC17MLT&IC19MLT(only use the latin words,about 70000word images),and only run the train_ocr.py,but the accuracy on ic15 test word dataset can just achieve 56.3% accuracy ,the batchsize i used equals to 4,could you please give me some advice why the accuracy is so low,and what should i do to improve it ?
4

Thanks a lot~
@MichalBusta

from e2e-mlt.

MichalBusta avatar MichalBusta commented on August 28, 2024

No easy answer sorry :)

from e2e-mlt.

coorful avatar coorful commented on August 28, 2024

ok,i will try.thanks so much for your reply~

from e2e-mlt.

alwc avatar alwc commented on August 28, 2024

@MichalBusta I also have some questions regarding your text recognition model.

1/ For latin languages, did you train your text recognition models with single word images only (i.e. no text lines)?

2/ How many images did you train your text recognition model with?

3/ Your text recognition model seems to use a ResNet like structure. Since this project focus on real-time, have you tried to train your text recognition model with a MobileNet backbone?

from e2e-mlt.

alwc avatar alwc commented on August 28, 2024

Thanks @MichalBusta ! One more thing, for the 500k images, it consists of an equal share between Arabic, Bangla, Chinese, Japanese, Korean and Latin (i.e. ~80k images for each script)?

from e2e-mlt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.