Git Product home page Git Product logo

mangaocr's Introduction

MangaOCR

This project is currently still being improved to resolve outstanding problems.

Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with PaddePaddle framework and PaddleOCR library.

This MangaOCR is inspired by an old project called manga-ocr built by kha-white and other contributors. We admit that although kha-white's manga-ocr model has excellent performance, the size of the model is large (444Mb) and the recognition speed is a bit slow. Thus, this project is built with our goal to provide another end-to-end lightweight model for text recognition in manga. And to achieve performance equivalent to manga-ocr, we also aim to handle a variety of special text types in manga:

  • both vertical and horizontal text
  • text with furigana
  • text overlaid on images
  • wide variety of fonts and font styles
  • low quality images

Result

Up until now, our model has achieved the following:

  • Small model size with only about 8Mb
  • Recognize both vertical and horizontal text
  • Recognize text overlapping the image
  • Recognize with different fonts and font styles

And things that haven't been achieved yet:

  • Better recognition results when compared with manga-ocr. Currently, our model has a CER(Character Error Rate) result of about 14.4% ( 2-10% is good for printed text and 10-20% for handwritten text ).
  • Full recognition of unusually long text. With texts of unusually long length (usually over 100 characters), the model is able to predict the first 25-50% of characters, then errors gradually appear.

Examples

Here are some currently cherry-picked examples showing the capability of the model.

image Our MangaOCR result
軍のテストフライトセンター
つぎの日曜日は
成魚」はこれくらい
うわああああああああああああ
付き合い始めたきっかけだって
極東方面に所属不明機を多数確認!
永遠が見えるまで駆けぬけていきたい
極道やった経験が生かされてるやろ
まだダイエットあきらめてなかったわけ?

Acknowledgments

This project is currently working with the usage of:

mangaocr's People

Contributors

gnurt2041 avatar omnihs1 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

mangaocr's Issues

How To use it ?

Hi,

Thank you for the hard work you've put into this project. I appreciate the effort and dedication.

I have been trying to use and test the project, but I'm not sure how to get started. The README doesn't provide detailed instructions on how to use the model.

Could you please explain the steps to make it work?

Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.