
amharic_ocr's Introduction

Detection and Recognition of Amharic Scene Text using the MMOCR toolbox

MMOCR is an open-source OCR toolbox. For installation details and related information, see https://github.com/open-mmlab/mmocr.

The Geʽez/Abugida/Ethiopic script has up to 519 characters. For Amharic, we use 289-319 characters, depending on whether Ethiopic numerals and punctuation are included. There are no capital or small letters.

Amharic Text Detection dataset preprocessing

We have two datasets for the detection task. HUST-ART is the real-world dataset, and HUST-AST is the synthetic dataset. HUST-ART consists of 1500 training images and 700 test images. HUST-AST comprises 75,904 training images. To convert the dataset labels to MMOCR format, use tools/data/textdet/icdar_converter.py as follows:

python tools/data/textdet/icdar_converter.py det_datasets/HUST-ART -o det_datasets/HUST-ART -d icdar2015 --split-list training test
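
Before running the converter, it can help to confirm that the dataset folder has the layout the script reads. The sketch below assumes the ICDAR-style layout commonly used with MMOCR 0.x converters, with imgs/<split> and annotations/<split> subfolders. That layout and the helper name missing_split_dirs are assumptions for illustration and are not stated in this README.

```python
from pathlib import Path


def missing_split_dirs(root, splits=("training", "test")):
    """Return the expected sub-directories that are absent under `root`.

    Assumed layout (not confirmed by this README):
        root/imgs/<split>/ and root/annotations/<split>/
    """
    root = Path(root)
    expected = [
        root / kind / split
        for kind in ("imgs", "annotations")
        for split in splits
    ]
    return [str(path) for path in expected if not path.is_dir()]


if __name__ == "__main__":
    # Same dataset root and split names as in the converter command above.
    missing = missing_split_dirs("det_datasets/HUST-ART")
    if missing:
        print("Missing directories:")
        for path in missing:
            print("  " + path)
    else:
        print("Layout looks ready for icdar_converter.py")
```

If the check reports missing folders, rearrange the downloaded files or adjust the split names passed to --split-list before converting.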

Amharic Text Recognition

We have two training sets and two test sets. The Tana (TN) and Waliya (WL) training sets consist of 2.85M and 4M cropped words, respectively. The HUST-ART and ABE test sets consist of 4039 and 5218 text images, respectively. We also have a validation dataset of 14835 text images, drawn from the training parts of HUST-ART and ABE. All five datasets are in LMDB format.

MMOCR usage

  1. The directory configs/_base_/recog_pipelines/ contains several pipelines. In each pipeline you use, change dict(type='LoadImageFromFile') to dict(type='LoadImageFromLMDB').
  2. In the directory configs/_base_/recog_datasets/, modify the paths of the training and test datasets.
  3. In mmocr/models/textrecog/convertors/base.py, define the dictionary using the 314 Amharic characters. We have already made this change, so no action is needed if you use our character set. If your character set differs, modify dict_type in all other related files accordingly. We have also modified the configs/textrecog/satrn/satrn_small.py settings, which you can use as an example.

The datasets for both detection and recognition can be downloaded from https://dk-liang.github.io/HUST-ASTD/.
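
The loader change in step 1 can be illustrated with a small helper. This is only a sketch: the function name use_lmdb_loader and the shortened example pipeline are hypothetical, and in practice the change is made by editing the config files by hand. It also assumes the loading step sits at the top level of the pipeline list, so nested transform lists are not handled.

```python
import copy


def use_lmdb_loader(pipeline):
    """Return a copy of `pipeline` with the file loader swapped for the LMDB loader.

    Only the `type` value changes. Any other keys on the loading step are
    kept as-is, which assumes the LMDB loader accepts the same arguments.
    """
    new_pipeline = copy.deepcopy(pipeline)
    for step in new_pipeline:
        if step.get("type") == "LoadImageFromFile":
            step["type"] = "LoadImageFromLMDB"
    return new_pipeline


# Hypothetical, shortened pipeline used only for illustration.
train_pipeline = [
    dict(type="LoadImageFromFile"),
    dict(type="ResizeOCR", height=32, min_width=100, max_width=100),
    dict(type="ToTensorOCR"),
]

lmdb_pipeline = use_lmdb_loader(train_pipeline)
print(lmdb_pipeline[0])
```

The helper works on a deep copy, so the original pipeline definition is left unchanged and both variants can be compared side by side.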

The link to the Waliya-related LMDB dataset will be provided soon.

  1. Test and Validation sets LMDB https://mega.nz/folder/Ub0SnBBa#Fh6pFqbvXVxsa7OJPfJEwA
  2. Tana(TN) LMDB https://mega.nz/folder/NGcC1DaQ#soagog8p_LgOnm6Gx9wdCQ

Citation

If you find our datasets useful in your research, please consider citing:

@article{dikubab2022comprehensive,
  title={Comprehensive benchmark datasets for Amharic scene text detection and recognition},
  author={Dikubab, Wondimu and Liang, Dingkang and Liao, Minghui and Bai, Xiang},
  journal={Science China Information Sciences},
  volume={65},
  pages={160106},
  note={Special Focus on Deep Learning for Computer Vision},
  year={2022}
}

License

This project is released under the Apache 2.0 license.
