This repository is to generate synthetic data for scene text detection and recognition from open source tools.
python synthetic_generator.py --data ./synthetic --image ./synthetic/images/ --label ./synthetic/labels/ --word ./synthetic/words/ --patch ./synthetic/patches --num 10
--data: Directory to store downloaded data (fonts and background images).
--image: Directory to store synthesized images.
--label: Directory to store synthesized polygon word labels for each synthesized image, line by line.
--word: Directory to store word text labels for each synthesized image, line by line.
--patch: Directory to store word patches with word text as its image file name.
--num: Number of synthetic images to be generated.
*Other parameters e.g., font size, word line rotation angle,...etc can be tuned inside main generator function.