0% found this document useful (0 votes)

13 views

textrecognitiondatagenerator-readthedocs-io-en-latest

Uploaded by

Samuel Vangu

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views

textrecognitiondatagenerator-readthedocs-io-en-latest

Uploaded by

Samuel Vangu

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

TextRecognitionDataGenerator

Documentation
Release latest

Edouard Belval

Aug 04, 2022

Contents

1 Installation 3
1.1 Official package . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 From source . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

2 Overview 5
2.1 Most useful arguments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2 Getting help . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

3 Tutorial 9
3.1 Just generating data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.2 Generating Chinese data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.3 Text distorsions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3.4 A more advanced use case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.5 Manipulating margins . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

4 Module 15

5 Reference 17
5.1 DataGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.2 BackgroundGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.3 ComputerTextGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.4 DistorsionGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.5 HandwrittenTextGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.6 StringGenerator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

i
ii
TextRecognitionDataGenerator Documentation, Release latest

Since the name is quite long, all subsequent refrences will be under the acronym TRDG.
If you are new to the project, start with the tutorial section!

Contents 1
TextRecognitionDataGenerator Documentation, Release latest

2 Contents
CHAPTER 1

Installation

1.1 Official package

TRDG has a pip package with a matching name.

pip install trdg
Once that is installed, the trdg binary should be in your PATH.

1.2 From source

If you want to add a new language The easiest way to use the tool is by cloning the official repo.
git clone https://github.com/Belval/TextRecognitionDataGenerator
Then you need to install the dependencies. It is recommended to use a virtual environment for those.
pip3 install -r requirements.txt
If you want to use the handwritten text generation feature, you need to install the -hw dependencies.
pip3 install -r requirements-hw.txt
Once that is done, you can move to the tutorial for tips and tricks on how to use TRDG!

3
TextRecognitionDataGenerator Documentation, Release latest

4 Chapter 1. Installation
CHAPTER 2

Overview

2.1 Most useful arguments

1. -i, --input_file
Use it when the provided dictionaries do not fit your usecase. Each line will become an image, if your -c
parameter is high enough.
2. -c, --count
Self-explanatory parameter, but one you will probably want to change. The default value is 1000.
3. -l, --language
This argument is especially important if you want to generate data using a specific script. It changes the dic-
tionary to be used (-l fr is equivalent to -i dicts/fr.txt), but most importantly it changes the default
fonts to take one that supports the language’s script. Passing a chinese dictionary without changing the language
will cause invalid images to be generated.
4. -t, --thread_count
Another self-explanatory parameter, yet very important as most computers these days ship with a multi-core
CPU. Setting this to -t 8 makes TRDG create 8 processes to generate the data.
5. -f, --format
By default, all generated images will be 32 pixels high (or wide if you use -or 1). Now that might be too
small for you. -f allows you to make bigger images.

2.2 Getting help

As with most CLI tools, TRDG’s help is accessible through the -h argument.
If you need more information on a specific argument, find its definition in the reference. If even that does not do, feel
free to open an issue on the official repository.

5
TextRecognitionDataGenerator Documentation, Release latest

usage: trdg [-h] [--output_dir [OUTPUT_DIR]] [-i [INPUT_FILE]] [-l [LANGUAGE]]

-c [COUNT] [-rs] [-let] [-num] [-sym] [-w [LENGTH]] [-r]
[-f [FORMAT]] [-t [THREAD_COUNT]] [-e [EXTENSION]]
[-k [SKEW_ANGLE]] [-rk] [-wk] [-bl [BLUR]] [-rbl]
[-b [BACKGROUND]] [-hw] [-na NAME_FORMAT] [-d [DISTORSION]]
[-do [DISTORSION_ORIENTATION]] [-wd [WIDTH]] [-al [ALIGNMENT]]
[-or [ORIENTATION]] [-tc [TEXT_COLOR]] [-sw [SPACE_WIDTH]]
[-cs [CHARACTER_SPACING]] [-m [MARGINS]] [-fi] [-ft [FONT]]
[-ca [CASE]]

Generate synthetic text data for text recognition.

optional arguments:
-h, --help show this help message and exit
--output_dir [OUTPUT_DIR]
The output directory
-i [INPUT_FILE], --input_file [INPUT_FILE]
When set, this argument uses a specified text file as
source for the text
-l [LANGUAGE], --language [LANGUAGE]
The language to use, should be fr (French), en
(English), es (Spanish), de (German), or cn (Chinese).
-c [COUNT], --count [COUNT]
The number of images to be created.
-rs, --random_sequences
Use random sequences as the source text for the
generation. Set '-let','-num','-sym' to use
letters/numbers/symbols. If none specified, using all
three.
-let, --include_letters
Define if random sequences should contain letters.
Only works with -rs
-num, --include_numbers
Define if random sequences should contain numbers.
Only works with -rs
-sym, --include_symbols
Define if random sequences should contain symbols.
Only works with -rs
-w [LENGTH], --length [LENGTH]
Define how many words should be included in each
generated sample. If the text source is Wikipedia,
this is the MINIMUM length
-r, --random Define if the produced string will have variable word
count (with --length being the maximum)
-f [FORMAT], --format [FORMAT]
Define the height of the produced images if
horizontal, else the width
-t [THREAD_COUNT], --thread_count [THREAD_COUNT]
Define the number of thread to use for image
generation
-e [EXTENSION], --extension [EXTENSION]
Define the extension to save the image with
-k [SKEW_ANGLE], --skew_angle [SKEW_ANGLE]
Define skewing angle of the generated text. In
positive degrees
-rk, --random_skew When set, the skew angle will be randomized between
the value set with -k and it's opposite
(continues on next page)

6 Chapter 2. Overview
TextRecognitionDataGenerator Documentation, Release latest

(continued from previous page)

-wk, --use_wikipedia Use Wikipedia as the source text for the generation,
using this paremeter ignores -r, -n, -s
-bl [BLUR], --blur [BLUR]
Apply gaussian blur to the resulting sample. Should be
an integer defining the blur radius
-rbl, --random_blur When set, the blur radius will be randomized between 0
and -bl.
-b [BACKGROUND], --background [BACKGROUND]
Define what kind of background to use. 0: Gaussian
Noise, 1: Plain white, 2: Quasicrystal, 3: Pictures
-hw, --handwritten Define if the data will be "handwritten" by an RNN
-na NAME_FORMAT, --name_format NAME_FORMAT
Define how the produced files will be named. 0:
[TEXT]_[ID].[EXT], 1: [ID]_[TEXT].[EXT] 2: [ID].[EXT]
+ one file labels.txt containing id-to-label mappings
-d [DISTORSION], --distorsion [DISTORSION]
Define a distorsion applied to the resulting image. 0:
None (Default), 1: Sine wave, 2: Cosine wave, 3:
Random
-do [DISTORSION_ORIENTATION], --distorsion_orientation [DISTORSION_ORIENTATION]
Define the distorsion's orientation. Only used if -d
is specified. 0: Vertical (Up and down), 1: Horizontal
(Left and Right), 2: Both
-wd [WIDTH], --width [WIDTH]
Define the width of the resulting image. If not set it
will be the width of the text + 10. If the width of
the generated text is bigger that number will be used
-al [ALIGNMENT], --alignment [ALIGNMENT]
Define the alignment of the text in the image. Only
used if the width parameter is set. 0: left, 1:
center, 2: right
-or [ORIENTATION], --orientation [ORIENTATION]
Define the orientation of the text. 0: Horizontal, 1:
Vertical
-tc [TEXT_COLOR], --text_color [TEXT_COLOR]
Define the text's color, should be either a single hex
color or a range in the ?,? format.
-sw [SPACE_WIDTH], --space_width [SPACE_WIDTH]
Define the width of the spaces between words. 2.0
means twice the normal space width
-cs [CHARACTER_SPACING], --character_spacing [CHARACTER_SPACING]
Define the width of the spaces between characters. 2
means two pixels
-m [MARGINS], --margins [MARGINS]
Define the margins around the text when rendered. In
pixels
-fi, --fit Apply a tight crop around the rendered text
-ft [FONT], --font [FONT]
Define font to be used
-ca [CASE], --case [CASE]
Generate upper or lowercase only. arguments: upper or
lower. Example: --case upper

2.2. Getting help 7

TextRecognitionDataGenerator Documentation, Release latest

8 Chapter 2. Overview
CHAPTER 3

Tutorial

TextRecognitionDataGenerator comes with an (hopefully) easy to use CLI. The tutorial is actually multiple tutorials,
combined in a single page. Feel free to skip sections that are not relevant to your use case.

3.1 Just generating data

Fun fact, you don’t need to use any command line arguments if you want English data generated using multiple fonts.
Indeed, simply running python3 run.py will create 1000 English, single word images in the out/ directory such
as these:

Now maybe 1000 is too many or too few for your usecase. You can add the -c argument to set how many examples
will be generated.
python3 run.py -c 10
As expected, you will find 10 examples in the out/ directory.

3.2 Generating Chinese data

This is a common usecase, and one that is easy with TRDG.

9
TextRecognitionDataGenerator Documentation, Release latest

python3 run.py -c 10 -l cn
This will generate 10 samples using the Chinese dictionary that can be found in in dicts/cn.txt:

Since the concept of word in Chinese is a bit trickier, the dictionary is made of single characters (make your own!).
Let’s do this again with -w 5 to get something prettier.
python3 run.py -c 10 -l cn -w 5

Now that looks better, but what’s up with the spacing between the characters? We would rather have no spaces. Add
-sw 0.
python3 run.py -c 10 -l cn -w 5 -sw 0

Asian scripts can be written top to bottom, you might want to add the -or 1 argument to get vertical text.
python3 run.py -c 10 -l cn -w 5 -sw 0 -or 1

10 Chapter 3. Tutorial
TextRecognitionDataGenerator Documentation, Release latest

You can do much and more with TRDG, if you run into a missing feature, simply open an issue.

3.3 Text distorsions

For those familiar with the process of training a machine learning model, you often have to deal with overfitting,
which is when the model gets too good at predicting the samples in the training data and stops generalizing to unseen
examples. One trick to prevent this is by adding the distorsion to the data.
While TRDG does not dwelve too deeply in augmentations, as many better and more complete libraries already take
care of it, some operations are available for convenience through the -d argument which as 3 possible values:
• 0: None
• 1: Sine wave
• 2: Cosine wave
• 3: Random
python3 run.py -c 5 -w 5 -d 1

python3 run.py -c 5 -w 5 -d 3

3.3. Text distorsions 11

TextRecognitionDataGenerator Documentation, Release latest

3.4 A more advanced use case

Text in the real world is not always black, and most importantly, text in the real world is almost never straight. What
if we want to emulate that?
python3 run.py -c 10 -k 15 -rk -bl 0.5 -rbl -tc '#000000,#888888'
Which can be translated to: generate 10 examples with a skewing angle between -15 and 15 with an added gaussian
blur between 0 and 0.1. Finally, the text color should be picked randomly between black and gray (including all the
colors inbetween).
Sure enough, the output is much more colourful!

The default resolution might be too small to your taste (and I agree). By default the output is 32 pixels high because
it’s the height used by most text recognition papers. Now you can change that with -f 64.
python3 run.py -c 10 -k 15 -rk -bl 0.5 -rbl -tc '#000000,#888888' -f 64

12 Chapter 3. Tutorial
TextRecognitionDataGenerator Documentation, Release latest

3.5 Manipulating margins

TRDG allows you to control margins around the text using two parameters, --margins, --fit. The first one
controls margins, in pretty much the same way the CSS property margin does.
This is the result with no fit and the default (5, 5, 5, 5) margins: python3 run.py -c 1 -i texts/test.
txt

Now we can add --fit to apply a tight crop around the rendered text. This changes the size by removing the added
space for accents: python3 run.py -c 1 -i texts/test.txt --fit

Margins are applied the generated text, so even with 0,0,0,0, if you don’t use --fit you will get an apparence of
margins: python3 run.py -c 1 -i texts/test.txt --margins 0,0,0,0

Now if you add --fit, you get an absolutely no margins: python3 run.py -c 1 -i texts/test.txt
--margins 0,0,0,0 --fit

Margin values are comma separated top,left,bottom,right, so --margins 10,0,10,0 will return verti-
cal margins with tight cropping vertically.

And finally, with all margins: python3 run.py -c 1 -i texts/test.txt --margins 10,10,10,
10 --fit

3.5. Manipulating margins 13

TextRecognitionDataGenerator Documentation, Release latest

14 Chapter 3. Tutorial
CHAPTER 4

Module

TRDG is also a module that can be included in your favorite training pipeline. The easiest way to use it, is to import a
generator.

from trdg.generators import GeneratorFromStrings

generator = GeneratorFromStrings(['Test1', 'Test2', 'Test3'])

for img in generator:

# Do something with the pillow image here.

The basic one is GeneratorFromStrings which, as its name indicates, will take a list of strings, and generate an
image and label pair.
If you want to avoid having to maintain dictionaries, you can use GeneratorFromDicts which will use the bun-
dled ones, GeneratorFromRandom which generates random strings, and GeneratorFromWikipedia which
picks random article from Wikipedia as its source for strings.
Here are examples for each of those, respectively:

from trdg.generators import (

GeneratorFromDicts,
GeneratorFromRandom,
GeneratorFromWikipedia,
)

generator_from_dicts = GeneratorFromDicts()
generator_from_random = GeneratorFromRandom()
generator_from_wikipedia = GeneratorFromWikipedia()

for img, lbl in generator_from_dicts:

# Do something with the pillow image here.

The generators will not raise StopIteration, they will keep generating images until you break out of the loop.
Set a non-negative value for count if that’s an issue

15
TextRecognitionDataGenerator Documentation, Release latest

16 Chapter 4. Module
CHAPTER 5

Reference

Coming soon

5.1 DataGenerator

5.2 BackgroundGenerator

5.3 ComputerTextGenerator

5.4 DistorsionGenerator

5.5 HandwrittenTextGenerator

5.6 StringGenerator

Intrusion Detection Honeypots
From Everand
Intrusion Detection Honeypots
Chris Sanders
3/5 (2)
THE LTSPICE XVII SIMULATOR: Commands and Applications
From Everand
THE LTSPICE XVII SIMULATOR: Commands and Applications
Gilles Brocard
5/5 (1)
Bit by Bit: Social Research in the Digital Age
From Everand
Bit by Bit: Social Research in the Digital Age
Matthew J. Salganik
4/5 (1)
Biochemistry - Nucleic Acid Lesson Plan
100% (3)
Biochemistry - Nucleic Acid Lesson Plan
19 pages
Programming FPGAs: Getting Started with Verilog
From Everand
Programming FPGAs: Getting Started with Verilog
Simon Monk
3.5/5 (2)
Complete Audio Mastering: Practical Techniques
From Everand
Complete Audio Mastering: Practical Techniques
Gebre E. Waddell
5/5 (5)
Audio, Video, and Media in the Ministry
From Everand
Audio, Video, and Media in the Ministry
Clarence Floyd Richmond
No ratings yet
Code View
50% (2)
Code View
624 pages
DSSAT Guide Module
100% (3)
DSSAT Guide Module
34 pages
Setting Calculation
100% (4)
Setting Calculation
33 pages
Programming the Photon: Getting Started with the Internet of Things
From Everand
Programming the Photon: Getting Started with the Internet of Things
Christopher Rush
5/5 (1)
Gray Hat Hacking the Ethical Hacker's
From Everand
Gray Hat Hacking the Ethical Hacker's
Çağatay Şanlı
5/5 (1)
Basic Research and Technologies for Two-Stage-to-Orbit Vehicles: Final Report of the Collaborative Research Centres 253, 255 and 259
From Everand
Basic Research and Technologies for Two-Stage-to-Orbit Vehicles: Final Report of the Collaborative Research Centres 253, 255 and 259
Dieter Jacob
No ratings yet
Building with Virtual LEGO: Getting Started with LEGO Digital Designer, LDraw, and Mecabricks
From Everand
Building with Virtual LEGO: Getting Started with LEGO Digital Designer, LDraw, and Mecabricks
John Baichtal
No ratings yet
Development Research in Practice: The DIME Analytics Data Handbook
From Everand
Development Research in Practice: The DIME Analytics Data Handbook
Kristoffer Bjärkefur
No ratings yet
Pollution Prevention: Methodology, Technologies and Practices
From Everand
Pollution Prevention: Methodology, Technologies and Practices
Kenneth L. Mulholland
No ratings yet
Teardowns: Learn How Electronics Work by Taking Them Apart
From Everand
Teardowns: Learn How Electronics Work by Taking Them Apart
Bryan Bergeron
No ratings yet
3D Printer Projects for Makerspaces
From Everand
3D Printer Projects for Makerspaces
Lydia Sloan Cline
4/5 (1)
Content Creation Revolution with chatGPT
From Everand
Content Creation Revolution with chatGPT
Maria Cowen
No ratings yet
The TAB Book of Arduino Projects: 36 Things to Make with Shields and Proto Shields
From Everand
The TAB Book of Arduino Projects: 36 Things to Make with Shields and Proto Shields
Simon Monk
5/5 (7)
The Linux Terminal for Advanced Users - The Command Line Made Easy: First Edition
From Everand
The Linux Terminal for Advanced Users - The Command Line Made Easy: First Edition
Michael Basler
No ratings yet
Programming Arduino Next Steps: Going Further with Sketches, Second Edition
From Everand
Programming Arduino Next Steps: Going Further with Sketches, Second Edition
Simon Monk
3/5 (3)
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
From Everand
Securing ChatGPT: Best Practices for Protecting Sensitive Data in AI Language Models
Matthew C. Smith
No ratings yet
Open Data Structures: An Introduction
From Everand
Open Data Structures: An Introduction
Pat Morin
4/5 (4)
Software Patterns Made Easy
From Everand
Software Patterns Made Easy
Justice Nanhou
No ratings yet
Programming the BBC micro:bit: Getting Started with MicroPython
From Everand
Programming the BBC micro:bit: Getting Started with MicroPython
Simon Monk
No ratings yet
GRE 5-Hour Quick Prep For Dummies
From Everand
GRE 5-Hour Quick Prep For Dummies
Ron Woldoff
No ratings yet
ChatGPT for Business: Strategies for Success
From Everand
ChatGPT for Business: Strategies for Success
Matthew C. Smith
1/5 (1)
Windows 11 All-in-One For Dummies, 2nd Edition
From Everand
Windows 11 All-in-One For Dummies, 2nd Edition
Ciprian Adrian Rusen
No ratings yet
Programming Arduino Next Steps: Going Further with Sketches
From Everand
Programming Arduino Next Steps: Going Further with Sketches
Simon Monk
3/5 (3)
Structural Dynamics in Industry
From Everand
Structural Dynamics in Industry
Alain Girard
No ratings yet
Price formation in the cryptocurrency market. A hypotheses driven econometric analysis of cryptocurrency price determinants
From Everand
Price formation in the cryptocurrency market. A hypotheses driven econometric analysis of cryptocurrency price determinants
Lukas M. König
No ratings yet
GED Test 5-Hour Quick Prep For Dummies
From Everand
GED Test 5-Hour Quick Prep For Dummies
Tim Collins
No ratings yet
Schaum's Outline of Programming with C++
From Everand
Schaum's Outline of Programming with C++
John R. Hubbard
No ratings yet
ai-image-generator
No ratings yet
ai-image-generator
37 pages
Control Systems
From Everand
Control Systems
Francisco Luis Pagola y de las Heras
No ratings yet
Facility Management: Business Process Integration
From Everand
Facility Management: Business Process Integration
Alexander Redlein
No ratings yet
Benefits of semantic data models. A study in the European goods transport industry
From Everand
Benefits of semantic data models. A study in the European goods transport industry
Andreas A. Pelekies
No ratings yet
Long Teeth: Stone Angel #4
From Everand
Long Teeth: Stone Angel #4
Marvin H. Albert
No ratings yet
The Satisfiability Problem: Algorithms and Analyses
From Everand
The Satisfiability Problem: Algorithms and Analyses
Uwe Schöning
No ratings yet
Chapter 1. An Introduction To Generative Media: A Note For Early Release Readers
No ratings yet
Chapter 1. An Introduction To Generative Media: A Note For Early Release Readers
17 pages
BTP Report
No ratings yet
BTP Report
27 pages
Image Captioning Using CNN and LSTM
No ratings yet
Image Captioning Using CNN and LSTM
9 pages
Extracting Text From Images With LangChain _ by Reflections on AI _ Nov, 2024 _ Python in Plain English
No ratings yet
Extracting Text From Images With LangChain _ by Reflections on AI _ Nov, 2024 _ Python in Plain English
22 pages
PagedOut 004 Beta1
No ratings yet
PagedOut 004 Beta1
68 pages
Towards Abstractive Captioning of Infographics
No ratings yet
Towards Abstractive Captioning of Infographics
94 pages
Introduction To Deep Neural Networks - DataCamp
No ratings yet
Introduction To Deep Neural Networks - DataCamp
10 pages
unit 5 dl
No ratings yet
unit 5 dl
26 pages
ML Unit V
No ratings yet
ML Unit V
46 pages
Computers 2024 25
No ratings yet
Computers 2024 25
31 pages
GANS AND TENSORFLOW FOR DEV (Z-Library)
No ratings yet
GANS AND TENSORFLOW FOR DEV (Z-Library)
95 pages
Chapter2 Limitations of RNN
No ratings yet
Chapter2 Limitations of RNN
29 pages
Ocr Gtts
No ratings yet
Ocr Gtts
49 pages
CS5984 Final Report
No ratings yet
CS5984 Final Report
57 pages
AI Trends of May 2023 You Need To Know by Gonzalo Recio Medium
No ratings yet
AI Trends of May 2023 You Need To Know by Gonzalo Recio Medium
1 page
Image To TXT Original Final
No ratings yet
Image To TXT Original Final
32 pages
Natural Language Processing With Pytorch Readthedocs Io en Latest PDF
No ratings yet
Natural Language Processing With Pytorch Readthedocs Io en Latest PDF
35 pages
Ocr Nanonets Tesseract
No ratings yet
Ocr Nanonets Tesseract
39 pages
Dokumen - Pub - Natural Language Processing Practical Using Transformers With Python
No ratings yet
Dokumen - Pub - Natural Language Processing Practical Using Transformers With Python
275 pages
The Game of Logic by Lewis Carrol
No ratings yet
The Game of Logic by Lewis Carrol
122 pages
Essay
No ratings yet
Essay
2 pages
The Dilemma of Home-Based Learning and Teaching PD
No ratings yet
The Dilemma of Home-Based Learning and Teaching PD
6 pages
CQRS: Event Processing To Query-Databases
No ratings yet
CQRS: Event Processing To Query-Databases
47 pages
NEET Notes
No ratings yet
NEET Notes
10 pages
BIPSE
No ratings yet
BIPSE
5 pages
Nursing Care Plan Ineffective Peripheral Tissue Perfusion
90% (10)
Nursing Care Plan Ineffective Peripheral Tissue Perfusion
3 pages
ACCA Paper F2 ACCA Paper F2 Management Accounting: Saa Global Education Centre Pte LTD
No ratings yet
ACCA Paper F2 ACCA Paper F2 Management Accounting: Saa Global Education Centre Pte LTD
17 pages
Continuous Miners
No ratings yet
Continuous Miners
8 pages
1st List of Not Eligible Students Ehsaas Scholarship Phase II For Website Iub
No ratings yet
1st List of Not Eligible Students Ehsaas Scholarship Phase II For Website Iub
44 pages
Summer Degree Program Lecture Agreement
No ratings yet
Summer Degree Program Lecture Agreement
26 pages
Perceived Advantage of Social Networking Sites in Selected Restaurants in Lucena City Chapter 3
0% (1)
Perceived Advantage of Social Networking Sites in Selected Restaurants in Lucena City Chapter 3
5 pages
Progress in Civil, Architectural and Hydraulic Engineering: Editor: Yun-Hae Kim
100% (2)
Progress in Civil, Architectural and Hydraulic Engineering: Editor: Yun-Hae Kim
1,447 pages
Assembly Line Balancing
No ratings yet
Assembly Line Balancing
13 pages
University of Illinois at Chicago Actg / Ids 475 - Database Accounting Systems Course Syllabus Fall Semester 2018 Instructor
No ratings yet
University of Illinois at Chicago Actg / Ids 475 - Database Accounting Systems Course Syllabus Fall Semester 2018 Instructor
5 pages
Med Admin Practice Questions
No ratings yet
Med Admin Practice Questions
5 pages
SHDH3110 L1
No ratings yet
SHDH3110 L1
13 pages
ModeMachines x0xb0x Socksbox TB-303 Clone Manual (English)
No ratings yet
ModeMachines x0xb0x Socksbox TB-303 Clone Manual (English)
19 pages
70 Important Question CS
No ratings yet
70 Important Question CS
29 pages
A1 - Basics of Designing
No ratings yet
A1 - Basics of Designing
14 pages
Cade, 0 0
No ratings yet
Cade, 0 0
1 page
FM Lecture Notes
No ratings yet
FM Lecture Notes
118 pages
Audit Dashboard
No ratings yet
Audit Dashboard
4 pages
Scisor Jack Presentation
100% (2)
Scisor Jack Presentation
21 pages
Ibm
No ratings yet
Ibm
2 pages
Vol10 Tab02
No ratings yet
Vol10 Tab02
87 pages

textrecognitiondatagenerator-readthedocs-io-en-latest

Uploaded by

textrecognitiondatagenerator-readthedocs-io-en-latest

Uploaded by

TextRecognitionDataGenerator

Aug 04, 2022

1.1 Official package

TRDG has a pip package with a matching name.

1.2 From source

2.1 Most useful arguments

2.2 Getting help

usage: trdg [-h] [--output_dir [OUTPUT_DIR]] [-i [INPUT_FILE]] [-l [LANGUAGE]]

Generate synthetic text data for text recognition.

(continued from previous page)

2.2. Getting help 7

3.1 Just generating data

3.2 Generating Chinese data

This is a common usecase, and one that is easy with TRDG.

3.3 Text distorsions

3.3. Text distorsions 11

3.4 A more advanced use case

3.5 Manipulating margins

3.5. Manipulating margins 13

from trdg.generators import GeneratorFromStrings

generator = GeneratorFromStrings(['Test1', 'Test2', 'Test3'])

for img in generator:

from trdg.generators import (

for img, lbl in generator_from_dicts:

You might also like