Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
machine-learning
python
data-science
ner
synthetic-data
synthetic-data-generation
data-generation
ocr-recognition
synthetic-images
text-alignment
Обновлено 2023-07-20 18:03:32 +03:00
Synthetic Dataset Insights
Обновлено 2022-09-23 22:35:49 +03:00
Main purpose of this repo is to generate fake data to support demos, tests and sandboxes. This repo contains open source code in python, designed to work inside a Synapse workspace. It is built inside a Synapse notebook which fills all the tables of any industry database model.
Обновлено 2022-06-12 22:11:05 +03:00