Examples

This repository contains training scripts to train a text detector based on manga-image-translator which can extract bounding-boxes, text lines and segmentation of text from manga or comics to help further comics translation procedures such as text-removal, recognition, lettering, etc.

There are some awesome projects such as Lancet.

Download the text detection model from zyddnys/manga-image-translator or Google Drive.

Examples

^{(source: manga109, © Yoshi Masako)}

Training Details

Our current model can be summarized as below.

All models were trained on around 13 thousand anime & comic style images, 1/3 from Manga109-s, 1/3 from DCM, and 1/3 are synthetic data in a weak supervision manner due to the lack of available high-quality annotations.

We used text detection model of manga-image-translator to generate text lines annotations for manga, and Manga-Text-Segmentation with some post-processing to generate masks for both manga and comics. Synthetic data were generated using around 4k text-free anime-girls pictures from https://t.me/SugarPic, text-rendering, Unet and DBNet training scripts can be found in this repo. Text block detector was trained using yolov5 official repository

We would not (don't have the right) share training sets or fonts publicly. 2/3 of the training set is not so clean anyway, so the training is reproducible only if you have enough images and fonts. You can use the models this repo provided to generate labels for comics/manga, and the comic style text rendering script to generate synthetic data. Please refer to examples.ipynb for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
data		data
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
basemodel.py		basemodel.py
db_dataset.py		db_dataset.py
examples.ipynb		examples.ipynb
inference.py		inference.py
requirements.txt		requirements.txt
seg_dataset.py		seg_dataset.py
text_rendering.py		text_rendering.py
train_db.py		train_db.py
train_seg.py		train_seg.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Examples

Training Details

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Examples

Training Details

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages