GitHub - Mayonnaisu/manga-image-translator: Manga Image Translator simplified, kinda 😅

📂 Directory

NOTICE
ABOUT
DOWNLOAD
INSTALLATION
CONFIGURATION
- Required
- Optional
USAGE (CPU MODE)
USAGE (GPU MODE)
UPDATE
EXTRA INFO
- How to Get Gemini API Key
- Webtoon Mode

NOTICE

Some things have been changed & fixed, so it's recommended to update to newer components if you have installed it before. See the UPDATE section for more info.

Since the guide has become too long and complex than originally intended, I decided to simplify it. You can see the more detailed version of this guide here.

ABOUT

This fork doesn't change the core functions of the original program. This is still Manga Image Translator, but with some minor tweaks & extra components to make it easier and more convenient to set up and use.

Note

For more info about the main usage of MIT, always refer to https://github.com/zyddnys/manga-image-translator.

The Changes:

Add installer
Add updater
Add launchers
Add .env file
Add PyTorch checker
Add folder selection feature
Add XPU (Intel GPU) support (untested)
Improve handling of webtoon format (🛠️working but need improvement)
Sort input folders in natural order
Use recommended configurations by default
Disable some functions in order to bypass errors
Clean up result folder except for log file by default

Important

The installer only supports Windows 10 & 11.

DOWNLOAD

Click on the green button on the top.
Select "Download ZIP".
Right click on the downloaded .zip file.
Select "Extract Here" with WinRAR or 7-Zip.

INSTALLATION

Open PowerShell as Administrator.
Change PowerShell execution policy by entering the command below:

Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope LocalMachine

Enter y or yes.
Close the PowerShell.
Right click on MIT-installer.ps1.
Select "Run with PowerShell".
Select "Yes" if UAC prompt pops up.
Wait until you get ${{\color{lightgreen}{\textsf{INSTALLATION COMPLETED!}}}}$ message.

Tip

If you get a warning when opening the installer, uncheck the option, then Open. If you don't do this, the script won't be able to run properly.

View image

CONFIGURATION

Required

Open .env file with text/code editor (Notepad, VS Code, etc).
Paste your Gemini API key between the quotation marks.
Save.

Optional

Open my-config.json and gpt_config-example.yaml in examples folder & settings.json in my_tools folder with text/code editor.
Change the settings as you see fit.
Save.

USAGE (CPU MODE)

Note

All local modes support batch translation.
The first time you run the program, it will automatically download the selected detection, OCR, & inpainting models. After that, it won't need to do it again, unless you have changed the relevant configurations in my-config.json.

Local Mode

Right click on MIT-local-launcher.ps1.
Select "Run with PowerShell".
Select a folder containing your manga/hwa/hua.

Local Webtoon Mode

Warning

This launcher has a really high RAM usage!

Right click on MIT-local-webtoon-launcher.ps1.
Select "Run with PowerShell".
Select a folder containing your manga/hwa/hua.

Web Mode

Right click on MIT-web-launcher.ps1.
Select "Run with PowerShell".
Visit http://127.0.0.1:8000 (default).
Press Q to stop the server.

Real-Time Translation

There are a bunch of available browser extensions out there, but I will use ComicReadScript here. In general, they have similar configurations.
- Install Tampermonkey from https://www.tampermonkey.net
  
  Next, if you use Chromium-based browser, then you need to follow the steps here: https://www.tampermonkey.net/faq.php#Q209.
- Install ComicRead from https://sleazyfork.org/en/scripts/374903-comicread
- Visit RAW manga/hwa/hua website.
- Select a chapter from any available series.
- Click on Extensions menu bar > Tampermonkey.
- Select "Enter simple reading mode"
- Hover your mouse over the left side of the page.
- Click on "Scroll mode" button.
- Hover over the left > "Settings":
  
  View config
- Click on "Settings" button to close.
- Click on "Translate current page" or "Translate current page to the end".

Tip

If your PC is slow, it won't actually be real time. So, consider using GPU if you have a powerful one.
Check the PowerShell to see more detailed progress of the translation process.

USAGE (GPU MODE)

Go to here.

UPDATE

Warning

This updater will replace the old files with the newer ones, so make sure to back up the files you want to keep first. For more info, see here.

Download MIT-updater.ps1.

only if there is no ### VERSION ### in your MIT-updater.ps1.
Move it to your manga-image-translator-main folder.
Right click on it > Run with PowerShell.
Wait until you get ${{\color{lightgreen}{\textsf{UPDATE COMPLETED!}}}}$ message.

EXTRA INFO

How to Get Gemini API Key

Visit https://aistudio.google.com/app/apikey.
Accept the Terms and Conditions.
Click "Create API key".
Name your key.
Choose project > Create project.
Select the newly created project.
Click "Create key".
Click the code in the "Key" column.
Click "Copy key".

Tip

Gemini API Free Tier has rate limits, see: https://ai.google.dev/gemini-api/docs/rate-limits#current-rate-limits.

To check your quota:

Visit https://aistudio.google.com/app/usage
Make sure you are on the right account & project.
Click "Open in Cloud Console" on the bottom.
Scroll down > Click "Quotas & System Limits".
Scroll down > You will see your model quota usage on the top result. If you don't see it, use Filter to search it.

For example:

View image

Webtoon Mode

Warning

This mode will attempt to merge all images in each chapter folder into one really long image respectively first. MIT then will have to load and process the long-ass images for translation, which inevitably causes it to consume a lot more RAM and time than regular mode. Last but not least, it will merge the translated images into one* before splitting all translated images back into the same number of parts as the original images in each folder (the height and the split position won't be identical tho). For more info, see here.

_{*if the specified merged image number is greater than 1.}

Pros

Better translation result because the translator will get all texts from one chapter at once, so it will have more contexts than when it receives the texts from only one page at a time.
Better OCR result in a way as there is no potentially split speech bubble resulting in incomplete text detection.

Cons

Slower and heavier.
Speech bubbles are dirtier.
Some texts are not detected and/or inpainted at all just like in regular mode. It's just that the missed areas will be different because the difference in their image heights. That's why it's recommended to increase or decrease detection_size & inpainting_size in my-config.json to improve the result. See https://github.com/zyddnys/manga-image-translator?tab=readme-ov-file#tips-to-improve-translation-quality.
~~Prone to server overloaded error.~~ (just retry it XD)
It seems that it's not really caused by the launcher, or is it? 🤔, since even the paid users are experiencing the same issue, see: google-gemini/gemini-cli#4360. Alternatively, you can change the model in .env file or the translator in my-config.json.

Note

The webtoon mode can use up to around 20GB RAM on my laptop.

My PC Specs:

Model: ASUS VIVOBOOK 14X M1403QA
CPU: AMD Ryzen™ 5 5600H (6C/12T)
GPU: 512MB AMD Radeon™ Vega 7 Graphics (integrated)
RAM: 24GB DDR4 3200 MT/s
Storage: 512GB M.2 NVMe™ PCIe® 3.0 SSD
OS: Windows 11 Home Single Language 64-bit (25H2)

If your PC specs are equal or better than mine, then you should be fine, probably 😅.

Name		Name	Last commit message	Last commit date
Latest commit History 2,376 Commits
.github		.github
MangaStudio_Data		MangaStudio_Data
demo/doc		demo/doc
devscripts		devscripts
dict		dict
examples		examples
fonts		fonts
front		front
manga_translator		manga_translator
my_tools		my_tools
pip-modules/mit-renderer		pip-modules/mit-renderer
server		server
test		test
training		training
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CHANGELOG_CN.md		CHANGELOG_CN.md
Dockerfile		Dockerfile
LICENSE		LICENSE
MIT-installer.ps1		MIT-installer.ps1
MIT-local-launcher.ps1		MIT-local-launcher.ps1
MIT-local-webtoon-launcher.ps1		MIT-local-webtoon-launcher.ps1
MIT-updater.ps1		MIT-updater.ps1
MIT-web-launcher.ps1		MIT-web-launcher.ps1
Makefile		Makefile
MangaStudioMain.py		MangaStudioMain.py
MangaStudioMainRun.bat		MangaStudioMainRun.bat
README.md		README.md
README_CN.md		README_CN.md
docker-compose.debug.yml		docker-compose.debug.yml
docker-compose.yml		docker-compose.yml
docker_prepare.py		docker_prepare.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run-as-kaggle.ipynb		run-as-kaggle.ipynb
run.bat		run.bat
run.sh		run.sh
run_as_colab.ipynb		run_as_colab.ipynb
sakura_dict.txt		sakura_dict.txt
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📂 Directory

NOTICE

Some things have been changed & fixed, so it's recommended to update to newer components if you have installed it before. See the UPDATE section for more info.

Since the guide has become too long and complex than originally intended, I decided to simplify it. You can see the more detailed version of this guide here.