LocalScholar-Flow: Private, Offline Academic Paper Translation with Perfect Formatting (Docker + MinerU + Hunyuan) #4328
littleBu0210
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi everyone! 👋
I built LocalScholar-Flow, an open-source pipeline designed to solve the biggest headache in academic research: Translating papers while keeping formulas, images, and layout intact, without uploading data to the cloud.
We all know standard translators break mathematical formulas and destroy document structure. I combined the best open-source tools into a single, easy-to-deploy Docker workflow.
👀 See the Results:
1. PDF to Markdown Parsing (Powered by MinerU)
It doesn't just extract text; it understands the layout.
(Check out the comparison images below/attached: Note how the complex layout and headers are preserved)
2. Context-Aware Translation (Powered by Tencent Hunyuan)
Using a local LLM specifically tuned for translation, it keeps the academic tone and preserves LaTeX formulas.
(See the translation output: The math formulas remain perfect inside the translated text)
✨ Why use this?
python scripts/download_models.pyanddocker compose up -d.🛠️ Tech Stack:
🔗 GitHub Repository:
👉 https://github.com/littleBu0210/LocalScholar-Flow
I’d love to hear your feedback! If this helps your research workflow, please consider giving it a ⭐!
Beta Was this translation helpful? Give feedback.
All reactions