Cumulative translation by sunyuechi · Pull Request #148 · yihong0618/bilingual_book_maker

sunyuechi · 2023-03-11T13:55:57Z

Wait for how many tokens have been accumulated before starting the translation

--accumulated_num num

The strange part: I don't know how to judge how the cumulative translation results correspond to the original paragraph, so use sep = "\n\n\n\n\n", if sep="\n\n", it is often translated into the same line

A possible bug is that even if \n\n\n\n\n, it may still be translated to the same line, and then there will be problems where the translation appears, this will only affect the last num characters, after which it will return to normal

A good effect is that it can understand the context more, and the prompt is more effective

I have used various prompts before to ask it not to translate people's names, but to no avail, with context it works(It will most likely work, and in a few cases it will still translate, but it can be translated to the same name)

left:

time OPENAI_API_SYS_MSG="You are an assistant who translates computer technology books but don't translate people's names, there is a blank line after each of your translation results" python3 "make_book.py" --book_name "$filepath" --openai_key "${openai_apikey_book}" --language "zh-hans" --accumulated_num 1500

right:

time python3 "make_book.py" --book_name "$filepath" --openai_key "${openai_apikey_book}" --language "zh-hans"

…aker

…aker into cumulative-translation

yihong0618 · 2023-03-12T06:54:36Z

Will take a look later for this PR.

yihong0618 · 2023-03-12T06:56:21Z

the function part let me do it is OK, when I got time

sunyuechi · 2023-03-12T07:23:45Z

@yihong0618 Ok, I'm trying to get a better prompt so that it can translate the same number of paragraphs as the original paragraphs, and still often have problems

yihong0618 · 2023-03-12T07:34:18Z

@hleft FYI, I want to keep the original prompt as default because of that it costs the least tokens cc @ConanChou, most users do not care how good of all, they just want it to help them read, so users can DIY their prompt but we do not change the default one unless we found a better one better for translate and for token cost.

sunyuechi · 2023-03-12T07:43:35Z

@yihong0618 I mean only modify the default prompt when --accumulated_num is enabled, because if don't modify it, you will get an error and not just a bad translation (the translated result appears in the wrong place).

yihong0618 · 2023-03-12T07:44:24Z

yep thanks~

sunyuechi · 2023-03-12T10:55:39Z

Still in testing, the current main progress is that I understand that it is not enough to just modify the prompt and retry, and some special cases must be manually handled, such as <sup>, Source: xxx link

change output

yihong0618

LGTM will test and merge.
amazing work!

zstone12 · 2023-03-15T12:55:54Z

LGTM+1.
The fantastic work!

zstone12 · 2023-03-15T12:58:51Z

Please add the relevant content of the --accumulated_num parameter to README:D

refactor: _process_paragraph refactor: translate_paragraphs_acc

…aker into cumulative-translation

…ingual_book_maker into cumulative-translation

yihong0618 · 2023-03-16T12:55:27Z

can not test...

yihong0618 · 2023-03-16T13:11:17Z

@hleft can this merge now?

sunyuechi · 2023-03-16T13:25:15Z

@yihong0618 yes ready

yihong0618 · 2023-03-16T13:26:22Z

please check new CI if it failed we need to fix it quick

…g0618#148

…ihong0618#148

sunyuechi added 8 commits March 8, 2023 21:53

support config tags to translate

502d909

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

7505d59

…aker

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

c31b01c

…aker

support system meesage in envirment

3277eef

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

c137fec

…aker

cumulative translation

f39b926

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

58d7939

…aker

fix

59ed6d9

yihong0618 reviewed Mar 11, 2023

View reviewed changes

zstone12 reviewed Mar 11, 2023

View reviewed changes

Comment thread book_maker/translator/chatgptapi_translator.py Outdated

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

7c91331

…aker into cumulative-translation

sunyuechi force-pushed the cumulative-translation branch from 8144fdd to 6ef21e7 Compare March 11, 2023 14:14

zstone12 reviewed Mar 11, 2023

View reviewed changes

Comment thread book_maker/loader/epub_loader.py Outdated

fix

2baf359

sunyuechi force-pushed the cumulative-translation branch from 6ef21e7 to 2baf359 Compare March 11, 2023 14:21

sunyuechi added 2 commits March 11, 2023 23:09

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

21478d9

…aker into cumulative-translation

clean

3523469

sunyuechi force-pushed the cumulative-translation branch from 4eecaa6 to 3523469 Compare March 11, 2023 16:08

prompt and retry

9aade4b

sunyuechi mentioned this pull request Mar 11, 2023

希望添加专业术语字典 #132

Closed

improve prompt and fix <sup>

8badb41

change output

sunyuechi force-pushed the cumulative-translation branch from 8e669d6 to 8badb41 Compare March 12, 2023 11:30

clean, fix link translate

b0d4f86

sunyuechi force-pushed the cumulative-translation branch 4 times, most recently from 22f3b2e to 7773778 Compare March 15, 2023 12:26

yihong0618 approved these changes Mar 15, 2023

View reviewed changes

zstone12 reviewed Mar 15, 2023

View reviewed changes

Comment thread book_maker/translator/chatgptapi_translator.py

sunyuechi added 3 commits March 15, 2023 21:14

refactor epub_loader by gpt4

6e714d3

refactor: _process_paragraph refactor: translate_paragraphs_acc

refactor

4e42860

update readme and help

cc85d6b

sunyuechi force-pushed the cumulative-translation branch from 5aa35c4 to cc85d6b Compare March 15, 2023 13:14

fix

c859c69

zstone12 approved these changes Mar 15, 2023

View reviewed changes

improve exception

173b756

sunyuechi force-pushed the cumulative-translation branch from 780147b to 173b756 Compare March 15, 2023 15:30

sunyuechi and others added 5 commits March 15, 2023 23:31

improve exception

e24a1c5

Merge branch 'main' of https://github.com/yihong0618/bilingual_book_m…

32cadfa

…aker into cumulative-translation

Merge branch 'yihong0618:main' into cumulative-translation

cf92918

use ordinals to ensure order instead of prompts

c535041

Merge branch 'cumulative-translation' of https://github.com/hleft/bil…

34ed9dc

…ingual_book_maker into cumulative-translation

comment debug output for merge

b73240a

yihong0618 merged commit e38a236 into yihong0618:main Mar 16, 2023

danparizher mentioned this pull request Mar 16, 2023

Cleanup #174

Merged

wayhome pushed a commit to wayhome/bilingual_book_maker that referenced this pull request Aug 29, 2024

fix: Preserve the br element and ensure correct text direction. yihon…

0851aad

…g0618#148

wayhome pushed a commit to wayhome/bilingual_book_maker that referenced this pull request Aug 29, 2024

feat: Side-by-side translation for content with line breaks. resolved y…

a046c1e

…ihong0618#148

Conversation

sunyuechi commented Mar 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yihong0618 commented Mar 12, 2023

Uh oh!

yihong0618 commented Mar 12, 2023

Uh oh!

sunyuechi commented Mar 12, 2023

Uh oh!

yihong0618 commented Mar 12, 2023

Uh oh!

sunyuechi commented Mar 12, 2023

Uh oh!

yihong0618 commented Mar 12, 2023

Uh oh!

sunyuechi commented Mar 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yihong0618 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zstone12 commented Mar 15, 2023

Uh oh!

zstone12 commented Mar 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yihong0618 commented Mar 16, 2023

Uh oh!

yihong0618 commented Mar 16, 2023

Uh oh!

sunyuechi commented Mar 16, 2023

Uh oh!

yihong0618 commented Mar 16, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sunyuechi commented Mar 11, 2023 •

edited

Loading

sunyuechi commented Mar 12, 2023 •

edited

Loading

zstone12 commented Mar 15, 2023 •

edited

Loading