[WIP] Chatgpt api translator by a0s · Pull Request #9 · translate-tools/linguist-translators

a0s · 2025-02-09T20:52:55Z

This is OpenAI API based translator module. Started at translate-tools/linguist#230

@vitonsky Any idea how to debug from inside eval() ? console.log and debugger; both are just not working.

vitonsky · 2025-02-09T21:58:20Z

@@ -0,0 +1,78 @@
+class ChatGPTTranslator {
+    constructor(model = 'gpt-4o', openApiKey = "") {


Can we use model gpt-4o-mini? It would save users money and it have quite good quality

vitonsky · 2025-02-09T22:06:17Z

+        const prompt = (from === 'auto' || from === '')
+            ? `Translate text to ${to}. Only return the translation without any additional text:\n\n${text}`
+            : `Translate text from ${from} to ${to}. Only return the translation without any additional text:\n\n${text}`;


Let's generate batch requests to translate array of texts with method translateBatch and then call this method with single text in translate method.

This approach may improve performance dramatically, since Linguist mostly call translateBatch method with as many texts as possible.

If we have 1k texts to translate, in current implementation user would sent 1k requests with prompt overhead, but if we will batch texts, linguist will send 50-200 texts in single request and eventually will sent only 5-20 requests.

vitonsky · 2025-02-09T22:08:06Z

+            console.error("Translation request failed:", error);
+            return '';


If exception has occurs, we must throw an error, to handle it properly. Otherwise user will see empty text in translations and will be confused

vitonsky · 2025-02-09T22:11:07Z

+    getLengthLimit() {
+        return 20000;
+    }


How you think is it optimal value?

Linguist uses this number to understand "can we put more text to current batch?" to translate as many texts as possible in single request.

How much chars ChatGPT can handle for single message?

vitonsky · 2025-02-09T22:22:34Z

+    test('handles length limit correctly', () => {
+        const longText = 'a'.repeat(21000);
+        expect(translator.checkLimitExceeding(longText)).toBe(1);
+        expect(translator.checkLimitExceeding('short text')).toBe(0);
+    });


This method must return the number of chars that is out of limit. It is necessary for Linguist batch scheduler to efficiently group texts by its size.

You may check example here

linguist-translators/translators/LibreTranslator.js

Lines 45 to 51 in 4e9c69c

checkLimitExceeding = (text) => {

const textLength = !Array.isArray(text)

? text.length

: text.reduce((len, text) => len + text.length, 0);

return textLength - this.getLengthLimit();

};

I think in this test a first assert should expect 1000 (that means we have to cut at least 1k chars) and second one is -19990 (that means we can add up to 19990 characters, and still fit in single request)

vitonsky · 2025-02-09T22:30:19Z

@vitonsky Any idea how to debug from inside eval() ? console.log and debugger; both are just not working.

You may use console log and maybe even debugger;, but code runs in extension context, not in web page context.

Try to open about:debugging#/runtime/this-firefox and click "Inspect" on Linguist extension if you use firefox or open chrome://extensions/ and toggle "Developer mode" then open and inspect Linguist service worker if you on chromium based browser.

There must be logs of custom translator module

a0o added 2 commits February 9, 2025 21:08

feat: base translation

8df8ec3

fix: try to switch fron json to raw text

08c4911

vitonsky reviewed Feb 9, 2025

View reviewed changes

vitonsky mentioned this pull request Feb 25, 2025

Add ChatGPT translator #4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Chatgpt api translator#9

[WIP] Chatgpt api translator#9
a0s wants to merge 2 commits into
translate-tools:masterfrom
a0s:chatgpt-api-translator

a0s commented Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Uh oh!

vitonsky commented Feb 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,78 @@
		class ChatGPTTranslator {
		constructor(model = 'gpt-4o', openApiKey = "") {

		console.error("Translation request failed:", error);
		return '';

	checkLimitExceeding = (text) => {
	const textLength = !Array.isArray(text)
	? text.length
	: text.reduce((len, text) => len + text.length, 0);

	return textLength - this.getLengthLimit();
	};

Conversation

a0s commented Feb 9, 2025

Uh oh!

vitonsky Feb 9, 2025

Choose a reason for hiding this comment

Uh oh!

vitonsky Feb 9, 2025

Choose a reason for hiding this comment

Uh oh!

vitonsky Feb 9, 2025

Choose a reason for hiding this comment

Uh oh!

vitonsky Feb 9, 2025

Choose a reason for hiding this comment

Uh oh!

vitonsky Feb 9, 2025

Choose a reason for hiding this comment

Uh oh!

vitonsky commented Feb 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants