
🌐 [i18n-KO] Translated main_classes/quantization.md to Korean #33959

Draft
fabxoe wants to merge 2 commits into base: main

Conversation

@fabxoe (Contributor) commented Oct 4, 2024

What does this PR do?

Translated the main_classes/quantization.md file of the documentation to Korean.
Thank you in advance for your review.

Part of #20179

Before reviewing

  • Check for missing / redundant translations (번역 누락/중복 검사)
  • Grammar Check (맞춤법 검사)
  • Review or Add new terms to glossary (용어 확인 및 추가)
  • Check Inline TOC (e.g. [[lowercased-header]])
  • Check live-preview for gotchas (live-preview로 정상작동 확인)

Who can review? (Initial)

@chhaewxn, @ahnjj, @jun048098, @fabxoe, @nuatmochoi, @heuristicwave

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review? (Final)


# 양자화[[quantization]]

Quantization techniques reduce memory and computational costs by representing weights and activations with lower-precision data types like 8-bit integers (int8). This enables loading larger models you normally wouldn't be able to fit into memory, and speeding up inference. Transformers supports the AWQ and GPTQ quantization algorithms and it supports 8-bit and 4-bit quantization with bitsandbytes.
Suggested change:
- Quantization techniques reduce memory and computational costs by representing weights and activations with lower-precision data types like 8-bit integers (int8). This enables loading larger models you normally wouldn't be able to fit into memory, and speeding up inference. Transformers supports the AWQ and GPTQ quantization algorithms and it supports 8-bit and 4-bit quantization with bitsandbytes.

It looks like the translation is already below, so I removed the original English text!


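(For context on the line discussed above: the bitsandbytes support it mentions works roughly as in the following minimal sketch. This is illustrative only; the model id is an arbitrary placeholder, not part of the file under review.)

```python
# Minimal sketch of 4-bit loading with bitsandbytes (illustrative only).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # quantize linear-layer weights to 4-bit at load time
    bnb_4bit_quant_type="nf4",             # use the NF4 4-bit data type
    bnb_4bit_compute_dtype=torch.float16,  # run matmuls in fp16
)

# "facebook/opt-350m" is just a placeholder model id for the example.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",
    quantization_config=quantization_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
```

For 8-bit, `load_in_8bit=True` is used instead; AWQ and GPTQ models are typically loaded from already-quantized checkpoints rather than quantized at load time.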

Quantization techniques that aren't supported in Transformers can be added with the [`HfQuantizer`] class.
Suggested change:
- Quantization techniques that aren't supported in Transformers can be added with the [`HfQuantizer`] class.

It looks like the translation is already below, so I removed the original English text!
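(For readers unfamiliar with the extension point mentioned in the line above, a very rough, hypothetical skeleton of a custom quantizer follows. The hook names are assumptions based on the `HfQuantizer` base class in `transformers.quantizers` and should be verified against the actual source.)

```python
# Hypothetical skeleton of a custom quantizer built on HfQuantizer (not a real backend).
# Verify hook names and signatures against transformers.quantizers.base before relying on this.
from transformers.quantizers import HfQuantizer


class MyQuantizer(HfQuantizer):
    requires_calibration = False  # weights are quantized on the fly at load time

    def validate_environment(self, *args, **kwargs):
        # Check that any third-party kernels this method depends on are installed.
        pass

    def _process_model_before_weight_loading(self, model, **kwargs):
        # Replace nn.Linear modules with quantized equivalents before the
        # checkpoint weights are loaded into the model.
        return model

    def _process_model_after_weight_loading(self, model, **kwargs):
        # Finalize the quantized modules once all weights have been loaded.
        return model

    @property
    def is_trainable(self):
        return False

    @property
    def is_serializable(self):
        return False
```

A real integration also needs a matching quantization config class and registration so that `from_pretrained` can dispatch to it, which is beyond the scope of this sketch.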


<Tip>

이 [양자화](../quantization) 가이드를 통해서 모델을 양자화하는 방법을 배울 수 있습니다.
Suggested change:
- [양자화](../quantization) 가이드를 통해서 모델을 양자화하는 방법을 배울 수 있습니다.
+ 모델을 양자화하는 방법은 이 [양자화](../quantization) 가이드를 통해 배울 수 있습니다.

I reworded it to read a bit more naturally!
