Description
Hi there 👋
Let's review the zh-CN
translation of the course, so that the whole community can benefit from this resource 🌎!
As well as fix: #118 ;-D
Below are the chapters and files that need reviewing - let us know here if you'd like to review any and we'll add your name to the list. Once you're finished, open a pull request and tag this issue by including #552
in the description.
💯 Considering that this is a course designed for beginners, it might be helpful to provide necessary explanations for some technical terms used throughout the course, making the content more accessible to the target audience.
The review work for this NLP course is also considered one of the "onboarding tasks" for new translation volunteers.
The recommended PR title is: docs(zh-cn): [NLP Course] Reviewed {{File_Name}}
like:
docs(zh-cn): [NLP Course] Reviewed `chapter0/1.mdx`
Note:
- Please claim one course review item at a time.
- Considering it will take times to review your PR, please try to 1) Fork this repo 2) Start a new branch to review 3) Open a Pull Request from that branch.
- Let the team know if you have any questions here on the Discord
chinese-simplified-translation
channel.
Chapters
0 - Setup
-
chapter0/1.mdx
- 课程简介
1 - Transformer models
-
chapter1/1.mdx
- 本章简介 -
chapter1/2.mdx
- 自然语言处理 -
chapter1/3.mdx
- Transformers 能做什么 -
chapter1/4.mdx
- Transformers 是如何工作的 -
chapter1/5.mdx
- “编码器”模型 -
chapter1/6.mdx
- “解码器”模型 -
chapter1/7.mdx
- 序列到序列模型 -
chapter1/8.mdx
- 偏见和局限性 -
chapter1/9.mdx
- 总结 -
chapter1/10.mdx
- 章末小测试
2 - Using 🤗 Transformers - WHOLE chapter claimed by @wwzhuang01
-
chapter2/1.mdx
- 本章简介 -
chapter2/2.mdx
- 管道的内部 -
chapter2/3.mdx
- 模型 -
chapter2/4.mdx
- 标记器 -
chapter2/5.mdx
- 处理多个序列 -
chapter2/6.mdx
- 把它们放在一起 -
chapter2/7.mdx
- 基本用法完成 -
chapter2/8.mdx
- 章末小测试
3 - Fine-tuning a pretrained model - WHOLE chapter claimed by @Decem-Y
-
chapter3/1.mdx
- 本章简介 -
chapter3/2.mdx
- 处理数据 -
chapter3/3.mdx
- 使用 Trainer API 微调模型 -
chapter3/3_tf.mdx
- 使用 Keras 微调一个模型 -
chapter3/4.mdx
- 一个完整的训练 -
chapter3/5.mdx
- 微调,检查 -
chapter3/6.mdx
- 章末小测验
4 - Sharing models and tokenizers
-
chapter4/1.mdx
- The Hugging Face Hub -
chapter4/2.mdx
- 使用预训练模型 -
chapter4/3.mdx
- 共享预训练模型 -
chapter4/4.mdx
- 构建模型卡片 -
chapter4/5.mdx
- Part 1 完结 -
chapter4/6.mdx
- 章末小测试
5 - The 🤗 Datasets library
-
chapter5/1.mdx
- 本章简介 -
chapter5/2.mdx
- 如果我的数据集不在 Hub 上怎么办 -
chapter5/3.mdx
- 是时候来学一下切片了 -
chapter5/4.mdx
- 大数据? 🤗 Datasets 来救援 -
chapter5/5.mdx
- 创建自己的数据集 -
chapter5/6.mdx
- 使用 FAISS 进行语义搜索 -
chapter5/7.mdx
- 🤗 Datasets 回顾 -
chapter5/8.mdx
- 章末小测试
6 - The 🤗 Tokenizers library
-
chapter6/1.mdx
- 本章简介 -
chapter6/2.mdx
- 根据已有的tokenizer训练新的tokenizer -
chapter6/3.mdx
- 快速标记器的特殊能力 -
chapter6/3b.mdx
- QA 管道中的快速标记器 -
chapter6/4.mdx
- 标准化和预标记化 -
chapter6/5.mdx
- 字节对编码标记化 -
chapter6/6.mdx
- WordPiece 标记化 -
chapter6/7.mdx
- Unigram标记化 -
chapter6/8.mdx
- 逐块地构建标记器 -
chapter6/9.mdx
- 标记器回顾 -
chapter6/10.mdx
- 章末小测试
7 - Main NLP tasks
-
chapter7/1.mdx
- 本章简介 -
chapter7/2.mdx
- Token 分类 -
chapter7/3.mdx
- 微调掩码语言模型 -
chapter7/4.mdx
- 翻译 -
chapter7/5.mdx
- 提取文本摘要 -
chapter7/6.mdx
- 从头开始训练因果语言模型 -
chapter7/7.mdx
- 问答 -
chapter7/8.mdx
- 精通自然语言处理 -
chapter7/9.mdx
- 章末小测试
8 - How to ask for help
-
chapter8/1.mdx
- 本章简介 -
chapter8/2.mdx
- 出现错误时该怎么办 -
chapter8/3.mdx
- 在论坛上寻求帮助 -
chapter8/4.mdx
- 调试训练管道 -
chapter8/4_tf.mdx
- Debugging the training pipeline -
chapter8/5.mdx
- 如何写一个好问题 -
chapter8/6.mdx
- 第2部分完成 -
chapter8/7.mdx
- 章末小测试
Events
-
events/1.mdx
- Live sessions and workshops (need translation) -
events/2.mdx
- Part 2 发布活动 -
events/3.mdx
- Gradio Blocks Party (need translation)