-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Open
Labels
documentationImprovements or additions to documentationImprovements or additions to documentation
Description
1. 遇到问题的章节 / Affected Chapter
chapter6
2. 具体问题描述 / Problem Description
缺失强化学习部分
3. 问题重现材料 / Reproduction Materials
第四章末尾有写:接下来,我们将依次实现如何从零开始训练一个 LLM,包括预训练、SFT 和 RLHF。
第六章只存在pretrain以及SFT,没有RLHF
确认事项 / Verification
- 此问题未在过往Issue中被报告过 / This issue hasn't been reported before
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
documentationImprovements or additions to documentationImprovements or additions to documentation