Skip to content

[问题/Issue] 章节4:在后续章节中没有RLHF的代码 #164

@Hbink-cx

Description

@Hbink-cx

1. 遇到问题的章节 / Affected Chapter

chapter6

2. 具体问题描述 / Problem Description

缺失强化学习部分

3. 问题重现材料 / Reproduction Materials

第四章末尾有写:接下来,我们将依次实现如何从零开始训练一个 LLM,包括预训练、SFT 和 RLHF。
第六章只存在pretrain以及SFT,没有RLHF

确认事项 / Verification

  • 此问题未在过往Issue中被报告过 / This issue hasn't been reported before

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions