Skip to content

step3 answer is not correct #547

Open
@BaiStone2017

Description

@BaiStone2017

setting as follow:
disable HE (HE + zero2 occur error)
pp_epochs=1
num_train_epochs=1
disable_actor_dropout
per_device_train_batch_size and per_device_mini_train_batch_size are 2

actor loss
image
inference demo:
image

while actor_ema seems normal but it's effect is same with sft model

Metadata

Metadata

Assignees

No one assigned

    Labels

    deespeed chatDeepSpeed ChatmodelingRelated to modeling questions.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions