Skip to content

use bloom-350m to train reward model in step2 #356

Open
@70557dzqc

Description

@70557dzqc

I want to train bloom_350m in chinese dataset, and run run_350m.sh, change the model_name_or_path. But the loss is nan, how should I solve it? Is the argument "num_padding_at_beginning" cause this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    deespeed chatDeepSpeed Chatnew-configA modified config from the given example

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions