Skip to content

Could this tool apply for encoder-decoder model, like Flan-T5? #340

Open
@henryxiao1997

Description

@henryxiao1997

It's great tool to show how to build a chatgpt-like model based on some foundation models. I wonder could it support the encoder-decoder model, like Flan-T5? Could we just directly load those models and run these scripts on them? Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    deespeed chatDeepSpeed Chatnew-configA modified config from the given example

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions