Skip to content

Readme train.py command doesn't create saved output once training is complete #71

@grctest

Description

@grctest

The readme instructs to use 601 steps: --total_steps 601

https://github.com/microsoft/KBLaM/blob/main/README.md?plain=1#L58

However the train.py file has a static value of 3000 steps for the save period: save_period=3000

https://github.com/microsoft/KBLaM/blob/main/experiments/train.py#L955

After reaching 100% training, the output folder was empty:

  Training ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% Loss: 9.2729 0:00:002 0:00:2704:29

Took about 4 hours to train, but then it didn't save the result 😅

So should the suggested steps in the readme be increased, or should the static save_period value be lowered?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions