Skip to content

Tool Calling Fine-Tuning fails because of validation logic in messages #2405

Open
@init27

Description

Hi Team,

First, please correct me if my understanding is wrong or if I missed something.

However, I believe the validation logic in torch.data.messages here doesn't account for tool calling flow.

I've a dataset which follows the following flow: system -> user -> assistant -> tool -> assistant -> user and so on and so forth.

I've setup the correct role mappings for this dataset and also tried extending SFTDataset by creating a Transform which ensures that eot=False logic is followed for a) Assistant call before Tool call, b) Tool Call

However both fail the validation.

Please let me know if I missed something super obvious.

Thanks!

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingcommunity help wantedWe would love the community's help completing this issue

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions