Tool Calling Fine-Tuning fails because of validation logic in messages

Hi Team, 

Please correct me if my understanding is wrong or if I missed something, but I believe the validation logic in `torchtune/data/_messages.py` ([here](https://github.com/pytorch/torchtune/blob/main/torchtune/data/_messages.py#L634)) doesn't account for the tool-calling flow.

My dataset follows this flow: system -> user -> assistant -> tool -> assistant -> user, and so on.
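To make the expected turn order concrete, here is a minimal, self-contained sketch of a role-order check that would accept this flow. The `ALLOWED_NEXT` table and `check_role_order` function are hypothetical names used only for illustration; this is not torchtune's validation code, and the role name `tool` is simply the one used in my dataset.

```python
# Hypothetical transition table: which role may follow which.
# This is an illustration only, not torchtune's actual validation logic.
ALLOWED_NEXT = {
    "system": {"user"},
    "user": {"assistant"},
    "assistant": {"user", "tool"},  # assistant may hand off to a tool
    "tool": {"assistant"},          # tool output goes back to the assistant
}


def check_role_order(roles):
    """Raise ValueError if the role sequence breaks the tool-calling flow."""
    if not roles:
        raise ValueError("conversation is empty")
    if roles[0] not in ("system", "user"):
        raise ValueError(f"conversation cannot start with '{roles[0]}'")
    for i, (prev, cur) in enumerate(zip(roles, roles[1:]), start=1):
        if cur not in ALLOWED_NEXT.get(prev, set()):
            raise ValueError(f"turn {i}: '{cur}' cannot follow '{prev}'")


flow = ["system", "user", "assistant", "tool", "assistant", "user"]
check_role_order(flow)  # accepted: no exception is raised
```

The point of the sketch is that an assistant turn is valid after either a user turn or a tool turn, which is the case I think the current check does not allow.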

I've set up the correct role mappings for this dataset. I also tried extending `SFTDataset` with a Transform that ensures the `eot=False` logic is followed for a) the assistant message that precedes a tool call and b) the tool call itself.

However, both approaches fail the validation.
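For reference, the `eot=False` rule I'm describing can be sketched roughly as below. `Turn` is a hypothetical stand-in dataclass and `mark_tool_turns` is a hypothetical helper, both written only for this illustration; neither is torchtune's `Message` class or my exact Transform.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Turn:
    """Hypothetical stand-in for a chat message (not torchtune's Message)."""
    role: str
    content: str
    eot: bool = True


def mark_tool_turns(turns):
    """Return copies of the turns with eot=False on tool-related turns.

    eot is cleared for (a) an assistant turn that is immediately followed
    by a tool turn and (b) the tool turn itself; all other turns get eot=True.
    """
    marked = []
    for i, turn in enumerate(turns):
        next_role = turns[i + 1].role if i + 1 < len(turns) else None
        is_tool = turn.role == "tool"
        calls_tool = turn.role == "assistant" and next_role == "tool"
        marked.append(replace(turn, eot=not (is_tool or calls_tool)))
    return marked


# Placeholder contents; only the roles matter for this sketch.
roles = ["system", "user", "assistant", "tool", "assistant", "user"]
turns = [Turn(role=r, content=f"<{r} text>") for r in roles]
for t in mark_tool_turns(turns):
    print(t.role, t.eot)
```

With this rule, only the assistant turn that issues the call and the tool turn that answers it keep the turn open; the assistant's final reply closes it.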

Please let me know if I missed something super obvious. 

Thanks!

Tool Calling Fine-Tuning fails because of validation logic in messages #2405

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development