Skip to content

Conversation

@plaguss
Copy link
Contributor

@plaguss plaguss commented Dec 11, 2024

Description

  • Add PRM for MathShepherd tasks.

This PR adds a new method to the _Step class called _dataset_use. By default doesn't contain anything, but can be used to enhance the dataset card of a Distiset. The examples implemented here correspond to the classes that prepare the data for fine-tuning, it includes an example command using TRL.

To include a new "use" for a dataset associated with a step, we have to override the _dataset_use method, take a look at FormatTextGenerationSFT or FormatTextGenerationDPO for a example.

An example can be seen in this dummy dataset: https://huggingface.co/datasets/plaguss/test_dataset_use

image

@plaguss plaguss added documentation Improvements or additions to documentation enhancement New feature or request labels Dec 11, 2024
@plaguss plaguss added this to the 1.5.0 milestone Dec 11, 2024
@plaguss plaguss requested a review from gabrielmbmb December 11, 2024 15:19
@plaguss plaguss self-assigned this Dec 11, 2024
@plaguss plaguss marked this pull request as ready for review December 11, 2024 15:19
@github-actions
Copy link

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-1079/

@codspeed-hq
Copy link

codspeed-hq bot commented Dec 11, 2024

CodSpeed Performance Report

Merging #1079 will improve performances by ×7.2

Comparing dataset-card-help (ca07a58) with develop (63c75c5)

Summary

⚡ 1 improvements

Benchmarks breakdown

Benchmark develop dataset-card-help Change
test_cache_time 3,985.3 ms 553.1 ms ×7.2

@gabrielmbmb gabrielmbmb removed this from the 1.5.0 milestone Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants