[FEATURE] Compute the input/output tokens of a dataset #1046

@plaguss

Is your feature request related to a problem? Please describe.
Once #1034 is merged, the statistics generated by an LLM will be available. We could add functionality to compute summary statistics on these values.

Add an easier way to pass functions that operate on these generated statistics, so that computations on them are simple to run.

Describe the solution you'd like
A report in the dataset's README with the number of tokens generated and, for example, some descriptive statistics per row.
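As a rough illustration of what such a report could look like, here is a minimal sketch in plain Python. Everything in it is an assumption made for the example: the row layout (one dict per row with `input_tokens` and `output_tokens` keys), the helper names `summarize_tokens` and `to_markdown`, and the reading of "descriptive stats" as aggregates over the per-row token counts. It is not the library's API. The aggregations are passed in as ordinary callables, which is one possible way to let users plug in their own functions.

```python
from statistics import mean, median
from typing import Callable, Iterable, Mapping, Sequence

# Hypothetical row layout: each row carries its own token counts.
Row = Mapping[str, int]

# Aggregations are plain callables, so callers can supply their own.
DEFAULT_STATS: dict[str, Callable[[Sequence[int]], float]] = {
    "total": sum,
    "mean": mean,
    "median": median,
    "min": min,
    "max": max,
}


def summarize_tokens(
    rows: Iterable[Row],
    columns: Sequence[str] = ("input_tokens", "output_tokens"),
    stats: Mapping[str, Callable[[Sequence[int]], float]] = DEFAULT_STATS,
) -> dict[str, dict[str, float]]:
    """Apply every aggregation in `stats` to each token column.

    Assumes at least one row; `mean` and `median` raise on empty input.
    """
    rows = list(rows)
    summary = {}
    for column in columns:
        values = [row[column] for row in rows]
        summary[column] = {name: fn(values) for name, fn in stats.items()}
    return summary


def to_markdown(summary: Mapping[str, Mapping[str, float]]) -> str:
    """Render the summary as a Markdown table suitable for a README."""
    stat_names = list(next(iter(summary.values())))
    lines = [
        "| column | " + " | ".join(stat_names) + " |",
        "|---|" + "---|" * len(stat_names),
    ]
    for column, values in summary.items():
        cells = " | ".join(f"{values[name]:g}" for name in stat_names)
        lines.append(f"| {column} | {cells} |")
    return "\n".join(lines)


# Made-up token counts, for illustration only.
rows = [
    {"input_tokens": 12, "output_tokens": 40},
    {"input_tokens": 8, "output_tokens": 25},
    {"input_tokens": 10, "output_tokens": 55},
]
summary = summarize_tokens(rows)
print(to_markdown(summary))
```

A custom aggregation can be added by passing a different mapping, for example `summarize_tokens(rows, stats={**DEFAULT_STATS, "range": lambda v: max(v) - min(v)})`.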

Describe alternatives you've considered
Compute these statistics by hand.


Labels: enhancement
