Open
Description
We need a simple way for a user to load a Unitxt dataset based on existing files or data structures.
This is required for ilab (@Roni-Friedman ) and also for other use cases.
I thought that maybe a dataframe is a common interface that everyone knows.
load_dataset(task="tasks.classification.multi_class", test_set = pd.from_csv("test.csv"))
load_dataset(task="tasks.classification.multi_class", test_set = pd.from_csv("test.csv"), train_set=pd.from_csv("train.csv" )
And
evaluate(predications, task="tasks.classification.multi_class", test_set = pd.from_csv("test.csv"))
The assumption is that the fields of input dataframe are the same as the task, and a clear error message will be presented if not.
Metadata
Assignees
Labels
No labels