Skip to content

Pull requests: huggingface/datasets

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Fix the environment variable for huggingface cache
#7200 opened Oct 5, 2024 by torotoki Loading…
Add with_rank to Dataset.from_generator
#7199 opened Oct 4, 2024 by muthissar Loading…
Add repeat method to datasets
#7198 opened Oct 4, 2024 by alex-hh Loading…
Support features in metadata configs
#7182 opened Sep 30, 2024 by albertvillanova Loading…
1 task
Support Python 3.11
#7179 opened Sep 27, 2024 by albertvillanova Loading…
fix grammar in fingerprint.py
#7176 opened Sep 26, 2024 by jxmorris12 Loading…
google colab ex
#7158 opened Sep 23, 2024 by docfhsp Loading…
Do not consume unnecessary memory during sharding
#7136 opened Sep 4, 2024 by janEbert Loading…
remove filecheck to enable symlinks
#7133 opened Aug 30, 2024 by fschlatt Loading…
Fix data file module inference
#7132 opened Aug 29, 2024 by HennerM Loading…
Add Arabic Docs to Datasets
#7094 opened Aug 7, 2024 by AhmedAlmaghz Loading…
Make BufferShuffledExamplesIterable resumable
#7056 opened Jul 22, 2024 by yzhangcs Loading…
Support folder-based datasets with large metadata.jsonl
#6859 opened May 2, 2024 by gbenson Loading…
Support downloading specific splits in load_dataset
#6832 opened Apr 23, 2024 by mariosasko Loading…
Make Image cast storage faster
#6786 opened Apr 5, 2024 by Modexus Loading…
3x Faster Text Preprocessing
#6711 opened Mar 3, 2024 by ashvardanian Loading…
__add__ for Dataset, IterableDataset
#6694 opened Feb 26, 2024 by oh-gnues-iohc Loading…
ProTip! Add no:assignee to see everything that’s not assigned.