Skip to content

Pull requests: huggingface/datasets

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[MINOR:TYPO] Update arrow_dataset.py
#7236 opened Oct 18, 2024 by cakiki Loading…
(Super tiny doc update) Mention to_polars
#7232 opened Oct 17, 2024 by fzyzcjy Loading…
Video support
#7230 opened Oct 15, 2024 by lhoestq Draft
handle config_name=None in push_to_hub
#7229 opened Oct 15, 2024 by alex-hh Loading…
fast array extraction
#7227 opened Oct 14, 2024 by alex-hh Loading…
Add with_rank to Dataset.from_generator
#7199 opened Oct 4, 2024 by muthissar Loading…
Add repeat method to datasets
#7198 opened Oct 4, 2024 by alex-hh Loading…
[WIP] Fix datasets export to JSON
#7181 opened Sep 29, 2024 by varadhbhatnagar Loading…
fix grammar in fingerprint.py
#7176 opened Sep 26, 2024 by jxmorris12 Loading…
google colab ex
#7158 opened Sep 23, 2024 by docfhsp Loading…
Do not consume unnecessary memory during sharding
#7136 opened Sep 4, 2024 by janEbert Loading…
remove filecheck to enable symlinks
#7133 opened Aug 30, 2024 by fschlatt Loading…
Fix data file module inference
#7132 opened Aug 29, 2024 by HennerM Loading…
Add Arabic Docs to Datasets
#7094 opened Aug 7, 2024 by AhmedAlmaghz Loading…
Make BufferShuffledExamplesIterable resumable
#7056 opened Jul 22, 2024 by yzhangcs Loading…
Support folder-based datasets with large metadata.jsonl
#6859 opened May 2, 2024 by gbenson Loading…
Support downloading specific splits in load_dataset
#6832 opened Apr 23, 2024 by mariosasko Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.