Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,401 workflow runs
1,401 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Merge pull request #276 from shizhediao/patch-2
Test & Check Code Quality #218: Commit 3b91550 pushed by hynky1999
August 28, 2024 09:02 3m 31s main
August 28, 2024 09:02 3m 31s
Merge pull request #276 from shizhediao/patch-2
Secret Leaks #54: Commit 3b91550 pushed by hynky1999
August 28, 2024 09:02 16s main
August 28, 2024 09:02 16s
Update filter_hf_dataset.py
Test & Check Code Quality #216: Pull request #274 opened by shizhediao
August 25, 2024 22:29 2m 54s shizhediao:patch-1
August 25, 2024 22:29 2m 54s
Video support for datatrove
Test & Check Code Quality #215: Pull request #271 synchronize by mfarre
August 22, 2024 20:53 3m 5s mfarre:f-mediabundle
August 22, 2024 20:53 3m 5s
Video support for datatrove
Test & Check Code Quality #214: Pull request #271 synchronize by mfarre
August 22, 2024 20:48 3m 45s mfarre:f-mediabundle
August 22, 2024 20:48 3m 45s
Video support for datatrove
Test & Check Code Quality #213: Pull request #271 synchronize by mfarre
August 22, 2024 20:46 3m 57s mfarre:f-mediabundle
August 22, 2024 20:46 3m 57s
Video support for datatrove
Test & Check Code Quality #212: Pull request #271 opened by guipenedo
August 21, 2024 22:55 3m 24s mfarre:f-mediabundle
August 21, 2024 22:55 3m 24s
Add expand_metadata Option to JsonlWriter
Test & Check Code Quality #211: Pull request #268 opened by justHungryMan
August 17, 2024 16:40 3m 4s justHungryMan:feature/expand_metadata
August 17, 2024 16:40 3m 4s
Merge pull request #252 from justHungryMan/examples/minhash_dedup
Secret Leaks #53: Commit 6102f59 pushed by hynky1999
August 14, 2024 12:15 18s main
August 14, 2024 12:15 18s
Merge pull request #252 from justHungryMan/examples/minhash_dedup
Test & Check Code Quality #210: Commit 6102f59 pushed by hynky1999
August 14, 2024 12:15 3m 17s main
August 14, 2024 12:15 3m 17s
Merge pull request #264 from tylerjthomas9/trafilatura-short-text
Secret Leaks #52: Commit 83dbddf pushed by hynky1999
August 14, 2024 11:52 24s main
August 14, 2024 11:52 24s
Merge pull request #264 from tylerjthomas9/trafilatura-short-text
Test & Check Code Quality #208: Commit 83dbddf pushed by hynky1999
August 14, 2024 11:52 3m 21s main
August 14, 2024 11:52 3m 21s
Fix test_basic_article_trafilatura test failure
Test & Check Code Quality #207: Pull request #264 synchronize by hynky1999
August 14, 2024 11:48 3m 10s tylerjthomas9:trafilatura-short-text
August 14, 2024 11:48 3m 10s
Fix test_basic_article_trafilatura test failure
Test & Check Code Quality #206: Pull request #264 synchronize by hynky1999
August 14, 2024 10:49 3m 13s tylerjthomas9:trafilatura-short-text
August 14, 2024 10:49 3m 13s
Merge pull request #253 from aiqwe/improve_readibility
Secret Leaks #51: Commit 451e593 pushed by hynky1999
August 14, 2024 10:41 21s main
August 14, 2024 10:41 21s
Merge pull request #253 from aiqwe/improve_readibility
Test & Check Code Quality #205: Commit 451e593 pushed by hynky1999
August 14, 2024 10:41 3m 7s main
August 14, 2024 10:41 3m 7s
fix correct type inference for cache filesystems (#257)
Secret Leaks #50: Commit b5443d2 pushed by guipenedo
July 22, 2024 09:17 18s main
July 22, 2024 09:17 18s
fix correct type inference for cache filesystems (#257)
Test & Check Code Quality #203: Commit b5443d2 pushed by guipenedo
July 22, 2024 09:17 3m 42s main
July 22, 2024 09:17 3m 42s
fix correct type inference for cached filesystems
Test & Check Code Quality #202: Pull request #257 opened by hynky1999
July 20, 2024 11:33 3m 10s filecache_handling
July 20, 2024 11:33 3m 10s
not -> is None
Secret Leaks #48: Commit 7349902 pushed by guipenedo
July 17, 2024 12:03 19s main
July 17, 2024 12:03 19s
not -> is None
Test & Check Code Quality #200: Commit 7349902 pushed by guipenedo
July 17, 2024 12:03 2m 45s main
July 17, 2024 12:03 2m 45s
Add token and char count to histogram stats (#251)
Secret Leaks #47: Commit 452e69a pushed by guipenedo
July 15, 2024 15:47 21s main
July 15, 2024 15:47 21s
Add token and char count to histogram stats (#251)
Test & Check Code Quality #194: Commit 452e69a pushed by guipenedo
July 15, 2024 15:47 2m 47s main
July 15, 2024 15:47 2m 47s