Actions: allenai/WildBench

Showing runs from all workflows
148 workflow runs

readme
Deploy static content to Pages #112: Commit 0b422b7 pushed by yuchenlin
June 28, 2024 00:50 2m 35s main
scoring minor fixes
Deploy static content to Pages #111: Commit 58bbf16 pushed by yuchenlin
June 26, 2024 06:21 2m 35s main
revised scoring results without truncations
Deploy static content to Pages #110: Commit 88704b7 pushed by yuchenlin
June 26, 2024 02:53 2m 27s main
Merge branch 'main' of https://github.com/allenai/WildBench into main
Deploy static content to Pages #109: Commit 4ffcbde pushed by yuchenlin
June 20, 2024 18:25 2m 27s main
Delete evaluation/eval_template.score.v2.0522.md
Deploy static content to Pages #108: Commit 95b804a pushed by yuchenlin
June 20, 2024 18:17 2m 8s main
update to new elos
Deploy static content to Pages #107: Commit ebc39ab pushed by yuchenlin
June 19, 2024 18:26 2m 27s main
neo_7b_instruct_v0.1-ExPO data
Deploy static content to Pages #106: Commit 81d1f6b pushed by yuchenlin
June 19, 2024 17:59 2m 31s main
add SELM haiku reward
Deploy static content to Pages #105: Commit a3bfe65 pushed by yuchenlin
June 19, 2024 06:56 2m 27s main
expo data
Deploy static content to Pages #104: Commit cc8c546 pushed by yuchenlin
June 19, 2024 05:19 2m 24s main
visualization
Deploy static content to Pages #103: Commit ab17c64 pushed by yuchenlin
June 19, 2024 00:03 2m 56s main
add leaderboard preview feature here
Deploy static content to Pages #102: Commit 8ed3a40 pushed by yuchenlin
June 18, 2024 23:59 4m 25s main
add results for new models GLM-4; DeepSeek v2 coder. SELM
Deploy static content to Pages #101: Commit 7a7002e pushed by yuchenlin
June 18, 2024 23:59 2m 18s main
update common vllm script to support 70B models
Deploy static content to Pages #100: Commit b931c8d pushed by yuchenlin
June 18, 2024 21:41 2m 19s main
readme
Deploy static content to Pages #99: Commit 6049a71 pushed by yuchenlin
June 18, 2024 21:13 2m 35s main
add neo 7b
Deploy static content to Pages #98: Commit e705a4f pushed by yuchenlin
June 14, 2024 06:36 2m 28s main
common inference code
Deploy static content to Pages #97: Commit b2324ae pushed by yuchenlin
June 14, 2024 01:41 2m 16s main
update citation
Deploy static content to Pages #96: Commit 528f9be pushed by yuchenlin
June 14, 2024 01:20 2m 16s main
add todo models
Deploy static content to Pages #95: Commit 54e3a50 pushed by yuchenlin
June 13, 2024 07:28 2m 21s main
Update README.md
Deploy static content to Pages #94: Commit 693e144 pushed by yuchenlin
June 13, 2024 07:14 3m 30s main
Update README.md
Deploy static content to Pages #93: Commit 85b8aa0 pushed by yuchenlin
June 13, 2024 07:13 2m 22s main
Merge branch 'main' of https://github.com/allenai/WildBench into main
Deploy static content to Pages #92: Commit 1afa425 pushed by yuchenlin
June 13, 2024 05:40 2m 17s main
Merge pull request #13 from da03/yi
Deploy static content to Pages #91: Commit a2612f4 pushed by yuchenlin
June 12, 2024 22:20 2m 11s main
Update README.md
Deploy static content to Pages #90: Commit b39dbd7 pushed by yuchenlin
June 10, 2024 19:35 2m 9s main
Update eval_template.score.v2.md
Deploy static content to Pages #89: Commit 8d0a554 pushed by yuchenlin
June 10, 2024 00:52 2m 6s main
Update README.md
Deploy static content to Pages #88: Commit c7c3479 pushed by yuchenlin
June 7, 2024 04:12 3m 0s main