Final eval mtbench_branch #67
@MichaelClifford hold off on reviewing this for a bit. I'm importing defs from ilab rather than copy/pasting them here, and will push that change after I test that it works.
@MichaelClifford this is ready for review, test running now
LGTM! Thanks @sallyom
Will merge after the current run completes
Signed-off-by: sallyom <[email protected]>
    def git_clone_op(
        taxonomy: dsl.Output[dsl.Dataset],
        repo_branch: str,
        repo_pr: Optional[int],
        repo_url: Optional[str],
    ):
        return
        return dsl.ContainerSpec(
Why did we need this change?
This adds final eval `mt_bench_branch`.

It also adds a helpers package for `launch_vllm` and `stop_vllm`. These are used in `mt_bench_branch`, `mmlu_branch`, and `mt_bench`, so they need a central place.

For `mmlu_branch` and `mt_bench_branch`, a taxonomy is required. This PR adds the taxonomy to the fake SDG fixtures, so that SDG can be mocked while running final evaluation.
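For readers unfamiliar with the pattern, here is a minimal sketch of what a shared launch/stop helper pair can look like. It is not the code from this PR: `launch_server` and `stop_server` are hypothetical stand-ins for `launch_vllm` and `stop_vllm`, their signatures are assumptions, and the command to run is passed in by the caller rather than hard-coded to vLLM.

```python
import subprocess
import sys
import time
from typing import List, Optional


def launch_server(cmd: List[str], startup_wait: float = 0.5) -> subprocess.Popen:
    """Start a long-running server process and return its handle.

    Hypothetical stand-in for launch_vllm; the real helper's signature
    and readiness check are not shown in this PR page.
    """
    proc = subprocess.Popen(cmd)
    # Give the process a moment, then fail fast if it already exited.
    time.sleep(startup_wait)
    if proc.poll() is not None:
        raise RuntimeError(f"server exited early with code {proc.returncode}")
    return proc


def stop_server(proc: subprocess.Popen, timeout: float = 10.0) -> Optional[int]:
    """Stop the process, escalating from terminate to kill on timeout.

    Hypothetical stand-in for stop_vllm.
    """
    if proc.poll() is None:
        proc.terminate()
        try:
            proc.wait(timeout=timeout)
        except subprocess.TimeoutExpired:
            proc.kill()
            proc.wait()
    return proc.returncode


if __name__ == "__main__":
    # Placeholder command: a Python process that just sleeps.
    server = launch_server([sys.executable, "-c", "import time; time.sleep(30)"])
    try:
        pass  # an evaluation step would talk to the server here
    finally:
        stop_server(server)
```

Keeping both functions in one module means each evaluation step can wrap its work in the same `try`/`finally`, so the server is stopped even when the evaluation raises.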