feat: add final evaluation #84

leseb · 2024-10-10T15:39:42Z

8ff3780 feat: add final evaluation
113078c wip:final eval
fd52c08 update sdg generated line in knowledge-yaml
fc7067b fix typo on task generate
339ae90 fix: do not use backticks
5e74166 fix: add final details to push model to S3
a28d739 bulk: mt-bench eval, final eval and trained model push to S3
0885fa5 chore: remove helpers directory

commit 8ff3780
Author: Sébastien Han [email protected]
Date: Thu Oct 10 17:39:25 2024 +0200

feat: add final evaluation

Signed-off-by: Sébastien Han <[email protected]>

commit 113078c
Author: Michael Clifford [email protected]
Date: Thu Oct 10 20:04:04 2024 -0400

wip:final eval

Signed-off-by: Michael Clifford <[email protected]>

commit fd52c08
Author: sallyom [email protected]
Date: Fri Oct 11 13:21:38 2024 -0400

update sdg generated line in knowledge-yaml

Signed-off-by: sallyom <[email protected]>

commit fc7067b
Author: Sébastien Han [email protected]
Date: Mon Oct 14 09:40:50 2024 +0200

fix typo on task generate

It used to be `/data/generated`.

Signed-off-by: Sébastien Han <[email protected]>

commit 339ae90
Author: Sébastien Han [email protected]
Date: Mon Oct 14 09:41:33 2024 +0200

fix: do not use backticks

The shell invoked by the Python executor interprets the content between
backticks as a command.

Signed-off-by: Sébastien Han <[email protected]>
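
To illustrate the behavior this commit message describes: when a command string goes through a shell, text between backticks is executed as a command substitution. The sketch below is not the pipeline's code. `run_via_shell` is a hypothetical helper, and the example assumes a POSIX `sh` is available.

```python
import shlex
import subprocess


def run_via_shell(script: str) -> str:
    """Run a script string through the shell and return its stdout.

    Hypothetical helper for illustration only.
    """
    result = subprocess.run(
        script, shell=True, capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


# Unquoted backticks: the shell runs `echo injected` and substitutes its output.
interpreted = run_via_shell("echo path is `echo injected`")

# Quoting the text with shlex.quote keeps the backticks literal.
literal = run_via_shell("echo " + shlex.quote("path is `echo injected`"))
```

Dropping the backticks from the generated script, as this commit does, avoids the substitution without needing any quoting.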

commit 5e74166
Author: Sébastien Han [email protected]
Date: Mon Oct 14 09:43:24 2024 +0200

fix: add final details to push model to S3

Signed-off-by: Sébastien Han <[email protected]>

commit a28d739
Author: Sébastien Han [email protected]
Date: Mon Oct 14 23:00:15 2024 +0200

bulk: mt-bench eval, final eval and trained model push to S3

- do not print final eval scores in logs
- use the correct model location for final push
- fix job/cr watch
- add progress bar for download/upload
- add a few global variables for eval steps
- handle the case where pods are removed but the job exists

Signed-off-by: Sébastien Han <[email protected]>
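
One possible shape for the download/upload progress reporting mentioned above is a byte-counting callback. This is only a sketch under assumptions: the class name `TransferProgress` is hypothetical, and whether the PR uses a callback of this kind is not stated in the commit message.

```python
import sys
import threading


class TransferProgress:
    """Byte-count progress reporter (hypothetical name, not taken from the PR)."""

    def __init__(self, total_bytes: int, label: str):
        self._total = total_bytes
        self._label = label
        self._seen = 0
        # Transfer callbacks may be invoked from several threads.
        self._lock = threading.Lock()

    @property
    def percent(self) -> float:
        """Share of the transfer completed so far, in percent."""
        if self._total == 0:
            return 100.0
        return 100.0 * self._seen / self._total

    def __call__(self, bytes_amount: int) -> None:
        """Record a transferred chunk and redraw the progress line on stderr."""
        with self._lock:
            self._seen += bytes_amount
            sys.stderr.write(
                f"\r{self._label}: {self._seen}/{self._total} bytes "
                f"({self.percent:.1f}%)"
            )
            sys.stderr.flush()
```

If the transfer is done with boto3, an instance could be passed as the `Callback` argument of `upload_file` or `download_file`, which call it with the number of bytes transferred in each chunk.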

commit 0885fa5
Author: Sébastien Han [email protected]
Date: Tue Oct 15 17:12:32 2024 +0200

chore: remove helpers directory

The functions from the helpers are embedded into the components
themselves, so the helpers directory is now unused.

Signed-off-by: Sébastien Han <[email protected]>

sallyom · 2024-10-11T18:46:53Z

eval/final/components.py

+                    updated_lines.append(new_line)
+
+                if changed:
+                    with open(file_path, "w", encoding="utf-8") as file:


@MichaelClifford the `encoding="utf-8"` is there to fix the check_printable error 🙏
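
For context, `open()` without an explicit `encoding` falls back to the platform's locale default, so files containing non-ASCII characters can be read or written inconsistently across environments. The sketch below shows the write-only-if-changed pattern from the diff with the encoding pinned. `rewrite_lines` and the sample file are hypothetical and are not the component's actual code.

```python
import tempfile
from pathlib import Path
from typing import Callable


def rewrite_lines(file_path: Path, transform: Callable[[str], str]) -> bool:
    """Apply transform to each line; rewrite the file only if something changed."""
    with open(file_path, "r", encoding="utf-8") as file:
        lines = file.readlines()

    updated_lines = [transform(line) for line in lines]
    changed = updated_lines != lines

    if changed:
        # Pin the encoding so the result does not depend on the locale.
        with open(file_path, "w", encoding="utf-8") as file:
            file.writelines(updated_lines)
    return changed


# Small demonstration on a temporary file with a non-ASCII character.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.yaml"
    sample.write_text("note: caf\u00e9\n", encoding="utf-8")
    first = rewrite_lines(sample, lambda line: line.replace("note", "title"))
    second = rewrite_lines(sample, lambda line: line.replace("note", "title"))
    content = sample.read_text(encoding="utf-8")
```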

tumido

Just a few nits. I haven't test-run the code, though. 🙂

tumido · 2024-10-15T14:59:53Z

eval/final/components.py

+        if gpu_count > 0:
+            command = [
+                sys.executable,
+                "-m",
+                "vllm.entrypoints.openai.api_server",
+                "--model",
+                model_path,
+                "--tensor-parallel-size",
+                str(gpu_count),
+            ]
+        else:
+            command = [
+                sys.executable,
+                "-m",
+                "vllm.entrypoints.openai.api_server",
+                "--model",
+                model_path,
+            ]


Nit: this could be made less repetitive.

Suggested change

-        if gpu_count > 0:
-            command = [
-                sys.executable,
-                "-m",
-                "vllm.entrypoints.openai.api_server",
-                "--model",
-                model_path,
-                "--tensor-parallel-size",
-                str(gpu_count),
-            ]
-        else:
-            command = [
-                sys.executable,
-                "-m",
-                "vllm.entrypoints.openai.api_server",
-                "--model",
-                model_path,
-            ]
+        command = [
+            sys.executable,
+            "-m",
+            "vllm.entrypoints.openai.api_server",
+            "--model",
+            model_path,
+        ]
+        if gpu_count > 0:
+            command += [
+                "--tensor-parallel-size",
+                str(gpu_count),
+            ]
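
As a standalone illustration of the suggestion, the same logic can be wrapped in a small function so both branches are easy to check. This is a minimal sketch: `build_vllm_command` is a hypothetical name, and the PR itself builds the list inline rather than through a helper.

```python
import sys
from typing import List


def build_vllm_command(model_path: str, gpu_count: int) -> List[str]:
    """Build the vLLM API server command line (hypothetical helper)."""
    # Arguments shared by the CPU and GPU cases.
    command = [
        sys.executable,
        "-m",
        "vllm.entrypoints.openai.api_server",
        "--model",
        model_path,
    ]
    # Only request tensor parallelism when GPUs are available.
    if gpu_count > 0:
        command += ["--tensor-parallel-size", str(gpu_count)]
    return command
```

Building the shared prefix once means a future flag only has to be added in one place.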


tumido

/lgtm
/approve

leseb force-pushed the final-eval branch 3 times, most recently from 5b9f7f2 to 832f13a Compare October 11, 2024 07:38

MichaelClifford force-pushed the final-eval branch from 832f13a to 8122f5b Compare October 11, 2024 12:24

sallyom force-pushed the final-eval branch from a9ad617 to adaf720 Compare October 11, 2024 17:59

MichaelClifford force-pushed the final-eval branch from adaf720 to 891f726 Compare October 11, 2024 18:05

sallyom force-pushed the final-eval branch from e824cf8 to 3e573b2 Compare October 11, 2024 18:44

sallyom reviewed Oct 11, 2024

sallyom force-pushed the final-eval branch 2 times, most recently from 55dfafd to 64c9224 Compare October 11, 2024 21:27

leseb force-pushed the final-eval branch from 64c9224 to 424f7aa Compare October 14, 2024 07:43

leseb marked this pull request as ready for review October 14, 2024 07:44

leseb force-pushed the final-eval branch 5 times, most recently from bc44932 to 6aa2b1b Compare October 14, 2024 21:04

cooktheryan requested review from sallyom, MichaelClifford and tumido October 15, 2024 02:58

leseb force-pushed the final-eval branch from 6aa2b1b to dd4cf8f Compare October 15, 2024 14:41

tumido reviewed Oct 15, 2024

leseb and others added 8 commits October 15, 2024 17:16


leseb force-pushed the final-eval branch from dd4cf8f to 0885fa5 Compare October 15, 2024 15:18

leseb requested a review from tumido October 15, 2024 15:18

tumido approved these changes Oct 15, 2024

leseb merged commit 3784007 into opendatahub-io:main Oct 15, 2024
1 check passed

leseb deleted the final-eval branch October 15, 2024 15:23
