Meet Judge Judy, she is your AI powered SME #985

epugh · 2024-03-21T13:04:01Z

Description

Work on enabling an LLM-powered judge to help your users out.

Motivation and Context

Human rating is expensive and hard to scale.

Change user.prompt to user.system_prompt
Add tests that mock up openai calls
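One possible way to test the judge without hitting OpenAI is to inject a stand-in for the HTTP call. This is a minimal sketch, not the code in this PR: the `FakeableJudge` class, the `judgment` and `explanation` JSON keys, and the canned response are all hypothetical, and the 0-3 range is assumed from the "Update the range of judgments to 0-3" commit.

```ruby
require 'json'

# Hypothetical judge: accepts any callable standing in for the call to OpenAI.
class FakeableJudge
  VALID_RANGE = (0..3) # assumption, taken from the "0-3" commit in this PR

  def initialize(client)
    @client = client
  end

  # Sends the messages through the injected client and parses the JSON reply.
  def judge(messages)
    response = @client.call(messages)
    content = response.dig('choices', 0, 'message', 'content')
    parsed = JSON.parse(content)
    rating = Integer(parsed.fetch('judgment'))
    raise ArgumentError, "judgment #{rating} outside #{VALID_RANGE}" unless VALID_RANGE.cover?(rating)

    { judgment: rating, explanation: parsed['explanation'] }
  end
end

# Canned reply shaped like a chat completions response; no network involved.
stub_client = lambda do |_messages|
  { 'choices' => [{ 'message' => { 'content' => '{"judgment": 2, "explanation": "Partially relevant"}' } }] }
end

result = FakeableJudge.new(stub_client).judge([{ role: 'user', content: 'Query: star wars' }])
```

Passing the client in as a callable keeps the test free of HTTP stubbing libraries; a real suite might prefer to stub at the HTTP layer instead.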

epugh · 2025-01-14T23:01:45Z

Actually ran some OpenAI based LLM as a judge tests successfully!

shrinking image
Integrate LlmService
Better label for making judgements
Look at unauthorized use case
typo!

Be clearer that there is a system prompt and a separate "user prompt" that is generated at run time.
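To illustrate the distinction, here is a minimal sketch of how the two prompts could be combined into one chat request. The `build_messages` helper, the prompt wording, and the document fields are hypothetical and not taken from this PR; only the `system`/`user` role split follows the standard chat message format.

```ruby
require 'json'

# Hypothetical stored prompt; the wording is illustrative only.
SYSTEM_PROMPT = 'You are a search relevance judge. Reply only with JSON.'

# Pairs the stored system prompt with a user prompt that is generated
# at run time from the query/document pair being judged.
def build_messages(system_prompt, query_text, doc_fields)
  user_prompt = "Query: #{query_text}\nDocument: #{JSON.generate(doc_fields)}"
  [
    { role: 'system', content: system_prompt },
    { role: 'user', content: user_prompt }
  ]
end

messages = build_messages(SYSTEM_PROMPT, 'star wars', { 'title' => 'Star Wars: A New Hope' })
```

The system prompt stays the same across every pair a judge rates, while the user prompt changes with each query/document pair.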

epugh added 23 commits March 17, 2024 18:44

copy pasta error

f44743a

have a bare bones judge judy

eecfdf6

allow ratings from case to make it to a book...

5b24171

testing of rating in a case ending up in associated book!

1f3ae61

making progress on integrating judge judy

84f39f8

Merge branch 'main' into judge_judy

110cb97

fix typo

ec412fa

Merge branch 'main' into judge_judy

b2a2533

some progress, so much more

6114688

Merge branch 'main' into judge_judy

ef1766d

oops

097a365

Missed Ahoy in merge

9676e5f

model causing an issue, is this all right?

58742d3

Sidekiq is gone!

3e7d65b

Clean up the basic integration test!

627f8c0

Temporary work around.

47ca507

Merge branch 'main' into judge_judy

5e630a3

Fix test by backing out the tracking... Sigh

6459a10

Add/remove AI Judges

4a7c4ec

tweak test formatting

65647f3

Now can run on demand a judge

7eeb274

Gem was released!

3289339

need the individual judge

bc9fc43

epugh temporarily deployed to quepid-pr-985 January 7, 2025 14:51 Inactive

epugh added 2 commits January 7, 2025 10:54

Merge branch 'main' into judge_judy

ca12008

temp add

b391909

epugh temporarily deployed to quepid-pr-985 January 7, 2025 19:58 Inactive

epugh temporarily deployed to quepid-pr-985 January 7, 2025 20:11 Inactive

epugh added 2 commits January 7, 2025 15:11

tweaks

65aa836

Merge branch 'main' into judge_judy

8d32d93

epugh and others added 6 commits January 14, 2025 10:05

Better support for doing some or ALL pairs

ad827ad

speed up delete

15df5e5

Updated the prompt to reinforce JSON output

f9490ac

Fix prompt controller

6db98e1

First test

b6045ef

Update the range of judgments to 0-3

c0008ec

Don't need patch

2ebbc29

epugh force-pushed the judge_judy branch from 8454014 to 2ebbc29 Compare January 15, 2025 10:47

epugh temporarily deployed to quepid-pr-985 January 15, 2025 10:54 Inactive

epugh added 2 commits January 15, 2025 06:10

Add in unit test for job.

62bb3f0

Update flow to use Fixture

e935723

epugh temporarily deployed to quepid-pr-985 January 15, 2025 11:16 Inactive

Rename user.prompt to user.system_prompt

12802ca

epugh temporarily deployed to quepid-pr-985 January 15, 2025 11:29 Inactive

epugh added 2 commits January 15, 2025 09:26

Nicer label

145ad22

Unused variable

6d98540

epugh temporarily deployed to quepid-pr-985 January 15, 2025 14:26 Inactive

epugh added 2 commits January 15, 2025 10:51

Use bin/jobs to see if that deals with intermittent constantize error

dbbfeb6

Humans suck at updating things. Should be 3 everywhere.

f30e6f7

epugh temporarily deployed to quepid-pr-985 January 15, 2025 15:51 Inactive

epugh temporarily deployed to quepid-pr-985 January 15, 2025 15:52 Inactive

Use same settings as someone online...

7ccc6ec

epugh temporarily deployed to quepid-pr-985 January 15, 2025 16:04 Inactive

Can we have less config?

7c04c7c

epugh temporarily deployed to quepid-pr-985 January 15, 2025 16:14 Inactive

Keep fewer message, get smaller table

db478e9

epugh temporarily deployed to quepid-pr-985 January 15, 2025 16:19 Inactive

epugh merged commit 3e38c65 into main Jan 15, 2025
4 of 5 checks passed

epugh mentioned this pull request Jan 15, 2025

add judge judy o19s/quepid-jupyterlite#6

Closed
