-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Evaluation data of rule_qa for GPT4, GPT3.5, and Claude #36
Comments
The answers are available here: https://huggingface.co/datasets/nguha/legalbench/viewer/rule_qa/test. Is this what you're looking for? |
Thanks for your response. I meant the evaluation of rule-based application: "Rule-application tasks were evaluated manually by a law-trained individual, who analyzed LLM responses for both correctness and analysis" |
Ah sorry I misunderstood.
|
Thanks for the answer. Do you plan to release that human judgement for the evaluation? |
Following up on this... |
Hi,
Thanks for the cool resource. According to the publication, " rule_qa was also manually evaluated by a law-trained individual". Do you plan to release the annotations for this evaluation? Thanks
The text was updated successfully, but these errors were encountered: