Ensuring Consistent Critic Selection Mechanism from Multiple Response Options in Role-Playing Scenarios for Non-OpenAI Models #985
Closed
coolbeevip
started this conversation in
General
Replies: 1 comment
-
Thanks, @coolbeevip! This is a fantastic idea—extending support for models that don't handle multiple responses at once. I raised an issue here: #996. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
In the Role-Playing critic scenario, the user agent receives multiple response options through the n parameter of the OpenAI API, from which the critic selects one to guide the assistant agent. (If my understanding is incorrect, please feel free to point it out!)
However, this mechanism only works well with the OpenAI API; a brief investigation shows that other models do not support the n parameter. Should we ensure consistency in this behavior when using non-OpenAI models through engineering methods? (For example, by looping n times and concatenating the results.)
Beta Was this translation helpful? Give feedback.
All reactions