Ensuring Consistent Critic Selection Mechanism from Multiple Response Options in Role-Playing Scenarios for Non-OpenAI Models #985

coolbeevip · 2024-09-24T04:09:37Z

coolbeevip
Sep 24, 2024

In the Role-Playing critic scenario, the user agent receives multiple response options through the n parameter of the OpenAI API, from which the critic selects one to guide the assistant agent. (If my understanding is incorrect, please feel free to point it out!)

However, this mechanism only works well with the OpenAI API; a brief investigation shows that other models do not support the n parameter. Should we ensure consistency in this behavior when using non-OpenAI models through engineering methods? (For example, by looping n times and concatenating the results.)

Wendong-Fan · 2024-09-28T13:41:13Z

Wendong-Fan
Sep 28, 2024
Maintainer

Thanks, @coolbeevip! This is a fantastic idea—extending support for models that don't handle multiple responses at once. I raised an issue here: #996.
If you're interested in contributing, that would be also awesome!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

camel-ai.org

Ensuring Consistent Critic Selection Mechanism from Multiple Response Options in Role-Playing Scenarios for Non-OpenAI Models #985

{{title}}

Replies: 1 comment

{{title}}

Select a reply

camel-ai.org

Ensuring Consistent Critic Selection Mechanism from Multiple Response Options in Role-Playing Scenarios for Non-OpenAI Models #985

coolbeevip Sep 24, 2024

Replies: 1 comment

Wendong-Fan Sep 28, 2024 Maintainer

coolbeevip
Sep 24, 2024

Wendong-Fan
Sep 28, 2024
Maintainer