We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
例行检查
功能描述 gemini-2.0-flash-thinking-exp 模型非流调用时会返回两个 text ,一个是思考过程,一个是回复内容,当前项目在非流调用的时候会丢失第二个 text:
只有英文的思考过程:
实际包含两个text:
可以参考:Calcium-Ion/new-api@9a54b34
The text was updated successfully, but these errors were encountered:
这个不好弄。因为流式传输的接口,没有将思考和回答区分的很明确 流式传输不是每一个chunk的parts都有两个部分 只有在思考和回答的交界处“有可能”会出现两个部分 最好的做法是thinking模型不支持流式,再按你说的区分
Sorry, something went wrong.
你理解错我的意思了,我不是要区分流式里的思考和回复,是要防止非流的回复被吞了
哦我没看到你强调非流了 非流确实有清晰的两个部分 流很难处理
@songquanpeng @mxdlzg
抱歉打扰了,实测 gemini 类型的流式在思考和回复的交界处有两个 text ,经查当前的处理会导致“交界处”丢失一个 text 字段(回复开始的第一个 text)从而表现为“回答吞字”
oneapi 流式调用,“回答吞字”:
预期的流式调用:
Successfully merging a pull request may close this issue.
例行检查
功能描述
gemini-2.0-flash-thinking-exp 模型非流调用时会返回两个 text ,一个是思考过程,一个是回复内容,当前项目在非流调用的时候会丢失第二个 text:
只有英文的思考过程:
实际包含两个text:
可以参考:Calcium-Ion/new-api@9a54b34
The text was updated successfully, but these errors were encountered: