Skip to content

How to Return Intermediate Progress of Nested Reasoning? #10333

Answered by westey-m
ymuichiro asked this question in Q&A

You must be logged in to vote

Narrowing down the set of available functions for function calling using RAG might help to increase accuracy. See this sample on how to achieve that:
https://github.com/microsoft/semantic-kernel/blob/main/dotnet/samples/Concepts/Optimization/PluginSelectionWithFilters.cs#L104

This sounds like a complex system though, so multiple strategies may have to be considered to speed things up, e.g:

  1. Using smaller, and therefore also faster, models for tasks that don't require the full power of the larger models, to reduce latency, e.g. GPT-4o-mini instead of GPT-4o
  2. Use parallel function invocation / calls, see https://learn.microsoft.com/en-us/semantic-kernel/concepts/ai-services/chat-completion/f…

Replies: 1 comment 2 replies

You must be logged in to vote
2 replies
@westey-m

Answer selected by sophialagerkranspandey
@ymuichiro

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
python Pull requests for the Python Semantic Kernel
3 participants