
Fix AssistantAgent Tool Call Behavior #4602

Merged
merged 24 commits into from
Dec 10, 2024
Conversation

@husseinmozannar (Contributor) commented Dec 7, 2024

Resolves #4514

  • Limit 1 tool call per each on_messages invocation, by default return the tool call result as response.
  • Introduce reflect_on_tool_use to optionally reformat the tool call result using a model inference.
  • Beef up documentation.
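The single-iteration behavior in the first two bullets can be sketched in plain Python. This is a stand-in illustration, not the actual autogen-agentchat implementation; the message types and the `reflect` hook here are simplified stand-ins.

```python
from dataclasses import dataclass
from typing import Callable, List, Union

# Stand-in message types (hypothetical; the real library defines richer classes).
@dataclass
class TextMessage:
    content: str

@dataclass
class ToolCall:
    name: str
    arguments: dict

def on_messages_once(
    model_reply: Union[str, List[ToolCall]],
    run_tool: Callable[[ToolCall], str],
    reflect_on_tool_use: bool = False,
    reflect: Callable[[List[str]], str] = lambda results: " ".join(results),
) -> TextMessage:
    """One LLM call, at most one tool-call iteration -- never a loop."""
    if isinstance(model_reply, str):
        # No tool call: the model text is returned directly.
        return TextMessage(model_reply)
    # Tool calls: execute them once, then either summarize or reflect.
    results = [run_tool(call) for call in model_reply]
    if reflect_on_tool_use:
        # Second model inference over the tool results (stubbed here).
        return TextMessage(reflect(results))
    # Default: the raw tool results become the response text.
    return TextMessage("\n".join(results))

# Example: a single weather tool call, default path (no reflection).
msg = on_messages_once(
    [ToolCall("get_weather", {"city": "Paris"})],
    run_tool=lambda c: f"Sunny in {c.arguments['city']}",
)
print(msg.content)  # -> Sunny in Paris
```

The key contrast with the old behavior is that the tool results end the iteration instead of being fed back into another LLM round automatically.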

@ekzhu (Collaborator) left a comment

I think this warrants unit tests.

We can split disabling parallel tool calling for handoff into a separate PR if the requirements for that are still under-specified.

@husseinmozannar requested a review from ekzhu, December 8, 2024 01:49
@husseinmozannar (Contributor, Author)

I added unit tests.

I think this PR is now just for fixing the repeated tool calls; I will leave handoffs for another PR. I think it is crucial to merge this first before the other work.

Review thread on python/uv.lock (outdated, resolved)
@husseinmozannar (Contributor, Author)
PR on hold.

Ideally, AssistantAgent makes only one LLM call with tools (if any). The return type of the message should depend on the tool call and allow other agents to easily convert that message to a string. Other agents and teams should be updated after AssistantAgent is updated.

@ekzhu (Collaborator) commented Dec 8, 2024

I have updated the code so that we always limit to just one tool call iteration. I have updated the examples too. @husseinmozannar @victordibia please verify that the behavior works in other scenarios.

@husseinmozannar (Contributor, Author)
Behavior looks fine in a few sample teams I tried (multiple AssistantAgents in a round robin with varying access to tools), and I tried the video surfer as well.

@ekzhu ekzhu merged commit 871a83f into main Dec 10, 2024
45 checks passed
@ekzhu ekzhu deleted the assistant_Agent_tools branch December 10, 2024 03:03
ekzhu pushed a commit that referenced this pull request Dec 10, 2024
* 1 tool call iteration default

* handoff first

* return_only_response

* add and remove tools

* print out tool calls

* pass checks

* fix issues

* add test

* add unit tests

* remove extra print

* Update python/packages/autogen-agentchat/src/autogen_agentchat/agents/_assistant_agent.py

Co-authored-by: Eric Zhu <[email protected]>

* documentation and none max_tools_calls

* Always limit # tool call to 1

* Update notebooks for the changing behavior of assistant agent.

* Merge branch 'main' into assistant_Agent_tools

* add reflect_on_tool_use parameter to format the tool call result

* wip

* wip

* fix pyright

* Add unit tests

* Merge remote-tracking branch 'origin/main' into assistant_Agent_tools

* Update with custom formatting of tool call summary

* format

* Merge branch 'main' into assistant_Agent_tools
@jspv (Contributor) commented Dec 18, 2024

Hi @husseinmozannar, I'm trying to understand the full intent of this change, as all tool results are now returned in a TextMessage (vs. only in a ToolCallResultMessage), which makes it difficult to differentiate tool call results from LLM responses.

  • "Limit 1 tool call per each on_messages invocation, by default return the tool call result as response."

As per "by default", was this intended to be optional behavior? It was good to be able to easily differentiate tool call responses to the client from the client's response to the caller. Thanks.

@husseinmozannar (Contributor, Author)

Hey!

There was a problem with the previous version of AssistantAgent.

When GPT-4 decides to call a tool, it returns only the tool call and no other response.

The previous version of AssistantAgent called the LLM as many times as needed until the final LLM response was not a tool call, i.e. a plain string.
Now we fix that issue as follows:

Referencing the API doc

  • If the model returns no tool call, then the response is immediately returned as a :class:`~autogen_agentchat.messages.TextMessage` in :attr:`~autogen_agentchat.base.Response.chat_message`.

  • When the model returns tool calls, they will be executed right away:

  1. When reflect_on_tool_use is False (the default), the tool call results are returned as a :class:`~autogen_agentchat.messages.TextMessage` in :attr:`~autogen_agentchat.base.Response.chat_message`. tool_call_summary_format can be used to customize the tool call summary.
    We still yield the ToolCallMessage and ToolCallResultMessage prior to the final TextMessage.

  2. When reflect_on_tool_use is True, another model inference is made using the tool calls and results, and the text response is returned as a :class:`~autogen_agentchat.messages.TextMessage` in :attr:`~autogen_agentchat.base.Response.chat_message`.
    We still yield the ToolCallMessage and ToolCallResultMessage prior to the final TextMessage.

Moreover, the inner messages accompanying the final response will contain the tool calls and results.

Does this make more sense now?
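A rough sketch of the default summary path (reflect_on_tool_use is False): the format string is applied per tool call and the results are joined into one message. The placeholder names ({tool_name}, {arguments}, {result}) and the "{result}" default are assumptions taken from this discussion; check the AssistantAgent reference for the exact API.

```python
# Sketch: building the tool call summary text from a format string.
# Placeholder names are assumed from this thread, not a verified API surface.
def summarize_tool_calls(calls_and_results, tool_call_summary_format="{result}"):
    lines = [
        tool_call_summary_format.format(
            tool_name=name, arguments=args, result=result
        )
        for name, args, result in calls_and_results
    ]
    return "\n".join(lines)

# Default format: the summary is just the raw results, one per line.
summary = summarize_tool_calls(
    [("get_weather", {"city": "Paris"}, "Sunny, 22C")]
)
print(summary)  # -> Sunny, 22C

# A custom format makes the tool origin explicit in the summary text.
verbose = summarize_tool_calls(
    [("get_weather", {"city": "Paris"}, "Sunny, 22C")],
    tool_call_summary_format="{tool_name}({arguments}) -> {result}",
)
```

A custom format like the second example is one way for downstream consumers to recognize tool output by inspection of the text, though it does not change the message type.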

@jspv (Contributor) commented Dec 18, 2024

Thanks, I understand the intent. The challenge for me is that both tool call results and normal responses are now returned as :class:`~autogen_agentchat.messages.TextMessage` with no differentiating characteristics, making it difficult to tell a TextMessage from the agent apart from the additional tool call response (or summary) from the tool without tracking additional state (e.g. did I just receive a ToolCallResultMessage prior to this message? If so, the next TextMessage is the tool response, not from the LLM).

I think it would be better not to mix the types. There is already ToolCallResultMessage, which was previously the unambiguous way to identify tool results; now tool results are coming in two forms (ToolCallResultMessage and another copy in a different format as TextMessage), and it now requires extra logic to determine whether a TextMessage holds the tool results or came from the LLM.

Can we create a clear message type for the new messages (e.g. ToolCallResultSummaryMessage) or some other disambiguating way in the message to determine its type?
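The kind of disambiguation being asked for could be sketched with a distinct subclass, so consumers can branch on the message type directly. The class name below follows the commenter's suggestion; it is hypothetical, not a released autogen-agentchat class at the time of this thread.

```python
# Hypothetical: a distinct subclass marks tool-summary text, so consumers can
# use isinstance() instead of tracking whether a ToolCallResultMessage just
# arrived before this message.
class TextMessage:
    def __init__(self, content: str):
        self.content = content

class ToolCallResultSummaryMessage(TextMessage):
    """A TextMessage whose content came from tool results, not the LLM."""

def is_tool_summary(msg: TextMessage) -> bool:
    return isinstance(msg, ToolCallResultSummaryMessage)

print(is_tool_summary(ToolCallResultSummaryMessage("Sunny, 22C")))  # -> True
print(is_tool_summary(TextMessage("Here is the revised draft.")))   # -> False
```

Because the subclass still is-a TextMessage, existing consumers that only handle TextMessage keep working unchanged.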

@ekzhu (Collaborator) commented Dec 18, 2024

@husseinmozannar I think it's a valid point and we should create a new type of chat message for this.

It can be important for orchestration or termination condition. When inner messages are not emitted, we need to rely on typing to figure out what happened.

@jspv (Contributor) commented Dec 18, 2024

@husseinmozannar, I think I'm finding more side effects of this change. Using a simple RoundRobin with two agents:

  • writer, writes papers and has access to a web search tool
  • editor, reviews the paper, provides suggestions, and terminates the team when the paper is approved.

With the tool results now being returned as a TextMessage, I'm seeing the speaker move from writer->editor immediately after receiving the TextMessage tool response; so rather than the writer receiving the tool reply and using it to write the paper, the editor takes over prematurely.

Before:

task->writer runs tool -> writer writes paper -> editor provides feedback -> writer <-> editor ... -> editor approves

Now:
task-> writer runs tool -> editor provides feedback on tool results -> writer <-> editor ...

  • the writer never gets to act on the tool results.

@ekzhu (Collaborator) commented Dec 18, 2024

We observed this effect as well. One way to fix this is to set reflect_on_tool_use=True when you create the writer.

Alternatively, set allow_repeated_speaker=True in the selector group chat. What you saw happens because the selector by default moves on from the current speaker.
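The first fix can be illustrated with a toy turn function: with reflection off, the writer's visible turn ends with raw tool output, which is what the editor then reacts to; with reflection on, a second model pass turns the tool results into the writer's actual contribution before the speaker changes. This is stand-in code, not the RoundRobinGroupChat implementation.

```python
# Toy sketch of why reflect_on_tool_use=True fixes the premature handoff.
# write_with stands in for the writer's second model inference.
def writer_turn(tool_result: str, reflect_on_tool_use: bool,
                write_with=lambda r: f"Draft paper using: {r}") -> str:
    if reflect_on_tool_use:
        # Second inference: the writer acts on the tool results itself.
        return write_with(tool_result)
    # Default: the raw tool output ends the turn, and the next speaker sees it.
    return tool_result

search = "3 sources on climate policy"
print(writer_turn(search, reflect_on_tool_use=False))  # -> 3 sources on climate policy
print(writer_turn(search, reflect_on_tool_use=True))   # -> Draft paper using: 3 sources on climate policy
```

Either way the tool results are preserved; the difference is whose words occupy the final chat message of the writer's turn.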

Successfully merging this pull request may close these issues.

Update AssistantAgent tool call behavior