Link to the CN version: Here
This was also my solution for the Alibaba Mathematical Competition (AI Challenge). I was surprised that such a simple approach performed so well compared to the complex multi-agent systems other teams used.
Let me share my general thoughts:
Borrowing from the idea of debate, I have multiple LLMs engage in several rounds of "self-questioning": they iteratively refine the current problem-solving process and finally select the best solution as the conclusion.
The system contains two roles of model: the main model (student) and the evaluation model (teacher). The overall process is roughly as follows:
- The main model (such as GPT-4 Turbo or Claude-3 Opus) answers the math problem step by step, following detailed instructions:
  - List the relevant knowledge points
  - Write down the initial line of reasoning
  - Expand the full calculation process (without skipping steps)
  - Give the final result
- The main model's answer is submitted to the evaluation model for review. The evaluation model also works step by step:
  - Skim the answer for obvious loopholes and give initial feedback
  - Carefully check the calculation process for possible errors
  - Evaluate whether the reasoning logic is rigorous
  - Summarize all feedback
- The evaluation model's feedback is returned to the main model, which uses it to revise the answer.
- Repeat the above process for several rounds (e.g., five) to keep improving the answer.
- Run this iteration with two different model pairs independently, obtaining two revised answers.
- Use GPT-4 Turbo to score and compare the two answers on calculation, logic, clarity, and so on, and output the better one (a minimal code sketch of the whole loop follows this list).
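To make the loop concrete, here is a minimal Python sketch. The `chat` helper, the prompt wording, and the exact student/teacher pairings are my assumptions for illustration; swap in whatever LLM client and model names you actually use.

```python
def chat(model: str, prompt: str) -> str:
    """Placeholder for whatever LLM client is in use (an OpenAI-style
    chat-completions call, for example); swap in the real API here."""
    raise NotImplementedError

STUDENT_PROMPT = """Solve the math problem step by step:
1. List the relevant knowledge points.
2. Write down your initial line of reasoning.
3. Expand the full calculation process, without skipping steps.
4. State the final result.

Problem: {problem}
{feedback_section}"""

TEACHER_PROMPT = """Review the solution step by step:
1. Skim it for obvious loopholes and give initial feedback.
2. Carefully check every calculation for errors.
3. Judge whether the reasoning logic is rigorous.
4. Summarize all of your feedback.

Problem: {problem}

Solution:
{solution}"""

def refine(problem: str, student: str, teacher: str, rounds: int = 5) -> str:
    """Run one student/teacher pair for several refinement rounds."""
    answer, feedback = "", ""
    for _ in range(rounds):
        feedback_section = (
            f"Reviewer feedback on your previous attempt:\n{feedback}"
            if feedback else ""
        )
        answer = chat(student, STUDENT_PROMPT.format(
            problem=problem, feedback_section=feedback_section))
        feedback = chat(teacher, TEACHER_PROMPT.format(
            problem=problem, solution=answer))
    return answer

def solve(problem: str) -> str:
    """Two independent model pairs, then a final judge picks the better answer."""
    a = refine(problem, student="gpt-4-turbo", teacher="claude-3-opus")
    b = refine(problem, student="claude-3-opus", teacher="gpt-4-turbo")
    judge = (
        "Score the two candidate solutions on calculation accuracy, "
        "logical rigor, and clarity, then output the better one.\n\n"
        f"Problem: {problem}\n\nSolution A:\n{a}\n\nSolution B:\n{b}"
    )
    return chat("gpt-4-turbo", judge)
```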
Of course, this approach still has plenty of room for improvement. I also exchanged ideas with members of two other teams; here is a short summary and reflection:
- Whatever form the agent solution takes, the information passed between sub-agents, and the reasoning built on it, must not go wrong. To guarantee this, I believe a global "supervisor" is needed to monitor the agent pipeline in real time and intervene as soon as a problem appears, so that an error at a single node cannot be amplified downstream (see the sketch after this item).
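As a concrete illustration of the supervisor idea, here is a small Python sketch. The `validate` and `repair` callables are stand-ins: in a real system they might be rule-based checks or an LLM call that audits each hand-off.

```python
from typing import Callable, Optional

def supervised_handoff(
    message: str,
    validate: Callable[[str], Optional[str]],  # None = OK, else a problem description
    repair: Callable[[str, str], str],         # asks the sending agent to fix its output
    max_retries: int = 2,
) -> str:
    """Let a message pass between sub-agents only once the supervisor approves it."""
    for _ in range(max_retries + 1):
        issue = validate(message)
        if issue is None:
            return message  # approved: safe to hand to the next agent
        # Intervene immediately so a single faulty node cannot amplify the error.
        message = repair(message, issue)
    raise RuntimeError("Supervisor halted the hand-off after repeated failures")
```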
- If, during the concept-sorting stage, the system used RAG to pull related knowledge points from a preset knowledge base, and delegated calculation to external tools, the results would almost certainly improve on a pure-LLM solution (see the sketch after this item).
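A rough sketch of both upgrades: ranking preset knowledge points by embedding similarity (the `embed` function and the knowledge base contents are hypothetical) and delegating arithmetic to SymPy, a real computer algebra library, so that no calculation step depends on the LLM.

```python
from typing import Callable, Sequence

import sympy

def dot(u: Sequence[float], v: Sequence[float]) -> float:
    return sum(a * b for a, b in zip(u, v))

def retrieve_knowledge(
    problem: str,
    knowledge_base: list[str],
    embed: Callable[[str], Sequence[float]],  # hypothetical embedding function
    top_k: int = 3,
) -> list[str]:
    """Naive RAG step: rank preset knowledge points by similarity to the problem."""
    q = embed(problem)
    ranked = sorted(knowledge_base, key=lambda doc: -dot(q, embed(doc)))
    return ranked[:top_k]

def exact_eval(expression: str) -> str:
    """Delegate a calculation step to SymPy so the arithmetic is exact."""
    return str(sympy.simplify(sympy.sympify(expression)))

# e.g. exact_eval("(3/7 + 2/5)**2") -> "841/1225", with no LLM arithmetic involved
```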
- In complex tasks, agent or agent-like architectures appear to be more important and more effective.
- Few models today, open-source or closed-source, show outstanding ability in math and reasoning; model capabilities in both areas still need to be strengthened.