-
We are the first to formalize ZS-ICL as a planning problem, which more closely matches real-world scenarios.
-
We propose a novel demonstration-aware MCTS for ZS-ICL that searches for problem-solving trajectories more effectively and efficiently.
-
Extensive experiments demonstrate the effectiveness of our approach in both in-domain and cross-domain scenarios; it even outperforms ICL with human-annotated demonstrations.
- python == 3.11.9
- pytorch == 2.3.1
- transformers == 4.42.4
- accelerate == 0.33.0
- openai == 1.35.14
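Assuming a pip-based environment, the pinned versions above can be installed as follows (note that the `pytorch` requirement corresponds to the `torch` package on pip; the exact install command may differ depending on your CUDA version):

```shell
# Install the pinned dependencies listed above.
# "pytorch" is published on pip under the package name "torch".
pip install torch==2.3.1 transformers==4.42.4 accelerate==0.33.0 openai==1.35.14
```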
Download the models from Hugging Face, then open `src/utils.py` and update the directory paths on lines 34-41.
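A run combining the arguments below might look like the following (the entry script name `src/main.py` and the concrete flag values are illustrative assumptions, not taken from the repository):

```shell
# Hypothetical invocation -- script name and values are examples only;
# the flags mirror the argument list below.
python src/main.py \
    --model Llama-3-8B-Instruct \
    --method Search \
    --dataset bbh \
    --search_strategy MCTS \
    --expansion_num 4 \
    --iterative_num 16 \
    --uct demo_uct
```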
- model: the large language model to use
- method: the zero-shot in-context learning method: ZS, FS, SelfICL, DAIL, Search
- dataset: evaluation dataset: bbh, bbh-mini, mmlu
- shot_num: the number of shots for in-context learning
- select_strategy: the demonstration selection strategy for in-context learning
- diverse_candidate: the number of retrieved candidates $k_d$
- search_strategy: the search strategy of zero-shot in-context learning: Greedy, MC, Beam_Search, MCTS
- beam_search_expand_num: the expansion number of the beam search
- select_num: the number of nodes retained by the beam search
- expansion_num: the expansion number of MCTS $k_a$
- iterative_num: the iteration number of MCTS
- uct: the type of UCT in the selection phase: random, uct, demo_uct
- w_exp: $w_a$
- c_exp: $w_Q$
- use_cache: whether to use the cache strategy
- epsilon: $\epsilon$
- aggregation: whether to use the aggregation strategy
- calibration: whether to use the calibration strategy
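For reference, the `uct` argument chooses among node-selection rules. Below is a minimal sketch of the standard UCB1-style UCT rule that these options build on; this is not the paper's demonstration-aware `demo_uct` variant (whose exact form is defined in the paper), and the function names and the `w_exp` default here are illustrative:

```python
import math

def uct_score(value_sum, visits, parent_visits, w_exp=1.0):
    """Standard UCT score of a child node; `w_exp` is the exploration weight."""
    if visits == 0:
        return float("inf")  # try every unvisited child once
    exploitation = value_sum / visits
    exploration = w_exp * math.sqrt(math.log(parent_visits) / visits)
    return exploitation + exploration

def select_child(children, parent_visits, w_exp=1.0):
    """children: list of (value_sum, visit_count); returns the index of the child to descend into."""
    return max(range(len(children)),
               key=lambda i: uct_score(*children[i], parent_visits, w_exp))
```

The selection phase repeatedly applies `select_child` from the root until it reaches a node that still has unexpanded children.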