@tAnGjIa520 tAnGjIa520 commented Dec 1, 2025

This PR introduces a batch-optimized AlphaZero MCTS implementation in C++, achieving a 2x
speedup over the standard sequential version.

Batch MCTS Inference: The core improvement is the get_next_actions_batch() function (lines
207-415 in mcts_alphazero.cpp), which processes multiple game states simultaneously. Instead of running MCTS
simulations sequentially for each environment, we now:

  1. Batch Root Expansion: Initialize multiple root nodes and expand them with a single batched neural network call
    (policy_value_func_batch), reducing GPU overhead
  2. Parallel Simulation Phase: Run simulations for all environments simultaneously (lines 280-369), collecting the leaf
    nodes that need expansion
  3. Batch Leaf Expansion: Group all leaf nodes from unfinished games and perform batched inference via
    _batch_expand_leaf_nodes() (lines 162-205), minimizing individual network calls
  4. Legal Action Caching: Cache legal actions at the environment level to avoid repeated Python calls during child
    selection, significantly reducing Python-C++ interface overhead
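The collect-then-expand loop in steps 2-3 above can be sketched as follows. This is a minimal, self-contained illustration, not the PR's actual code: `Node`, `select_path`, `evaluate_batch`, and `run_batch_mcts` are hypothetical names, and `evaluate_batch` is a uniform-policy stub standing in for the single batched network call that `policy_value_func_batch` would perform over all collected leaves.

```cpp
#include <cassert>
#include <cmath>
#include <memory>
#include <vector>

// Hypothetical minimal MCTS node; the PR's real node type will differ.
struct Node {
    double prior = 1.0, value_sum = 0.0;
    int visits = 0;
    std::vector<std::unique_ptr<Node>> children;
    bool expanded() const { return !children.empty(); }
};

// Descend from the root to a leaf, picking the child with the highest
// PUCT-style score; return the whole path for backpropagation.
std::vector<Node*> select_path(Node* node) {
    std::vector<Node*> path{node};
    while (node->expanded()) {
        Node* best = nullptr;
        double best_score = -1e18;
        for (auto& c : node->children) {
            double q = c->visits ? c->value_sum / c->visits : 0.0;
            double u = c->prior * std::sqrt((double)node->visits) / (1.0 + c->visits);
            if (q + u > best_score) { best_score = q + u; best = c.get(); }
        }
        node = best;
        path.push_back(node);
    }
    return path;
}

// Stand-in for one batched network call: expand every collected leaf with a
// uniform policy and return a dummy value per leaf.
std::vector<double> evaluate_batch(const std::vector<Node*>& leaves, int n_actions) {
    std::vector<double> values;
    for (Node* leaf : leaves) {
        for (int a = 0; a < n_actions; ++a) {
            auto child = std::make_unique<Node>();
            child->prior = 1.0 / n_actions;
            leaf->children.push_back(std::move(child));
        }
        values.push_back(0.5);  // dummy leaf value
    }
    return values;
}

// One batched simulation round: collect one leaf per environment, expand all
// leaves with a single evaluate_batch call, then backpropagate along each path.
void run_batch_mcts(std::vector<Node>& roots, int n_sims, int n_actions) {
    for (int s = 0; s < n_sims; ++s) {
        std::vector<std::vector<Node*>> paths;
        std::vector<Node*> leaves;
        for (auto& root : roots) {
            paths.push_back(select_path(&root));
            leaves.push_back(paths.back().back());
        }
        std::vector<double> values = evaluate_batch(leaves, n_actions);  // one batched call
        for (size_t i = 0; i < paths.size(); ++i)
            for (Node* n : paths[i]) {
                n->visits += 1;
                n->value_sum += values[i];
            }
    }
}
```

Step 4 (legal-action caching) would additionally memoize each environment's legal moves so that the selection loop never crosses back into Python during child scoring.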


@puyuan1996 puyuan1996 added the enhancement New feature or request label Dec 4, 2025
  Document C++-Python env interaction as bottleneck and suggest C++ implementation.
  - Convert all Chinese comments to English
  - Add parameter documentation for batch MCTS calls
  - Follow Google Style documentation standards
@puyuan1996 puyuan1996 changed the title feature(tj): add batch alpha-zero feature(tj): optimize AlphaZero with batch inference support Dec 6, 2025
   - Document C++-Python interaction bottleneck in current architecture
   - Note env_cpp_ification as future optimization direction
   - Mark get_next_action as non-batch version with reference to batch alternative