
@mianmianboom

What does this PR do?

This PR adds support for selecting between Ray and openYuanrong (ray_adapter) as the distributed backend via the DISTRIBUTED_BACKEND environment variable.

Checklist Before Starting

  • Search for similar PRs. Query: distributed backend selection ray adapter
  • Format the PR title as [{modules}] {type}: {description} (This will be checked by the CI)
    • {modules} include fsdp, megatron, veomni, sglang, vllm, rollout, trainer, ci, training_utils, recipe, hardware, deployment, ray, worker, single_controller, misc, perf, model, algo, env, tool, ckpt, doc, data, cfg, reward
    • If this PR involves multiple modules, separate them with , like [megatron, fsdp, doc]
    • {type} is in feat, fix, refactor, chore, test
    • If this PR breaks any API (CLI arguments, config, function signature, etc.), add [BREAKING] to the beginning of the title.
    • Example: [BREAKING][fsdp, megatron] feat: dynamic batching

Test

Test commands:

# Test with default Ray backend
pytest tests/single_controller/test_ray_utils_on_cpu.py -v

# Test with openYuanrong backend
DISTRIBUTED_BACKEND=yr pytest tests/single_controller/test_ray_utils_on_cpu.py -v

API and Usage Example

Set the DISTRIBUTED_BACKEND environment variable to choose the backend:

  • DISTRIBUTED_BACKEND=ray or unset: Use the Ray backend (default)
  • DISTRIBUTED_BACKEND=yr or DISTRIBUTED_BACKEND=yuanrong: Use openYuanrong backend
# Use default Ray backend
bash examples/grpo_trainer/run_qwen2_5_7b_grpo_npu.sh

# Use openYuanrong backend (both 'yr' and 'yuanrong' are supported)
DISTRIBUTED_BACKEND=yr bash examples/grpo_trainer/run_qwen2_5_7b_grpo_npu.sh
# or
DISTRIBUTED_BACKEND=yuanrong bash examples/grpo_trainer/run_qwen2_5_7b_grpo_npu.sh

Design & Code Changes

Design Overview: To support the YuanRong distributed computing platform alongside Ray, we implemented a non-intrusive backend injection mechanism. Using Python's sys.modules interception, we dynamically redirect import ray to the YuanRong adapter (ray_adapter) at runtime based on an environment variable. This lets the existing codebase remain largely unchanged: it continues to import and use the ray namespace, while the underlying implementation is transparently switched to YuanRong when configured.
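
A minimal sketch of the injection idea described above, assuming a simple environment-variable check (the variable names and exact value handling here are illustrative, not the PR's actual code):

# Illustrative sketch of sys.modules interception; names other than
# DISTRIBUTED_BACKEND and ray_adapter are assumptions, not the PR's code.
import importlib
import os
import sys

_backend = os.getenv("DISTRIBUTED_BACKEND", "ray").lower()

if _backend in ("yr", "yuanrong"):
    # Redirect every subsequent `import ray` to the openYuanrong adapter.
    sys.modules["ray"] = importlib.import_module("ray_adapter")
else:
    # Default: use the standard Ray package.
    sys.modules["ray"] = importlib.import_module("ray")

Because the redirection only touches sys.modules, callers keep writing import ray and need no code changes.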

Specific Changes:

  1. New file: verl/utils/distributed_backend.py - Core backend selection module. It checks the DISTRIBUTED_BACKEND environment variable and injects either ray_adapter or the standard ray module into sys.modules['ray'].
  2. Modified files:
    • verl/__init__.py: Added import verl.utils.distributed_backend so that the backend injection logic runs immediately upon package initialization.
    • Test files: Updated the import order in 17 test files so that import ray occurs after the verl imports. This guarantees that the injection logic in verl/__init__.py runs before ray is loaded (see the sketch after this list).
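
For illustration, the required ordering in an updated test file looks roughly like this (the rest of the test code is omitted):

# Importing verl first executes the injection logic in verl/__init__.py,
# so the later `import ray` resolves to ray_adapter when
# DISTRIBUTED_BACKEND=yr (or yuanrong) is set.
import verl  # noqa: F401

import ray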

Checklist Before Submitting

@CLAassistant
Copy link

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.


Test User seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

