[Feature] Added OpenEnv environments by ParamThakkar123 · Pull Request #3470 · pytorch/rl

ParamThakkar123 · 2026-02-09T15:56:04Z

Description

Describe your changes in detail.

This PR adds OpenEnv environments to torchrl

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

This reverts commit 1f6f327.

pytorch-bot · 2026-02-09T15:56:10Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3470

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 15 New Failures, 3 Cancelled Jobs, 4 Unrelated Failures

As of commit f3eb268 with merge base 05212b8 ():

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
Workflow failed! Resource not accessible by integration
Generate documentation / build-docs (3.12, 12.8) / linux-job (gh)
RuntimeError: Command docker exec -t cf601e9481df813f77f1ddee01a5ce0c37ce3f9a1a894083097245e4cd448b68 /exec failed with exit code 2
Lint / python-source-and-configs / linux-job (gh)
RuntimeError: Command docker exec -t 28afdfca7af3746a02179bd38c7c72f4ba3b4b2112691e9d61bd1fd727bd278e /exec failed with exit code 1
Unit-tests on Linux / tests-cpu (3.10) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-cpu (3.11) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'
Unit-tests on Linux / tests-cpu (3.12) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-gpu (3.12, 13.0, 1) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-gpu (3.12, 13.0, 2) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-gpu (3.12, 13.0, 3) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-gpu-distributed (3.12, 13.0) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-optdeps (3.12, 13.0) / linux-job (gh)
test/test_objectives.py::TestDDPG::test_ddpg_prioritized_weights
Unit-tests on Linux / tests-stable-gpu (3.12, 13.0, 1) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-stable-gpu (3.12, 13.0, 2) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-stable-gpu (3.12, 13.0, 3) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?
Unit-tests on Linux / tests-stable-gpu-distributed (3.12, 13.0) / linux-job (gh)
AttributeError: 'MjModel' object has no attribute 'actuator_delay'. Did you mean: 'actuator_gear'?

CANCELLED JOBS - The following jobs were cancelled. Please retry:

Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
Workflow failed! Resource not accessible by integration
Libs Tests on Linux / unittests-openenv (3.10, 12.8) / linux-job (gh)
Libs Tests on Linux / unittests-robohive (3.9, 12.8) / linux-job (gh)
##[error]The operation was canceled.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

Unit-tests on Windows / unittests-cpu (3.10, windows.4xlarge, cpu) / windows-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Libs Tests on Linux / unittests-brax (3.11, 12.8) / linux-job (gh) (trunk failure)
test/test_libs.py::TestBrax::test_brax_kwargs_preserved_with_seed[fast-device1]
Libs Tests on Linux / unittests-pettingzoo / linux-job (gh) (trunk failure)
test/test_libs.py::TestPettingZoo::test_reset_parallel_env
Libs Tests on Linux / unittests-procgen (3.10, 12.8) / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens

Awesome that you're working on this!
Would it make sense to use chatenv as a base class for consistency? No idea if that makes sense

torchrl/data/tensor_specs.py

torchrl/envs/libs/openenv.py

ParamThakkar123 · 2026-02-10T05:01:45Z

@vmoens I updated my code to use ChatEnv as a base class for OpenEnv

vmoens

See my comments below

vmoens · 2026-02-11T02:18:19Z

torchrl/envs/libs/openenv.py

+        EnvBase.__init__(
+            self,
+            device=device,
+            batch_size=batch_size,
+            allow_done_after_reset=allow_done_after_reset,
+            spec_locked=spec_locked,
+        )


This is basically saying that we have nominally have a ChatEnv but we're not using it.

Either (a) genuinely subclass ChatEnv by calling super().init() and only overriding the specific env-interaction methods (_reset_history, _step_history), or (b) inherit from EnvBase directly and be honest that this is a standalone env that happens to produce ChatHistory-shaped data. Option (a) is clearly preferable for consistency.

If it's not possible, please motivate why and let's move away from ChatEnv entirely.

vmoens · 2026-02-11T02:18:44Z

torchrl/envs/libs/openenv.py

+        if len(self.batch_size):
+            raise ValueError(
+                "OpenEnvWrapper does not support batched environments. "
+                "Use ParallelEnv to create multiple OpenEnv instances."
+            )


ChatEnv enforces batch_size >= (1,) (defaults to (1,) and its spec shapes expect at least one batch dim). OpenEnvWrapper defaults to torch.Size(()) and explicitly rejects non-empty batch sizes.

Use batch_size=(1,) to match the ChatEnv contract, or document why OpenEnv needs () and how that interacts with the existing LLM policy stack. (that will likely break down the line!)

vmoens · 2026-02-11T02:20:27Z

torchrl/envs/libs/openenv.py

Almost every interaction with the underlying OpenEnv client is wrapped in bare try/except. This is very bad practice. In general we make everything we can in torchrl to avoid try/except entirely because it can quickly deteriorate UX and performance.

.github/unittest/linux_libs/scripts_openenv/run-clang-format.py

vmoens · 2026-02-11T02:21:29Z

torchrl/envs/llm/datasets/gsm8k.py

 from tensordict.utils import _zip_strict
 from torch.utils.data import DataLoader
 from torchrl.data import TensorSpec
-from torchrl.envs import StepCounter, Transform


are the import changes necessary? Circular imports I imagine

yeah. Fixed that

I am getting this error while importing it using from torchrl.envs import StepCounter, Transform:

ImportError while importing test module 'E:\rl\test\test_libs.py'. Hint: make sure your test modules/packages have valid Python names. Traceback: E:\Miniconda3\Lib\importlib\__init__.py:88: in import_module return _bootstrap._gcd_import(name[level:], package, level) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ test\test_libs.py:15: in <module> import torchrl.testing.env_helper torchrl\testing\__init__.py:17: in <module> from torchrl.testing.env_creators import ( torchrl\testing\env_creators.py:12: in <module> from torchrl.envs import MultiThreadedEnv, ObservationNorm torchrl\envs\__init__.py:7: in <module> from .batched_envs import ParallelEnv, SerialEnv torchrl\envs\batched_envs.py:3156: in <module> from torchrl.envs.libs.envpool import ( # noqa: F401, E402 torchrl\envs\libs\__init__.py:23: in <module> from .openenv import OpenEnvEnv, OpenEnvWrapper torchrl\envs\libs\openenv.py:14: in <module> from torchrl.envs.llm.chat import ChatEnv torchrl\envs\llm\__init__.py:7: in <module> from .datasets import ( torchrl\envs\llm\datasets\__init__.py:7: in <module> from .gsm8k import GSM8KEnv, GSM8KPrepareQuestion, make_gsm8k_env torchrl\envs\llm\datasets\gsm8k.py:17: in <module> from torchrl.envs import StepCounter, Transform E ImportError: cannot import name 'StepCounter' from partially initialized module 'torchrl.envs' (most likely due to a circular import) (E:\rl\torchrl\envs\__init__.py)

So I changed the imports

test/test_libs.py

vmoens · 2026-02-11T02:22:38Z

torchrl/data/llm/__init__.py

 # This source code is licensed under the MIT license found in the
 # LICENSE file in the root directory of this source tree.

+from .history import add_chat_template, ContentBase, History


why this reorder?

ParamThakkar123 added 17 commits January 20, 2026 00:23

Fixed MultiSyncCollector set_seed and split_trajs issue

1f6f327

Merge branch 'main' of https://github.com/pytorch/rl

e2aaf6b

Revert "Fixed MultiSyncCollector set_seed and split_trajs issue"

40642d5

This reverts commit 1f6f327.

Merge branch 'main' of https://github.com/pytorch/rl

efdc89c

Merge branch 'main' of https://github.com/pytorch/rl

628f44b

Merge branch 'main' of https://github.com/pytorch/rl

a476a77

Merge branch 'main' of https://github.com/pytorch/rl

0f565c5

Merge branch 'main' of https://github.com/pytorch/rl

7fb086b

Merge branch 'main' of https://github.com/pytorch/rl

ff72793

Added Support for index_select in TensorSpec

69001ed

Merge branch 'main' of https://github.com/pytorch/rl

4ab13be

rebase

2e8face

Merge branch 'main' of https://github.com/pytorch/rl

56e1529

Merge branch 'main' of https://github.com/pytorch/rl

ba6a19f

Merge branch 'main' of https://github.com/pytorch/rl

8be545b

Merge branch 'main' of https://github.com/pytorch/rl

54abe29

Add OpenEnv environments

6b099f5

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 9, 2026

github-actions bot added Feature New feature Documentation Improvements or additions to documentation CI Has to do with CI setup (e.g. wheels & builds, tests...) Environments Adds or modifies an environment wrapper Trainers labels Feb 9, 2026

Updates

8f41393

vmoens reviewed Feb 9, 2026

View reviewed changes

torchrl/data/tensor_specs.py Outdated Show resolved Hide resolved

torchrl/envs/libs/openenv.py Outdated Show resolved Hide resolved

ParamThakkar123 added 2 commits February 10, 2026 10:24

Merge branch 'main' of https://github.com/pytorch/rl into add/openenv

4a9a6e7

Using ChatEnv as base class

9b8b119

github-actions bot added the llm/ LLM-related PR, triggers LLM CI tests label Feb 10, 2026

ParamThakkar123 added 2 commits February 10, 2026 21:13

Merge branch 'main' of https://github.com/pytorch/rl into add/openenv

f790843

Added History

cf6940c

vmoens requested changes Feb 11, 2026

View reviewed changes

ParamThakkar123 added 2 commits February 11, 2026 09:36

Merge branch 'main' of https://github.com/pytorch/rl into add/openenv

16ee592

Fixes

a6c0fda

ParamThakkar123 requested a review from vmoens February 11, 2026 05:38

ParamThakkar123 and others added 4 commits February 12, 2026 08:50

Fixed circular import error

9af50f2

Merge branch 'main' of https://github.com/pytorch/rl into add/openenv

4918107

edits

93c67a0

Merge remote-tracking branch 'origin/main' into add/openenv

f3eb268

Conversation

ParamThakkar123 commented Feb 9, 2026

Description

Motivation and Context

Types of changes

Checklist

Uh oh!

pytorch-bot bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3470

❌ 15 New Failures, 3 Cancelled Jobs, 4 Unrelated Failures

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ParamThakkar123 commented Feb 10, 2026

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

vmoens Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

vmoens Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

vmoens Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vmoens Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

ParamThakkar123 Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

ParamThakkar123 Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vmoens Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Feb 9, 2026 •

edited

Loading