refactor(lyd): refactor dt_policy in new pipeline and add img input support #693

AltmanD · 2023-07-25T09:31:44Z

Refactor DT to new pipeline.
Add img input support for atari.

dizoo/box2d/lunarlander/config/lunarlander_dt_config.py

PaParaZz1 · 2023-07-25T12:36:46Z

dizoo/box2d/lunarlander/config/lunarlander_decision_transformer.py

@@ -27,7 +27,7 @@
        embed_dim=128,


remove old decision transformer config

dizoo/d4rl/config/__init__.py

ding/policy/decision_transformer.py

ding/envs/env_wrappers/env_wrappers.py

ding/example/dt_atari.py

PaParaZz1 · 2023-07-25T14:48:21Z

ding/policy/dt.py

+            self._optimizer, lambda steps: min((steps + 1) / warmup_steps, 1)
+        )
+
+        self.max_env_score = -1.0


remove this

ding/policy/dt.py

ding/framework/middleware/functional/data_processor.py

ding/model/template/decision_transformer.py

ding/model/template/dt.py

ding/torch_utils/network/transformer.py

dizoo/atari/config/pong_dt_config.py

ding/policy/dt.py

PaParaZz1 · 2023-07-31T12:29:19Z

ding/policy/dt.py

+            self.states = torch.zeros((self.eval_batch_size, self.max_eval_ep_len,) + tuple(self.state_dim), dtype=torch.float32, device=self.device)
+            self.running_rtg = [self.rtg_target for _ in range(self.eval_batch_size)]
+        else:
+            self.running_rtg = [self.rtg_target / self.rtg_scale for _ in range(self.eval_batch_size)]


remove rtg_scale argument

ding/policy/dt.py

ding/utils/data/dataset.py

PaParaZz1 · 2023-07-31T12:36:42Z

ding/utils/data/dataset.py

+            return timesteps, states, actions, rtgs, traj_mask
+
+
+class FixedReplayBuffer(object):


can we merge this class into the above class?

ding/utils/data/dataset.py

AltmanD added 5 commits July 14, 2023 10:18

Revise old version dt pipline

12c9dd5

Add new dt pipline

887b587

Add DT in new pipeline

737b7b6

Add img input to support atari

fce01fc

Fix according to comment

8b330a6

AltmanD marked this pull request as ready for review July 25, 2023 09:31

AltmanD marked this pull request as draft July 25, 2023 09:35

AltmanD marked this pull request as ready for review July 25, 2023 09:36

AltmanD requested a review from PaParaZz1 July 25, 2023 11:14

AltmanD added the refactor refactor module or component label Jul 25, 2023

PaParaZz1 requested changes Jul 25, 2023

View reviewed changes

AltmanD and others added 3 commits July 31, 2023 13:27

Fix dt config files

1ccb2ec

Merge branch 'main' into dev-dt

1e7b774

Fix abs path

062520a

PaParaZz1 requested changes Jul 31, 2023

View reviewed changes

AltmanD added 10 commits August 7, 2023 14:51

Accelerate DT train iter by replacing dataloader

edce163

Simplify dt model and policy and config

0fe0c02

reformat

07601f9

Reformat

9bd641f

Change data fatcher func to class

126c1e3

Add threading shift data to gpu

1835242

Change action sample func

3406ab1

Add configure optimizers

49bfb3c

Add multi gpu support

bb15845

Add dt policy test serial

9712a29

PaParaZz1 mentioned this pull request Aug 9, 2023

Roadmap for DI-engine #548

Open

AltmanD and others added 3 commits August 10, 2023 13:40

Fix multi gpu support and data fetcher

b898184

Reformat

2d7c406

Merge branch 'main' into dev-dt

27dfc80

AltmanD added 6 commits August 14, 2023 11:48

Reformat

550aa58

Reformat

1f8db9c

Reformat

477640f

Reformat

22fc34f

Reformat

4f349b4

Reformat

0af7684

PaParaZz1 approved these changes Aug 19, 2023

View reviewed changes

PaParaZz1 merged commit 3a73dd4 into opendilab:main Aug 19, 2023
9 of 18 checks passed

AltmanD deleted the dev-dt branch August 23, 2023 07:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(lyd): refactor dt_policy in new pipeline and add img input support #693

refactor(lyd): refactor dt_policy in new pipeline and add img input support #693

AltmanD commented Jul 25, 2023

PaParaZz1 Jul 25, 2023

PaParaZz1 Jul 25, 2023

PaParaZz1 Jul 31, 2023

PaParaZz1 Jul 31, 2023

		return timesteps, states, actions, rtgs, traj_mask


		class FixedReplayBuffer(object):

refactor(lyd): refactor dt_policy in new pipeline and add img input support #693

refactor(lyd): refactor dt_policy in new pipeline and add img input support #693

Conversation

AltmanD commented Jul 25, 2023

PaParaZz1 Jul 25, 2023

Choose a reason for hiding this comment

PaParaZz1 Jul 25, 2023

Choose a reason for hiding this comment

PaParaZz1 Jul 31, 2023

Choose a reason for hiding this comment

PaParaZz1 Jul 31, 2023

Choose a reason for hiding this comment