[Question] Stochastic of environmental dynamics in Gym control tasks #229

return-sleep · 2023-11-28T02:50:45Z

Question

When I download the offline dataset and want to reproduce the trajectory with the same action sequences and initiate states, the subsequent state sequences (obs &reward) gradually offset the original offline trajectory over time.

test1.mp4

The following code is used.

dataset = env.get_dataset()
states = dataset['state'][0:H] # we sample the first trajectory
actions = dataset['state'][0:H]
init_state = states[0]
env.set_state[init_state]

for t in range(H):
      obs, reward, done,info = env.step(actions[t])

I hope for your reply!

The text was updated successfully, but these errors were encountered:

EdanToledo · 2024-08-29T15:26:20Z

I've also noticed this! Any update?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Stochastic of environmental dynamics in Gym control tasks #229

[Question] Stochastic of environmental dynamics in Gym control tasks #229

return-sleep commented Nov 28, 2023

EdanToledo commented Aug 29, 2024

[Question] Stochastic of environmental dynamics in Gym control tasks #229

[Question] Stochastic of environmental dynamics in Gym control tasks #229

Comments

return-sleep commented Nov 28, 2023

Question

EdanToledo commented Aug 29, 2024