ALE v0.6 differences in start state #291

JesseFarebro · 2020-01-11T21:05:54Z

There was an issue raised (openai/gym#1777) which describes differences between v0.5.2 and v0.6.0 of the ALE. I traced some of the issues to this commit 7bff96b#diff-d9d868097a7403416e6ef352d95dc4feR178 which changes how StellaEnvironment::softReset works.

The RESET action is called m_num_reset times which leads to a different starting state for the agent. Perhaps this was intended behaviour in StellaEnvironment::reset but has ill-intended consequences in StellaEnvironment::softReset.

For example, here are the starting states for Ms. Pacman in ALE v0.5.2 and v0.6.0. Note if you emulate one RESET action then we get the v0.5.2 starting state.

Ms. Pacman, ALE v0.5.2

Ms. Pacman, ALE v0.6.0

You can see the subtle changes between these two frames (e.g., the colour of ghosts in jail).

I haven't looked into why we repetitively call RESET. Should this be something that is investigated further? It wouldn't seem that this should affect asymptotic performance.

The text was updated successfully, but these errors were encountered:

mgbellemare · 2020-01-13T01:40:39Z

I wouldn't expect this to be a big driver of performance, no. The ALE determinism has always been brittle at best -- going through saveState/loadState should provide a more robust way to reproducibility. Thanks for flagging this!

JesseFarebro mentioned this issue Jan 11, 2020

Behaviors of Atari envs have changed by atari-py>=0.2 openai/gym#1777

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ALE v0.6 differences in start state #291

ALE v0.6 differences in start state #291

JesseFarebro commented Jan 11, 2020

mgbellemare commented Jan 13, 2020

ALE v0.6 differences in start state #291

ALE v0.6 differences in start state #291

Comments

JesseFarebro commented Jan 11, 2020

Ms. Pacman, ALE v0.5.2

Ms. Pacman, ALE v0.6.0

mgbellemare commented Jan 13, 2020