Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi output policies #502

Closed
H-Park opened this issue Oct 9, 2019 · 4 comments
Closed

Multi output policies #502

H-Park opened this issue Oct 9, 2019 · 4 comments
Labels
enhancement New feature or request

Comments

@H-Park
Copy link

H-Park commented Oct 9, 2019

Example: in pysc2, some actions require positional arguments, such as move commands.

This might be too environment specific, but is there a want to allow this?

@araffin araffin added the enhancement New feature or request label Oct 9, 2019
@araffin
Copy link
Collaborator

araffin commented Oct 9, 2019

Hello,

Could you be more precise?
You mean having different action spaces at the same time?

@Miffyli
Copy link
Collaborator

Miffyli commented Oct 9, 2019

@araffin
I understood it that way, yes. I.e. Support for spaces.Tuple/spaces.Dict but for actions rather than observations, as discussed in #133 .

The major modification is same for both: Currently everything is more or less bundled up into arrays, which makes overall processing of things quite a bit tidier. A support for tuple/dict spaces would require storing variable length arrays/tuples somewhere along the line as well as creating dynamic number of placeholders and whatnot.

However, if the next backend is going to be non-graph one (i.e. TF2 eager mode and/or PyTorch), this should be easier to do with those frameworks. I hate to move more and more stuff to "next-backend" project, but this sounds much more reasonable enhancement alongside new backend, rather than implementing it in current graph-version and then translating it to the new backend.

@H-Park
Copy link
Author

H-Park commented Oct 9, 2019

Is there a timeline in place for this transition?

@Miffyli
Copy link
Collaborator

Miffyli commented Oct 9, 2019

No specific timeline other than "one day™". The first step would be to support TF2.0 via the backwards-compatible API in 1.14 (see #366), but that would be mostly renaming function calls.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants