Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model-based reinforcement learning for biological sequence design #40

Open
nagataka opened this issue Oct 27, 2021 · 0 comments
Open

Model-based reinforcement learning for biological sequence design #40

nagataka opened this issue Oct 27, 2021 · 0 comments

Comments

@nagataka
Copy link
Owner

Summary

Link

Model-based reinforcement learning for biological sequence design

Author/Institution

Christof Angermueller, David Dohan, David Belanger, Ramya Deshpande, Kevin Murphy, Lucy Colwell

What is this

Ref: Algorithm 1: DyNA PPO

Comparison with previous researches. What are the novelties/good points?

Key points

Our method updates the policy’s parameters using sequences x generated by the current policy πθ(x), but evaluated using a learned surrogate f'(x), instead of the true, but unknown, oracle reward function f(x).

How the author proved effectiveness of the proposal?

Any discussions?

What should I read next?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant