I'm a PhD student and use mujoco_mpc for my research. While experimenting with the project, I found a potential bug in the robust planner implementation. I noticed it by printing the times parameter of the sampling planner: its entries stay at 0 the whole time.
I traced the bug to robust_planner.cc.
The problem is that the sampling planner calls UpdateNominalPolicy(horizon) before optimizing the policy, but the robust planner never does.
One solution is to add the function UpdateNominalPolicy(horizon) to RankedPlanner, and call
delegate_->UpdateNominalPolicy(horizon);
before optimizing the policy candidates.
Thanks and please let me know what you think.
Thanks for diagnosing this issue. I think you are right!
RobustPlanner is meant to be agnostic to which delegate it's using, though, and UpdateNominalPolicy is not part of the RankedPlanner interface. So I think that the fix would be to move this call:
this->UpdateNominalPolicy(horizon);
from SamplingPlanner::OptimizePolicy to SamplingPlanner::OptimizePolicyCandidates.
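To make the proposed move concrete, here is a rough, self-contained sketch of the idea. It is a toy model, not the actual mujoco_mpc code: ToySamplingPlanner, ToyRobustPlanner, TimesAfterRobustOptimize, the simplified signatures, and the 0.01 time step are all made up for illustration, and the real interfaces take more arguments. It only shows that once the update lives in OptimizePolicyCandidates, a wrapper that knows nothing beyond the RankedPlanner interface still gets refreshed times.

```cpp
#include <cassert>
#include <memory>
#include <utility>
#include <vector>

// Toy stand-in for the RankedPlanner interface (simplified, hypothetical signature).
class RankedPlanner {
 public:
  virtual ~RankedPlanner() = default;
  virtual int OptimizePolicyCandidates(int horizon) = 0;
};

// Toy stand-in for SamplingPlanner.
class ToySamplingPlanner : public RankedPlanner {
 public:
  // Refreshes the time knots of the nominal policy (illustrative 0.01 s step).
  void UpdateNominalPolicy(int horizon) {
    times.assign(horizon, 0.0);
    for (int t = 0; t < horizon; ++t) times[t] = t * 0.01;
  }

  // The update now happens here, so every caller of the interface gets it.
  int OptimizePolicyCandidates(int horizon) override {
    UpdateNominalPolicy(horizon);
    return 1;  // pretend one candidate was produced
  }

  // The direct path no longer needs its own UpdateNominalPolicy call.
  void OptimizePolicy(int horizon) { OptimizePolicyCandidates(horizon); }

  std::vector<double> times = {0.0, 0.0, 0.0};  // stays all zero if never updated
};

// Toy stand-in for RobustPlanner: it only sees the RankedPlanner interface.
class ToyRobustPlanner {
 public:
  explicit ToyRobustPlanner(std::unique_ptr<RankedPlanner> delegate)
      : delegate_(std::move(delegate)) {}

  void OptimizePolicy(int horizon) {
    delegate_->OptimizePolicyCandidates(horizon);
  }

 private:
  std::unique_ptr<RankedPlanner> delegate_;
};

// Helper: run the wrapped planner once and return the delegate's times.
std::vector<double> TimesAfterRobustOptimize(int horizon) {
  auto sampling = std::make_unique<ToySamplingPlanner>();
  ToySamplingPlanner* raw = sampling.get();  // observe the delegate after the move
  ToyRobustPlanner robust(std::move(sampling));
  robust.OptimizePolicy(horizon);
  return raw->times;
}
```

In this sketch the wrapper never mentions UpdateNominalPolicy, which is the point of keeping it out of the RankedPlanner interface.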
Do you want to send us a pull request, or would you rather we fixed it?
Sorry for the late reply. Due to my schedule, I might not be able to fix and debug it myself. Could you fix it instead, since you are probably more familiar with the project code? Thanks!