How to deal with the effect of episode_length_s and roll_out on reward functions and robot behaviour? #2017

celestialdr4g0n · 2025-03-05T06:55:21Z

celestialdr4g0n
Mar 5, 2025

In my robot reaching task using the skrl library, I’ve observed that modifying the episode_length_s (in *_cfg.py) or the roll_out (in *.yaml) drastically affects the impact of my reward function. Specifically, when using a negative reward for target orientation misalignment:
• Longer episodes or higher roll_out values: The robot overly focuses on adjusting the end effector’s orientation, which causes it to fail in reaching the object.
• Shorter episodes or lower roll_out values: The robot reaches the target but exhibits shaking at the end.

It’s also very challenging to continually recalibrate the reward scales with every adjustment of these hyperparameters. What strategies or modifications can I apply to balance these effects, ensuring that the robot both reaches the target reliably and maintains a stable orientation?

RandomOakForest · 2025-03-08T14:34:38Z

RandomOakForest
Mar 8, 2025
Maintainer

Thank you for posting this. The team will engage soon. @Toni-SM, any thoughts? Thanks.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to deal with the effect of episode_length_s and roll_out on reward functions and robot behaviour? #2017

{{title}}

Replies: 1 comment

{{title}}

Select a reply

How to deal with the effect of episode_length_s and roll_out on reward functions and robot behaviour? #2017

celestialdr4g0n Mar 5, 2025

Replies: 1 comment

RandomOakForest Mar 8, 2025 Maintainer

celestialdr4g0n
Mar 5, 2025

RandomOakForest
Mar 8, 2025
Maintainer