Replies: 1 comment
This question relates to the algorithm rather than the environment, because the environment only provides information. If you want to see how algorithms handle observations from environments, check out OmniSafe, a comprehensive library for Safe RL.
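To sketch the idea the question is asking about: many Safe RL algorithms (e.g. Lagrangian variants of PPO, including those in OmniSafe) do not fold the cost into the reward function. Instead, a single policy maximizes a Lagrangian objective, reward minus a learned multiplier times cost, while the multiplier is updated by dual ascent on the constraint violation. The snippet below is a minimal illustration of that mechanism, not OmniSafe's actual code; the function name, learning rate, and episode statistics are placeholders chosen for the example.

```python
# Minimal sketch of the Lagrangian relaxation used by many Safe RL
# algorithms. This is NOT OmniSafe's implementation; episode returns
# below are synthetic placeholders.

def lagrangian_update(ep_reward, ep_cost, lam, cost_limit, lam_lr=0.05):
    """One dual-variable step: lambda grows while cost exceeds the limit."""
    # Objective the actor would maximize: reward minus penalized cost.
    # The reward function itself is untouched; the cost enters only here.
    objective = ep_reward - lam * ep_cost
    # Dual ascent on the constraint violation, projected to keep lambda >= 0.
    lam = max(0.0, lam + lam_lr * (ep_cost - cost_limit))
    return objective, lam

lam = 0.0
cost_limit = 25.0
# Synthetic rollout statistics: episode cost shrinks as the penalty grows.
for ep_cost in [40.0, 35.0, 30.0, 26.0, 24.0]:
    objective, lam = lagrangian_update(ep_reward=100.0, ep_cost=ep_cost,
                                       lam=lam, cost_limit=cost_limit)
```

Note that there is only one policy: the constraint influences the policy update through the multiplier term in the objective, and a separate cost critic (omitted here) typically estimates the cost term, mirroring the reward critic.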
I want to know where, and in what form, you integrate the cost function of the safety constraints into the overall training loop.
If the constraints should not affect the reward function, how do you update the policy without being influenced by them, and where do they play a critical role?
Do you have two separate policies?
I would also appreciate guidance on the code you have written.
Thank you.