When should intrinsic rewards be given? #9

jmichaux · 2019-04-29T23:33:01Z

Right now intrinsic rewards are given on any transition. Intuitively, this isn't quite right because we end up rewarding the agent when it dies. This could, in theory, lead to excessive exploration a la the Noisy TV problem. In practice it doesn't really seem to matter. But it would be interesting to see if we can speed up learning by only giving the exploration bonus on transition where the agent doesn't fail.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When should intrinsic rewards be given? #9

When should intrinsic rewards be given? #9

jmichaux commented Apr 29, 2019

When should intrinsic rewards be given? #9

When should intrinsic rewards be given? #9

Comments

jmichaux commented Apr 29, 2019