Skip to content

Commit

Permalink
Merge pull request #25 from Eclectic-Sheep/fix/dreamer_v3_typo
Browse files Browse the repository at this point in the history
fix: dreamerV3 blogpost symexp definition
  • Loading branch information
DavideTr8 authored Jan 30, 2024
2 parents 04ae1a4 + 849dd1a commit 0a0171f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion _posts/2023-08-10-dreamer_v3.md
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ $$ \text{symlog}(x) \doteq \text{sign}(x) \ln \left(\lvert x \rvert + 1\right) $

Given the neural network prediction, it is possible to obtain the non-transformed target by appling the inverse transformation (i.e., the *symexp*):

$$ \text{symexp}(x) \doteq \text{sign}(x) \left(\exp \left(\lvert x \rvert \right) + 1\right) $$
$$ \text{symexp}(x) \doteq \text{sign}(x) \left(\exp \left(\lvert x \rvert \right) - 1\right) $$

The last detail to report is that the symlog prediction is used in the decoder, the reward model and the critic. Moreover, the inputs of the MLP encoder (the one that encodes observations in vector form) are squashed with the *symlog* function ([Figure 2](#fig-symlog){: .fig-link}).

Expand Down

0 comments on commit 0a0171f

Please sign in to comment.