gymnasium provides reward wrappers
## Why

**As a** user of pyCMO
**I want** to be able to specify different reward models for my scenarios
**So that** I can train RL agents

## Acceptance Criteria

**Given** we currently only export the player's side's total score as the reward
**When** we implement a way for users to specify a reward model
**Then** we get closer to being able to train RL agents
## Notes

One idea is to create a custom `RewardHandler` class that gets passed into `CMOEnv` and can calculate the reward based on the current observation.
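A minimal sketch of what that `RewardHandler` idea could look like. Everything here is an assumption for illustration: the `reward()` method name, the dict-style observation, and the `side_score` / `units_lost` fields are hypothetical, not pyCMO's actual interface:

```python
class RewardHandler:
    """Default handler: reproduce the current behaviour, i.e. return
    the player's side's total score as the reward. The observation
    layout below is a hypothetical stand-in for pyCMO's real one."""

    def reward(self, observation: dict) -> float:
        return observation["side_score"]  # assumed observation field


class UnitLossPenalty(RewardHandler):
    """Example custom handler: side score minus a fixed penalty for
    each friendly unit lost (penalty value chosen arbitrarily)."""

    def __init__(self, penalty: float = 50.0):
        self.penalty = penalty

    def reward(self, observation: dict) -> float:
        return observation["side_score"] - self.penalty * observation["units_lost"]


# Usage: CMOEnv would call handler.reward(obs) each step instead of
# reading the side score directly (the constructor hook is assumed).
obs = {"side_score": 200.0, "units_lost": 3}
r = UnitLossPenalty().reward(obs)  # 200 - 50 * 3 = 50.0
```

Passing the handler into `CMOEnv`'s constructor would let users swap reward models per scenario without touching the environment code.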