Create custom rewards handler #52

Open
duyminh1998 opened this issue Nov 25, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@duyminh1998
Owner

Why

As a

user of pyCMO

I want

to be able to specify different reward models for my scenarios

So that

I can train RL agents

Acceptance Criteria

Given

we currently export only the player side's total score as the reward

When

we implement a way for users to specify a reward model

Then

we get closer to being able to train RL agents

Notes

One idea is to create a custom RewardHandler class that is passed into CMOEnv and calculates the reward from the current observation.
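A minimal sketch of what such a handler could look like, offered only as an illustration of the idea. Everything here is an assumption: the `compute` method name, the `ScoreDeltaReward` example, the `"score"` observation key, and the `ToyEnv` stand-in are hypothetical and do not reflect pyCMO's actual observation schema or the `CMOEnv` constructor.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Protocol


class RewardHandler(Protocol):
    """Hypothetical interface: maps an observation to a scalar reward."""

    def compute(self, observation: Mapping[str, Any]) -> float: ...


@dataclass
class ScoreDeltaReward:
    """Rewards the change in side score since the previous step.

    The "score" key is an assumed observation field, not pyCMO's real schema.
    """

    previous_score: float = 0.0

    def compute(self, observation: Mapping[str, Any]) -> float:
        score = float(observation["score"])
        reward = score - self.previous_score
        self.previous_score = score
        return reward


class ToyEnv:
    """Stand-in for CMOEnv, only to show where the handler would plug in."""

    def __init__(self, reward_handler: RewardHandler) -> None:
        self.reward_handler = reward_handler

    def step(self, observation: Mapping[str, Any]) -> float:
        # A real environment would produce the observation itself after
        # applying an action; here it is passed in to keep the sketch small.
        return self.reward_handler.compute(observation)
```

With this shape, swapping reward models would mean passing a different handler object to the environment, and the environment itself would not need to know how the reward is computed.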

duyminh1998 added the enhancement label on Nov 25, 2023
duyminh1998 moved this to Backlog in duyminh1998's pyCMO on Nov 25, 2023
@duyminh1998
Owner Author

Gymnasium already provides reward wrappers (for example, gymnasium.RewardWrapper), which could serve as a starting point.

Projects
Status: Backlog