Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Algorithm Request: more DQN-based approaches #68

Open
samlobel opened this issue Aug 7, 2023 · 2 comments
Open

Algorithm Request: more DQN-based approaches #68

samlobel opened this issue Aug 7, 2023 · 2 comments
Labels
algorithm help wanted Extra attention is needed

Comments

@samlobel
Copy link

samlobel commented Aug 7, 2023

I am thinking of using sheeprl as the base for my RL experiments! My work usually builds off of DQN-type algorithms: in increasing level of complexity, off of DDQN, Rainbow, or R2D2. Having some of these implemented would make this library much more convenient for research IMO.

It's super cool that you have Dreamer and Plan2Explore implemented, but that's not a great starting point for RL research because of how complex and opinionated they are. It would be great to just have a simple DDQN baseline implemented! All the PPO and A2C stuff is great if you work in the online setting, but DQN type things would let us build things for batch training. In a perfect world it'd also have an implementation of something like R2D2 since that's a good SOTA-ish DQN upgrade.

Excited to start trying it out!

@belerico
Copy link
Member

belerico commented Aug 8, 2023

Hi @samlobel and thank you for using sheeprl for your experiments! If you want we can start with a standard DDQN agent: you can have a look at our PPO implementations from this branch where we have the possibility to encode both images and vectors. We can start after the mentioned branch is merged, hopefully this week

@belerico
Copy link
Member

belerico commented Aug 9, 2023

@samlobel you can now try out directly the main branch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
algorithm help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants