Algorithm Request: more DQN-based approaches #68

samlobel · 2023-08-07T18:01:41Z

I am thinking of using sheeprl as the base for my RL experiments! My work usually builds off of DQN-type algorithms: in increasing level of complexity, off of DDQN, Rainbow, or R2D2. Having some of these implemented would make this library much more convenient for research IMO.

It's super cool that you have Dreamer and Plan2Explore implemented, but that's not a great starting point for RL research because of how complex and opinionated they are. It would be great to just have a simple DDQN baseline implemented! All the PPO and A2C stuff is great if you work in the online setting, but DQN type things would let us build things for batch training. In a perfect world it'd also have an implementation of something like R2D2 since that's a good SOTA-ish DQN upgrade.

Excited to start trying it out!

belerico · 2023-08-08T09:48:05Z

Hi @samlobel and thank you for using sheeprl for your experiments! If you want we can start with a standard DDQN agent: you can have a look at our PPO implementations from this branch where we have the possibility to encode both images and vectors. We can start after the mentioned branch is merged, hopefully this week

belerico · 2023-08-09T13:13:57Z

@samlobel you can now try out directly the main branch

belerico added the algorithm label Aug 11, 2023

belerico added the help wanted Extra attention is needed label Sep 26, 2023

belerico mentioned this issue Feb 20, 2024

Pure python training, evaluation and rollout documentation request. #209

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Algorithm Request: more DQN-based approaches #68

Algorithm Request: more DQN-based approaches #68

samlobel commented Aug 7, 2023

belerico commented Aug 8, 2023

belerico commented Aug 9, 2023

Algorithm Request: more DQN-based approaches #68

Algorithm Request: more DQN-based approaches #68

Comments

samlobel commented Aug 7, 2023

belerico commented Aug 8, 2023

belerico commented Aug 9, 2023