[Question] Is it possible to use custom LSTM and transformer models with MlpPolicy ActorCriticPolicy? #1407

Closed · 4 tasks done
HaakonFlaaronning opened this issue Mar 23, 2023 · 2 comments
Labels: duplicate, question

Comments

@HaakonFlaaronning

❓ Question

I want to use PPO and A2C with a custom LSTM and transformer network. PPO only has native support for "MlpPolicy", "CnnPolicy" and "MultiInputPolicy". Can I still use "MlpPolicy" but specify a custom LSTM or transformer network, or can it only be used with networks that have linear layers? Does it mess up the training if I specify an LSTM network?

Checklist

  • I have checked that there is no similar issue in the repo
  • I have read the documentation
  • If code there is, it is minimal and working
  • If code there is, it is formatted using the markdown code blocks for both code and stack traces.
@HaakonFlaaronning HaakonFlaaronning added the question Further information is requested label Mar 23, 2023
@araffin araffin added the duplicate This issue or pull request already exists label Mar 23, 2023
@araffin
Member

araffin commented Mar 24, 2023

"I have checked that there is no similar issue in the repo"

Duplicate of #1387, #1077, #177 and Stable-Baselines-Team/stable-baselines3-contrib#165

For RecurrentPPO, please take a look at SB3 contrib (as written in the doc): https://sb3-contrib.readthedocs.io/en/master/modules/ppo_recurrent.html
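As a minimal usage sketch of what the linked contrib docs describe (the environment and timestep budget below are placeholders, not taken from this thread):

```python
# Minimal sketch: RecurrentPPO from sb3_contrib with its built-in LSTM policy.
# "CartPole-v1" and total_timesteps are illustrative placeholders.
from sb3_contrib import RecurrentPPO

model = RecurrentPPO("MlpLstmPolicy", "CartPole-v1", verbose=1)
model.learn(total_timesteps=10_000)
```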

@Pythoniasm

I can recommend using a custom features extractor that does everything you want. However, you might want to adjust the MLP heads of the critic and actor accordingly, while still using the MlpPolicy (see the sketch below).
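A minimal sketch of that suggestion, assuming a flat Box observation space and a recent SB3 version; the AttentionExtractor class, its sizes, and "CartPole-v1" are illustrative choices, not from this thread. The custom extractor is plugged in via policy_kwargs, and net_arch adjusts the actor/critic MLP heads on top of it:

```python
# Sketch only: a transformer-style features extractor used with the standard MlpPolicy.
# AttentionExtractor, features_dim=64 and "CartPole-v1" are illustrative assumptions.
import torch
import torch.nn as nn

from stable_baselines3 import PPO
from stable_baselines3.common.torch_layers import BaseFeaturesExtractor


class AttentionExtractor(BaseFeaturesExtractor):
    """Illustrative transformer-style features extractor for flat observations."""

    def __init__(self, observation_space, features_dim: int = 64):
        super().__init__(observation_space, features_dim)
        obs_dim = int(observation_space.shape[0])
        self.embed = nn.Linear(obs_dim, features_dim)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=features_dim, nhead=4, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=1)

    def forward(self, observations: torch.Tensor) -> torch.Tensor:
        # Treat each observation as a length-1 sequence; a real model would
        # build a proper sequence (e.g. from stacked past observations) instead.
        x = self.embed(observations).unsqueeze(1)
        return self.encoder(x).squeeze(1)


policy_kwargs = dict(
    features_extractor_class=AttentionExtractor,
    features_extractor_kwargs=dict(features_dim=64),
    # Adjust the actor/critic MLP heads that sit on top of the extractor.
    net_arch=dict(pi=[64], vf=[64]),
)
model = PPO("MlpPolicy", "CartPole-v1", policy_kwargs=policy_kwargs, verbose=1)
model.learn(total_timesteps=10_000)
```

Note that with plain PPO the extractor only sees the current observation, so an LSTM placed here would not carry hidden state across environment steps; keeping and propagating that state is what RecurrentPPO (linked above) handles.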

araffin closed this as not planned on Mar 31, 2023