LSTM based policy in stable baselines3 model
I am trying to make a PPO model using the stable-baselines3 library. I want to use a policy network with an LSTM layer in it. However, I can't find such a possibility on the library's website although it exists on the previous version of stable-baselines here https://stable-baselines.readthedocs.io/en/master/modules/policies.html#stable_baselines.common.policies.MlpLstmPolicy.
Does this possibility exist in stable-baselines3 (not stable-baselines)? if not, is there any other possibility I can do this? Thanx.
Solution 1:
From the migration doc.
https://stable-baselines3.readthedocs.io/en/master/guide/migration.html
Breaking Changes¶
LSTM policies (
MlpLstmPolicy
,CnnLstmPolicy
) are not supported for the time being (see PR #53 for a recurrent PPO implementation)