tianshou icon indicating copy to clipboard operation
tianshou copied to clipboard

[question] LSTM for A2C with discrete action space

Open cbschen opened this issue 1 year ago • 2 comments

  • [x] I have marked all applicable categories:
    • [] exception-raising bug
    • [] RL algorithm bug
    • [ ] documentation request (i.e. "X is missing from the documentation.")
    • [ ] new feature request
  • [x] I have visited the source website
  • [x] I have searched through the issue tracker for duplicates
  • [ ] I have mentioned version numbers, operating system and environment, where applicable:

Hi, I am relatively new to Tianshou and RL and I have been trying to apply LSTM to A2C algorithm with discrete action space. From the documentation, it says to use recurrent policy we need to use RecurrentActorProb, but it seems this is for continuous action space only.

Is there a way to get it work on discrete action space? I tried to use Recurrent + Actor but that does not seem to work.

Also, I saw a lot of opening issues regarding the potential bugs in the RNN functionality in tianshou. Have those been fixed already?

Thank you very much!


This is the error I got when I try to use Recurrent feature extraction net with Actor (for discrete action space):

I apologize if the question seems rudimentary, but I am not quite sure how to get it to work tianshou question

cbschen avatar Feb 27 '23 20:02 cbschen