DeepQLearning.jl Action masking feature (legal actions)

Action masking feature (legal actions)

Open filchristou opened this issue 1 year ago • 1 comments

POMDPs.jl supports state-dependent action spaces

However, DeepQLearning.jl is always picking the full action space. That's because the solve enumerates the actions once here, hands them into the policy, which are broadly used there after.

Do you think of a way to have action masking with the current implementation ?

Jan 24 '24 15:01 filchristou

DeepQLearning.jl DeepQLearning.jl copied to clipboard

Action masking feature (legal actions)

DeepQLearning.jl
DeepQLearning.jl copied to clipboard