rlcard icon indicating copy to clipboard operation
rlcard copied to clipboard

In Mahjong game prediction, it appears that the order of state['current_hand'] influences the result of eval_step, what could be the reason?

Open jacy opened this issue 1 year ago • 1 comments

jacy avatar Feb 04 '24 02:02 jacy

found the root cause: in mahjong extract_state function the raw_legal_actions and legal_actions doesn't match, legal_actions is the unique list of player's hand, but raw_legal_actions is the list of player's hand

jacy avatar Feb 05 '24 01:02 jacy