Marc Lanctot

http://mlanctot.info

@deepmind

Results 167 comments of


                                            Marc Lanctot

Infostate Tree Memory Usage

Hi @dpmaloney, yeah what I'd really love is to have a data structure to represent sequence-form strategies (much like we have for behavioral strategies, e.g. `TabularPolicy`). But even separately from...

Loading LibTorch AlphaZero checkpoints in Python

I think that documentation is referring to the Tensorflow-based C++ and Python versions, we should update it. There's an example evaluation against MCTS here: https://github.com/deepmind/open_spiel/blob/f522d174a1e2e8fbaf2007294985869d6e520669/open_spiel/algorithms/alpha_zero_torch/alpha_zero.cc#L248, but we should also have...

Loading LibTorch AlphaZero checkpoints in Python

No that is what I meant.. did not realize it already exists!

Add new game: Stratego/Yorktown

Thanks @BluemlJ. I agree Stratego is a great game for RL research. However, we cannot import it into our repos without explicit permission from the publishers because the game is...

Add new game: Stratego/Yorktown

Ok, great. Yes permissions have been obtained for hosting implementations of games before, e.g. for the Hanabi Learning Environment. It does take some time and the chance of getting them...

Bug in pickle serialization for Universal poker

Hmm, we don't normally serialize the solver -- does the above work for a different game, e.g. leduc_poker?

Bug in pickle serialization for Universal poker

Can you try serializing the game or the state separately from the solver.. does that work?

Bug in pickle serialization for Universal poker

I have a suspicion. The serialization requires reconstructing the state via the game string, and you're loading it by specifying a string that contains newlines, which works fine based on...

Bug in pickle serialization for Universal poker

I think this might be easily fixed by using a custom serializer for universal poker, which I believe has been on the TODO list for quite some time. I can...

Bug in pickle serialization for Universal poker

I took a quick look. This is non-trivial to fix properly because it's currently impossible to have custom *game* serializers/deserializers (having custom *state* serializers / deserializers is easy). We should...

‹
1
2
3
4
5
6
7
8
9
10
...
16
17
›