Sacha Chernyavskiy comments

Results: 61 comments of Sacha Chernyavskiy

Deep cfr jax refactor

@fuyuan-li, I erroneously had an odd relu in the network. Sorry! Now both implementations should converge.

@lanctot both versions now converge for Kuhn poker; see https://github.com/google-deepmind/open_spiel/pull/1406. However, without additional code improvements (e.g. jitting the buffer, whose changing size causes constant recompilation), the...

Deep cfr jax refactor

@fuyuan-li, thank you for testing. Good that the results are reproducible. The newest commit should roughly halve the computation time (on my Mac, at least). The further...

Deep cfr jax refactor

@fuyuan-li check it out now. The `jax` version should be only 4-6 times slower than PyTorch. That is partly because of https://github.com/jax-ml/jax/issues/16587 (in the `append_to_reservior` function) and because I allocate the whole buffer...
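
For readers unfamiliar with the idea, here is a minimal sketch of a reservoir buffer that is allocated in full up front and updated through a pure function. It is written in plain Python rather than `jax` to stay dependency-free, and it is not the code from the PR: the names `init_reservoir` and `reservoir_append` and the dict layout are hypothetical. The fixed-capacity layout is the kind of structure that would keep array shapes constant, which is presumably what matters for avoiding recompilation under jit.

```python
import random


def init_reservoir(capacity):
    """Allocate the whole buffer up front; `seen` counts items offered so far."""
    return {"data": [None] * capacity, "seen": 0}


def reservoir_append(state, item, rng):
    """Offer `item` to the reservoir and return a new state (classic Algorithm R)."""
    capacity = len(state["data"])
    data = list(state["data"])  # copy, so the input state is left untouched
    seen = state["seen"]
    if seen < capacity:
        # Buffer not full yet: fill the next free slot.
        data[seen] = item
    else:
        # Buffer full: draw j uniformly from [0, seen]; keep the item only if j hits a slot.
        j = rng.randrange(seen + 1)
        if j < capacity:
            data[j] = item
    return {"data": data, "seen": seen + 1}


# Usage: stream 100 items through a reservoir that holds 4 of them.
rng = random.Random(0)
state = init_reservoir(4)
for x in range(100):
    state = reservoir_append(state, x, rng)
```

The buffer length never changes after `init_reservoir`; only the slot contents and the `seen` counter do.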

Deep cfr jax refactor

The PyTorch implementation hasn't really changed in terms of readability. In the `jax` implementation, I replaced the buffer with a set of functions and made the training loop jittable. Should still stay...

Deep cfr jax refactor

@fuyuan-li thank you for your updates. Yes, I will update it locally. When everything is confirmed, we can merge. Just out of interest, can you compare GPU performance?...

Deep cfr jax refactor

@fuyuan-li, thank you, I will be back tomorrow with the fixed results. Let's say that 0.66 is what we're looking for. Thank you for the testing, I couldn't have done it without you.

Deep cfr jax refactor

My final results with the latest commit for `leduc_poker` and default hyperparameters:

1) w/ `pytorch`
```
I1221 22:48:52.097683 8456724608 deep_cfr_pytorch.py:76] Final policy loss: 4.595390796661377
I1221 22:48:52.189984 8456724608 deep_cfr_pytorch.py:82] Deep CFR...
```

Deep cfr jax refactor

Yes, I just trimmed a couple of lines; it doesn't affect the results.

Deep cfr jax refactor

@lanctot I am sorry. I checked the tests and they didn't fail, but I forgot(?) to rerun them before the commit. Also, I am not sure that they will pass on Python 3.10...