AlphaNLHoldem
AlphaNLHoldem copied to clipboard
An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.
"Rlcard environment sucks, 50bb pot, wrong pot sizes, wrong action order after flop, I don't know where to start. But it's the only environment I konw out there suitable for...
Thanks for your wonderful project. But I wonder why Trinal-Clip PPO in Alphaholdem is not used.
> Even after ~ 1 billion self-play, over 1000 checkpoints, the model seems still not converge Thanks for your wonderful project. May I ask how do you judge the convergency...
This was apparently written in 2023 but is using a version of ray released in early 2020. Not sure why that is but it makes it impossible to use because...
ERROR: Could not find a version that satisfies the requirement tensorflow==1.15.2 (from versions: 2.8.0rc0, 2.8.0rc1, 2.8.0, 2.8.1, 2.8.2, 2.8.3, 2.8.4, 2.9.0rc0, 2.9.0rc1, 2.9.0rc2, 2.9.0, 2.9.1, 2.9.2, 2.9.3, 2.10.0rc0, 2.10.0rc1, 2.10.0rc2,...