open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Results: 87 open_spiel issues

Does AlphaZero support tree reuse?

question

Hello, I'm trying to create a strong Mancala bot. I chose Q-learning: `# Let's do independent Q-learning in Mancala, and play it against random. # RL is based on python/examples/independent_tabular_qlearning.py`...

This is a feature proposal. While basing my work on this version of AlphaZero (TensorFlow, both in Python and C++), I have addressed a number of points, including:...

contribution welcome

MF-PPO algorithm implemented in the paper: @inproceedings{algumaei2023regularization, title={Regularization of the policy updates for stabilizing Mean Field Games}, author={Algumaei, Talal and Solozabal, Ruben and Alami, Reda and Hacid, Hakim and...

I noticed that the information tensor for Universal Poker works when the game is abstracted, but when the game is unabstracted (i.e. full game) the information tensor is still abstracted,...

bug

An implementation of CFR in JAX, which allows running CFR with GPU acceleration. The speedup over the Python CFR implementation is about 10x on CPU alone. The goal was to make it...

In the example for RNaD, the importance sampling correction for get_loss_nerd is 1. This is because the example provided is the on-policy case, and there are synchronous updates of the...

A Python implementation of the EFR (https://arxiv.org/abs/2102.06973) algorithm with the deviation types defined in the proposing paper. The implementation was developed as part of my undergraduate dissertation and I thought...

Hi, I am currently working on experiments for my algorithm and intend to test it on the subgame of Libratus mentioned in [^1]. I have noticed that the **universal_poker** has...

So, I'm playing quoridor, and I was trying to figure out which action IDs corresponded to moving the agent. Therefore, after I placed all the walls, I went to look...

help wanted
contribution welcome