InstaDeep Ltd
InstaDeep Ltd
Mava
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
tunbert
TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA...
catx
🐈⬛ Contextual bandits library for continuous action trees with smoothing in JAX
awesome-marl
A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers
fastpbrl
Vectorization techniques for fast population-based training.
jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
manyfold
🧬 ManyFold: An efficient and flexible library for training and validating protein folding models
poppy
:hibiscus: Population-Based Reinforcement Learning for Combinatorial Optimization
flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX