Costa Huang issues

Results 96 issues of


                                            Costa Huang

Modify the `MultiDiscrete` action space definition

I just realized we can use a multi-dimensional array as the `nvec` for gym's MultiDiscrete action space. We should re-define the action space to have shape `(h, w, 7)` as...

Weird issue with `peewee`

I seem to have encountered a weird issue with `peewee` when running https://github.com/vwxyzjn/gym-microrts/blob/oai5/experiments/ppo_gridnet_oai5.py: ``` ne 3160, in execute_sql cursor.execute(sql, params or ()) peewee.OperationalError: database is locked exception calling callback for...

Add random selfplay

Continue from #35

Better Trueskill evaluation

Continuing the thread from #43 here because #43 is closed. @kachayev mentioned > Note sure this is the most convenient avenue for the discussion (let me know if you want...

Add Pettingzoo Bindings

**TLDR**: [Petting Zoo](https://www.pettingzoo.ml/) has become the standard library for getting multi-agent environments & we want to support Petting Zoo's bindings in gym-microrts. This project https://github.com/vwxyzjn/gym-microrts is an RL environment for...

Faster Convergence

Training an agent now still takes a long time. The particular [experiment](https://wandb.ai/costa-huang/gym-microrts/runs/2v658xqx) in #36 took 4d 9h 11m 14s to finish. Looking at the reward chart, it appears the agent...

Reproduce Gridnet's SOTA agent with Trueskill Evaluation

Now that we are trying to get the self-play agent working, it's important to set baselines that we want to achieve and excel. Our best past experiment is [this](https://wandb.ai/vwxyzjn/gym-microrts-paper/runs/asrpz468) (which...

TUEplots Integration / Example

Hey @wookayin, thought you might be interested in this: https://twitter.com/pnkraemer/status/1555187017394298880?s=21&t=xwcG__bRBVursgt_roUq0w https://github.com/pnkraemer/tueplots

new feature

plotting

Poetry 1.2

## Description Adopting poetry 1.2 support. The locking and adding dependencies are orders of magnitude faster! ## Types of changes - [ ] Bug fix - [ ] New feature...

JAX Integration with CleanRL

## Problem Description Given the incredible performance of the DDPG + JAX prototype (https://github.com/vwxyzjn/cleanrl/pull/187), it's worth prototyping JAX with other algorithms as well! This issue tracks the overall progress of...