pgx issues

3

PGX on TPUs seems to be slower than CPUs. With a TPU v3-8, PGX is only achieving 1638 steps / sec on the game of chess. **Minimal Reproducible Example** PGX...

wtedw

Handling of truncated trajectories in AlphaZero training example

### Problem Description In examples/alphazero/train.py, we compute `value_mask` as follows: https://github.com/sotetsuk/pgx/blob/87278d2d6e677fd87248c457207b59cfa42e578d/examples/alphazero/train.py#L179 The purpose is to avoid updating the critic network on incomplete trajectories, as is evident by masking of value...

shehper

add go_13x13.

added go_13x13 env,. did not add hosi_pos and tests.

ahe6

[Shogi] Update the explanation of the action space in the document.

素晴らしいライブラリをご開発いただき、ありがとうございます。いつも楽しんで使わせていただいております。 ## 提案以下の通り、shogi ドキュメントの action の direction の説明を、「`direction` **from** which the piece moves and」から「`direction` **to** which the piece moves and」に変更することを提案します。 ### 変更前 ``` There are `2187 = 81 x...

KazukiOhta

Observation in Kuhn Poker

1

Hi, shouldn't the first observation in Kuhn Poker have all zeros in the betting part of the vector? For instance, if player 1 gets a Q, then the observation should...

axelbr

Change examples away from Haiku

3

A bit later than I meant to, but this addresses https://github.com/sotetsuk/pgx/issues/1059. Pretty straightforward change to equinox style NNs. Some minor speed differences that could be optimized (see https://github.com/patrick-kidger/equinox/issues/928, https://github.com/patrick-kidger/equinox/issues/926), but...

lockwo

pgx
pgx copied to clipboard

Metadata

Make baseline models unbatched

Use NamedTuple instead of dataclass

Low performance on TPUs

Update animal shogi baseline

Untrack test svgs

Handling of truncated trajectories in AlphaZero training example

add go_13x13.

[Shogi] Update the explanation of the action space in the document.

Observation in Kuhn Poker

Change examples away from Haiku

← Metadata

Owner

Metadata

pgx pgx copied to clipboard

Metadata

← Metadata

Owner

Metadata

pgx
pgx copied to clipboard