Thomas Tumiel
Thomas Tumiel
In the spirit of Hacktoberfest, submit your favourite Ethereum resources!
Consider renaming: - Code - Tutorials - Learn - Programming To clarify what each one should be about and unite similar topics better.
## Description Added handling of complex observations to `atari_ppo.py`. Closes #353 I also wrote a jax version for the #338 branch (I can put it in another PR when #338...
## Problem Description Would it be useful to add a complex (nested/dictionary) action and obs space variant of the PPO algo? I did this for `minerl` and wondered if it...
```python import torch from mingpt.bpe import BPETokenizer tokenizer = BPETokenizer() print(tokenizer("")) # tensor([[ 27, 91, 437, 1659, 5239, 91, 29]]) print(tokenizer.decode(torch.tensor([50256]))) # '' print(tokenizer(tokenizer.decode(torch.tensor([50256])))) # tensor([[ 27, 91, 437, 1659,...