Costa Huang

Results 96 issues of Costa Huang

## Description This PR closes #265. Had some preliminary results w/ multi-objective stuff, as shown in the following figure. The x-axis is the normalized score of CartPole-v1 and Acrobat-v1, and...

# Overview #228 prototyped a great initial integration with optuna to do hyperparameter optimization. However, it has a couple of downsides: 1. lack of support for tuning multiple environments when...

## Describe the bug Related to https://github.com/sail-sg/envpool/issues/33. When an environment is "done", the autoreset feature in openai/gym' API will reset this environment and return the initial observation from the next...

discussion

## Motivation Preprocessing Atari environments is error-prone. E.g., #195. ## Solution Could we simply this by providing an API like `envpool.make_atari(style="openai/baseline's")` or `envpool.make_atari(style="Machado et al 2017")` Typical settings: ```python #...

enhancement

## Describe the bug The `max_episode_steps` does not count the frame skip steps. ## To Reproduce Steps to reproduce the behavior. Please try to provide a minimal example to reproduce...

Hello, this is a really cool project! Would you be up to making a pypi release so that people can install via `pip install gym-hybrid`? It pins the version which...

Hi, this is an amazing piece of software. I was wondering if there is any way to always run the cluster the `autostop` mode using the configuration from the yaml...

good first issue
Initial-User-Issue
friction-log

Hi @karpathy this is such an awesome repo. I was implementing it from scratch using JAX and learned quite a bit :) This PR proposes a more consistent naming convention....

Hello, this is such a cool project. I was playing with using tinygrad to create some reinforcement learnings scripts, but I ran into a minor issue with using Tensor as...

Hi @wookayin, was wondering if it would be easy to integrate with [Vega-Altair](https://altair-viz.github.io/gallery/index.html#line-charts), which helps export interactive HTML such as the one used in [Tianshou's benchmark](https://tianshou.readthedocs.io/en/master/tutorials/benchmark.html)

new feature
plotting