deep-reinforcement-learning issues

Show differences from optimal

1

Show the differences in the mc backjack policy plot from the optimal policy. Just puts some little red X's on the graph which show where your blackjack policy deviates from...

tkharris

calculate fan-in correctly

Fan-in is defined to be the maximum number of inputs to a layer. The weight matrix is transposed. This means that the number of inputs are equal to the second...

dantp-ai

Update REINFORCE.ipynb

* Use gymnasium instead of gym * Reflect interface change of env.reset and env.step * Enable rendering in Jupyter Notebook * Use CartPole-v1

ruddyscent

Update CEM.ipynb

* Use gymnasium==0.29.1 instead of gym * Reflect interface change of env.reset and env.step * Enable rendering in Jupyter Notebook * Remove unused import

ruddyscent

Update Hill_Climbing.ipynb

* Use gymnasium instead of gym * Reflect interface change of env.reset and env.step * Enable rendering in Jupyter Notebook * Use CartPole-v1

ruddyscent

Banana environment throws a timeout on Windows64

1

This issue refers to the Navigation task [here](https://github.com/udacity/deep-reinforcement-learning/tree/master/p1_navigation) This won't work on Windows64, as the environment throws a timeout error and fails to produce the required 'env' object. Refer to...

JasperStolte

torch requirement outdated

4

The requirements.txt file includes torch==0.4.0 This throws an error as this version is not available any longer, also preventing the packages further down the list from being installed. ![image](https://user-images.githubusercontent.com/80971599/194817745-c1fd3a9e-38c9-4288-a551-6d324d6316a5.png)

JasperStolte

deep-reinforcement-learning
deep-reinforcement-learning copied to clipboard

Metadata

Show differences from optimal

calculate fan-in correctly

Update REINFORCE.ipynb

Update CEM.ipynb

Update Hill_Climbing.ipynb

Banana environment throws a timeout on Windows64

torch requirement outdated

← Metadata

Owner

Metadata

deep-reinforcement-learning deep-reinforcement-learning copied to clipboard

Metadata

Show differences from optimal

calculate fan-in correctly

Update REINFORCE.ipynb

Update CEM.ipynb

Update Hill_Climbing.ipynb

Banana environment throws a timeout on Windows64

torch requirement outdated

← Metadata

Owner

Metadata

deep-reinforcement-learning
deep-reinforcement-learning copied to clipboard