deep-rl-class icon indicating copy to clipboard operation
deep-rl-class copied to clipboard

This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.

Results 132 deep-rl-class issues
Sort by recently updated
recently updated
newest added

The last equal sign in the equation chain should be on a new line. The current format suggests equivalence where there is none, which is misleading. ![image](https://github.com/huggingface/deep-rl-class/assets/586349/28f9414e-2ae9-4129-b6b6-d74b71ed282f)

Hi, In "unit2.ipynb", function "record_video", it seems like there is an issue with "not" and "or" logical operators precedence in the following "while" expression: "while not terminated or truncated" The...

Hi, My remark is about the mathematical notation of Q-Learning pseudocode in unit2.ipynb. I found the following notation a little bit confusing: Q(s,a) + lr [R(s,a) + gamma * max...

Hi thanks for the course! It seems that here a newline is missing. ![image](https://github.com/huggingface/deep-rl-class/assets/5236035/988e67e4-f890-42be-8357-efeea962bc79)

# Pull Request Description **Issue**: The chapter had issues related to wording structure and redundancy. **Proposed Changes**: - Improved the wording for better clarity and readability. - Removed duplicated information...

Translated unit0 and a part of unit1 in Korean.

Hi! Let's bring the reinforcement learning course to all the Korean-speaking community 🌏 (currently 9 out of 77 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/transformers/blob/main/docs/TRANSLATING.md)....

Part of https://github.com/huggingface/deep-rl-class/issues/370 Hello @simoninithomas , This PR marks the beginning of our effort to internationalize the Reinforcement Learning Course by adding Korean translations. The main goal of this initiative...

(Updated for clarity) Apologies if I'm wrong, but it seems to me that there are some mathematical issues in [unit 4 "diving deeper..."](https://github.com/huggingface/deep-rl-class/blob/main/units/en/unit4/policy-gradient.mdx?plain=1) as well as in the optional section...

Added a few specific comments for Mac Silicon users as I had a few struggles to be able to run ml-agents on my device.