deep-rl-class issues

[UPDATE] Formula Formatting Issue in pg-theorem

1

The last equal sign in the equation chain should be on a new line. The current format suggests equivalence where there is none, which is misleading. ![image](https://github.com/huggingface/deep-rl-class/assets/586349/28f9414e-2ae9-4129-b6b6-d74b71ed282f)

Ginurx

unit2-record_video-logical operator precedence

Hi, in "unit2.ipynb", function "record_video", there seems to be an issue with the precedence of the "not" and "or" logical operators in the following "while" expression: "while not terminated or truncated" The...

fardinafdideh
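The precedence concern in the "record_video" report above can be illustrated with a minimal sketch. In Python, `not` binds more tightly than `or`, so `not terminated or truncated` is parsed as `(not terminated) or truncated`. The snippet in the report is truncated, so the "intended" form below is an assumption about what the loop was meant to express (stop when the episode is either terminated or truncated); the function names are hypothetical and not taken from the notebook.

```python
def continues_as_written(terminated: bool, truncated: bool) -> bool:
    # Python parses this as: (not terminated) or truncated
    return not terminated or truncated


def continues_as_assumed_intent(terminated: bool, truncated: bool) -> bool:
    # Assumed intent: keep looping only while the episode is
    # neither terminated nor truncated.
    return not (terminated or truncated)


# Compare both conditions over every combination of flags.
for terminated in (False, True):
    for truncated in (False, True):
        print(
            f"terminated={terminated!s:5} truncated={truncated!s:5} "
            f"as_written={continues_as_written(terminated, truncated)!s:5} "
            f"assumed_intent={continues_as_assumed_intent(terminated, truncated)}"
        )
```

The two conditions differ whenever `truncated` is true: as written, the loop would keep running after a truncation, whereas the parenthesized form would stop.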

Q-Learning pseudocode | Mathematical notation

Hi, my remark is about the mathematical notation of the Q-Learning pseudocode in unit2.ipynb. I found the following notation a little bit confusing: Q(s,a) + lr [R(s,a) + gamma * max...

fardinafdideh
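For context on the notation discussed in the report above, here is a minimal sketch of the standard tabular Q-Learning update as it is usually stated in textbooks. It is not the notebook's code and not the reporter's proposed notation (the report is truncated); the function name, the list-of-lists Q-table, and the parameter names are assumptions chosen for illustration.

```python
def q_learning_update(q_table, state, action, reward, next_state, lr, gamma):
    """Apply one tabular Q-Learning update in place and return the new value.

    Standard form:
        Q(s, a) <- Q(s, a) + lr * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    best_next_value = max(q_table[next_state])      # max over actions in s'
    td_target = reward + gamma * best_next_value    # bootstrapped target
    td_error = td_target - q_table[state][action]   # temporal-difference error
    q_table[state][action] += lr * td_error
    return q_table[state][action]


# Hypothetical 2-state, 2-action table used only for this example.
q = [[0.0, 0.0], [1.0, 2.0]]
new_value = q_learning_update(q, state=0, action=1, reward=1.0,
                              next_state=1, lr=0.5, gamma=0.9)
print(new_value)
```

Writing the update as "old value plus learning rate times TD error" makes explicit that the bracketed term includes the subtraction of the current estimate Q(s, a).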

Super tiny fix format

5

Hi, thanks for the course! It seems that a newline is missing here. ![image](https://github.com/huggingface/deep-rl-class/assets/5236035/988e67e4-f890-42be-8357-efeea962bc79)

fzyzcjy

Proposed rewording for Unit 1, Chapter 3: The Reinforcement Learning …

1

# Pull Request Description **Issue**: The chapter had issues related to wording structure and redundancy. **Proposed Changes**: - Improved the wording for better clarity and readability. - Removed duplicated information...

adelaparras

Translate unit0, (partial) unit1 in Korean

Translated unit0 and part of unit1 into Korean.

yeounyi

🌐 [i18n-KO] Translating rl-course to Korean

1

Hi! Let's bring the reinforcement learning course to all the Korean-speaking community 🌏 (currently 9 out of 77 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com/huggingface/transformers/blob/main/docs/TRANSLATING.md)....

wonhyeongseo

feat: initiate Korean i18n effort

1

Part of https://github.com/huggingface/deep-rl-class/issues/370 Hello @simoninithomas , This PR marks the beginning of our effort to internationalize the Reinforcement Learning Course by adding Korean translations. The main goal of this initiative...

wonhyeongseo

Unit4 policy gradient errors

1

(Updated for clarity) Apologies if I'm wrong, but it seems to me that there are some mathematical issues in [unit 4 "diving deeper..."](https://github.com/huggingface/deep-rl-class/blob/main/units/en/unit4/policy-gradient.mdx?plain=1) as well as in the optional section...

dylwil3

Fix issue #518

Added a few comments specific to Apple Silicon Mac users, as I struggled to get ml-agents running on my device.

bpugnaire
