deep-rl-class
deep-rl-class copied to clipboard
๐ [i18n-KO] Translating rl-course to Korean
Hi!
Let's bring the reinforcement learning course to all the Korean-speaking community ๐ (currently 9 out of 77 complete)
Would you want to translate? Please follow the ๐ค TRANSLATING guide. Here is a list of the files ready for translation. Let us know in this issue if you'd like to translate any, and we'll add your name to the list.
Some notes:
- Please translate using an informal tone (imagine you are talking with a friend about transformers ๐ค).
- Please translate in a gender-neutral way.
- Add your translations to the folder called
ko
inside the source folder. - Register your translation in
ko/_toctree.yml
; please follow the order of the English version. - Once you're finished, open a pull request and tag this issue by including #issue-number in the description, where issue-number is the number of this issue. Please ping @simoninithomas for review.
์๋ ํ์ธ์!
ํ๊ตญ์ด๋ฅผ ์ฌ์ฉํ๋ ๋ชจ๋๊ฐ ๊ฐํํ์ต ์ฝ์ค๋ฅผ ์ฝ์ ์ ์๊ฒ ํด๋ณด์์ ๐
๋ฒ์ญ์ ์ฐธ์ฌํ๊ณ ์ถ์ผ์ ๊ฐ์? ๐ค ๋ฒ์ญ ๊ฐ์ด๋๋ฅผ ๋จผ์ ์ฝ์ด๋ณด์๊ธฐ ๋ฐ๋๋๋ค. ๋ ๋ถ๋ถ์ ๋ฒ์ญํด์ผํ ํ์ผ๋ค์ด ๋์ด๋์ด ์์ต๋๋ค. ์์
ํ๊ณ ๊ณ์ ํ์ผ์ด ์๋ค๋ฉด ์ฌ๊ธฐ์ ๊ฐ๋จํ ์๋ ค์ฃผ์ธ์. ์ค๋ณต๋์ง ์๋๋ก ์์
์ค
์ผ๋ก ํ์ํด๋๊ฒ์.
์ฐธ๊ณ ์ฌํญ:
- ๊ธฐ์ ๋ฌธ์์ด์ง๋ง (์น๊ตฌ์๊ฒ ์ค๋ช ๋ฃ๋ฏ์ด) ์ฝ๊ฒ ์ฝํ๋ฉด ์ข๊ฒ ์ต๋๋ค. ์กด๋๋ง ๋ก ์จ์ฃผ์๋ฉด ๊ฐ์ฌํ๊ฒ ์ต๋๋ค.
- ์ฑ๋ณ์ ์ผ๋ถ ์ธ์ด(์คํ์ธ์ด, ํ๋์ค์ด ๋ฑ)์๋ง ์ ์ฉ๋๋ ์ฌํญ์ผ๋ก, ํ๊ตญ์ด์ ๊ฒฝ์ฐ ๋ฒ์ญ๊ธฐ๋ฅผ ์ฌ์ฉํ์ ํ ๋ฌธ์ฅ ๊ธฐํธ์ ์กฐ์ฌ ๋ฑ์ด ์๋ง๋์ง ํ์ธํด์ฃผ์๊ธฐ ๋ฐ๋๋๋ค.
-
์์ค ํด๋ ์๋
ko
ํด๋์ ๋ฒ์ญ๋ณธ์ ๋ฃ์ด์ฃผ์ธ์. - ๋ชฉ์ฐจ(
ko/_toctree.yml
)๋ ํจ๊ป ์ ๋ฐ์ดํธํด์ฃผ์ธ์. ์์ด ๋ชฉ์ฐจ์ ์์๊ฐ ๋์ผํด์ผ ํฉ๋๋ค. - ๋ชจ๋ ๋ง์น์ จ๋ค๋ฉด, ๊ธฐ๋ก์ด ์ํํ๋๋ก PR์ ์ฌ์ค ๋ ํ์ฌ ์ด์(``)๋ฅผ ๋ด์ฉ์ ๋ฃ์ด์ฃผ์๊ธฐ ๋ฐ๋๋๋ค. ๋ฆฌ๋ทฐ ์์ฒญ์ @simoninithomas ๋๊ป ์์ฒญํด์ฃผ์ธ์.
- ๐ ์ปค๋ฎค๋ํฐ์ ๋ง์๊ป ํ๋ณดํด์ฃผ์๊ธฐ ๋ฐ๋๋๋ค! ๐ค ํฌ๋ผ์ ์ฌ๋ฆฌ์ ๋ ์ข์์.
- [ ] Unit 0. Welcome to the course
- [ ] Welcome to the course ๐ค
- [ ] Setup
- [ ] Discord 101
- [ ] Unit 1. Introduction to Deep Reinforcement Learning
- [ ] Introduction
- [ ] What is Reinforcement Learning?
- [ ] The Reinforcement Learning Framework
- [ ] The type of tasks
- [ ] The Exploration/ Exploitation tradeoff
- [ ] The two main approaches for solving RL problems
- [ ] The โDeepโ in Deep Reinforcement Learning
- [ ] Summary
- [ ] Glossary
- [ ] Hands-on
- [ ] Quiz
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Bonus Unit 1. Introduction to Deep Reinforcement Learning with Huggy
- [ ] Introduction
- [ ] How Huggy works?
- [ ] Train Huggy
- [ ] Play with Huggy
- [ ] Conclusion
- [ ] Live 1. How the course work, Q&A, and playing with Huggy
- [ ] Live 1. How the course work, Q&A, and playing with Huggy ๐ถ
- [ ] Unit 2. Introduction to Q-Learning
- [ ] Introduction
- [ ] What is RL? A short recap
- [ ] The two types of value-based methods
- [ ] The Bellman Equation, simplify our value estimation
- [ ] Monte Carlo vs Temporal Difference Learning
- [ ] Mid-way Recap
- [ ] Mid-way Quiz
- [ ] Introducing Q-Learning
- [ ] A Q-Learning example
- [ ] Q-Learning Recap
- [ ] Glossary
- [ ] Hands-on
- [ ] Q-Learning Quiz
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Unit 3. Deep Q-Learning with Atari Games
- [ ] Introduction
- [ ] From Q-Learning to Deep Q-Learning
- [ ] The Deep Q-Network (DQN)
- [ ] The Deep Q Algorithm
- [ ] Glossary
- [ ] Hands-on
- [ ] Quiz
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Bonus Unit 2. Automatic Hyperparameter Tuning with Optuna
- [ ] Introduction
- [ ] Optuna
- [ ] Hands-on
- [ ] Unit 4. Policy Gradient with PyTorch
- [ ] Introduction
- [ ] What are the policy-based methods?
- [ ] The advantages and disadvantages of policy-gradient methods
- [ ] Diving deeper into policy-gradient
- [ ] (Optional) the Policy Gradient Theorem
- [ ] Hands-on
- [ ] Quiz
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Unit 5. Introduction to Unity ML-Agents
- [ ] Introduction
- [ ] How ML-Agents works?
- [ ] The SnowballTarget environment
- [ ] The Pyramids environment
- [ ] (Optional) What is curiosity in Deep Reinforcement Learning?
- [ ] Hands-on
- [ ] Bonus. Learn to create your own environments with Unity and MLAgents
- [ ] Conclusion
- [ ] Unit 6. Actor Critic methods with Robotics environments
- [ ] Introduction
- [ ] The Problem of Variance in Reinforce
- [ ] Advantage Actor Critic (A2C)
- [ ] Hands-on: Advantage Actor Critic (A2C) using Robotics Simulations with PyBullet and Panda-Gym ๐ค
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Unit 7. Introduction to Multi-Agents and AI vs AI
- [ ] Introduction
- [ ] An introduction to Multi-Agents Reinforcement Learning (MARL)
- [ ] Designing Multi-Agents systems
- [ ] Self-Play
- [ ] Hands-on: Let's train our soccer team to beat your classmates' teams (AI vs. AI)
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Unit 8. Part 1 Proximal Policy Optimization (PPO)
- [ ] Introduction
- [ ] The intuition behind PPO
- [ ] Introducing the Clipped Surrogate Objective Function
- [ ] Visualize the Clipped Surrogate Objective Function
- [ ] PPO with CleanRL
- [ ] Conclusion
- [ ] Additional Readings
- [ ] Unit 8. Part 2 Proximal Policy Optimization (PPO) with Doom
- [ ] Introduction
- [ ] PPO with Sample Factory and Doom
- [ ] Conclusion
- [ ] Bonus Unit 3. Advanced Topics in Reinforcement Learning
- [ ] Introduction
- [ ] Model-Based Reinforcement Learning
- [ ] Offline vs. Online Reinforcement Learning
- [ ] Reinforcement Learning from Human Feedback
- [ ] Decision Transformers
Hey there๐ thanks for your work. As mentionned in Discord we don't support other languages for this course for now contrary to the transformer course.
However, what we can do is I can create a moon-ci-docs.huggingface.co link that will allow you to share to people who want to follow the course in Korean (and also see what the course looks like).
Currently there's an error in the Build PR documentation so I can't provide you this link (check the Failing error, from what I see is because you don't have a table of contents
Have a nice day ๐ค