mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-14 02:11:17 +08:00
Added: Contributions
This commit is contained in:
@@ -38,3 +38,14 @@ It implies that the action taken by our agent is conditional solely on the prese
|
||||
### Value-based Methods:
|
||||
- Another approach to solving RL problems.
|
||||
- Here, instead of training a policy, we train a **value function** that maps each state to the expected value of being in that state.
|
||||
|
||||
Contributions are welcomed :hugs:
|
||||
|
||||
If you want to improve the course, you can [open a Pull Request.](https://github.com/huggingface/deep-rl-class/pulls)
|
||||
|
||||
This glossary was made possible thanks to:
|
||||
|
||||
- [@lucifermorningstar1305](https://github.com/lucifermorningstar1305)
|
||||
- [@daspartho](https://github.com/daspartho)
|
||||
- [@misza222](https://github.com/misza222)
|
||||
|
||||
|
||||
Reference in New Issue
Block a user