diff --git a/.units/unit1/glossary.mdx b/.units/unit1/glossary.mdx index 92396ac..03d440e 100644 --- a/.units/unit1/glossary.mdx +++ b/.units/unit1/glossary.mdx @@ -38,3 +38,14 @@ It implies that the action taken by our agent is conditional solely on the prese ### Value-based Methods: - Another approach to solving RL problems. - Here, instead of training a policy, we train a **value function** that maps each state to the expected value of being in that state. + +Contributions are welcomed :hugs: + +If you want to improve the course, you can [open a Pull Request.](https://github.com/huggingface/deep-rl-class/pulls) + +This glossary was made possible thanks to: + +- [@lucifermorningstar1305](https://github.com/lucifermorningstar1305) +- [@daspartho](https://github.com/daspartho) +- [@misza222](https://github.com/misza222) +