Update units/en/unit2/two-types-value-based-methods.mdx

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2026-04-13 18:00:45 +08:00 · 2022-12-03 11:13:04 +01:00
parent 783beabfe9
commit cdb25393c4
1 changed files with 1 additions and 1 deletions
--- a/units/en/unit2/two-types-value-based-methods.mdx
+++ b/units/en/unit2/two-types-value-based-methods.mdx
@@ -12,7 +12,7 @@ But what does it mean to act according to our policy? After all, we don't have a

 Remember that the goal of an **RL agent is to have an optimal policy π.**

-To find it, we learned that there are two different methods:
+To find the optimal policy, we learned about two different methods:

 - *Policy-based methods:* **Directly train the policy** to select what action to take given a state (or a probability distribution over actions at that state). In this case, we **don't have a value function.**