From a31043822e7f23d4f55df549b1b9896e2c56f29d Mon Sep 17 00:00:00 2001 From: Juan Martinez <36634572+josejuanmartinez@users.noreply.github.com> Date: Sun, 3 Dec 2023 18:58:37 +0000 Subject: [PATCH] Create quiz for unit 6 --- units/en/unit6/quiz.mdx | 119 ++++++++++++++++++++++++++++++++++++++++ 1 file changed, 119 insertions(+) create mode 100644 units/en/unit6/quiz.mdx diff --git a/units/en/unit6/quiz.mdx b/units/en/unit6/quiz.mdx new file mode 100644 index 0000000..f9832a9 --- /dev/null +++ b/units/en/unit6/quiz.mdx @@ -0,0 +1,119 @@ +# Quiz + +The best way to learn and [to avoid the illusion of competence](https://www.coursera.org/lecture/learning-how-to-learn/illusions-of-competence-BuFzf) **is to test yourself.** This will help you to find **where you need to reinforce your knowledge**. + + +### Q1: What of the following interpretations of bias-variance tradeoff is the most accurate in the field of Reinforcement Learning? + + + +### Q2: Which of the following statements are correct? + + + +### Q3: Which of the following statements are true about Monte-carlo method? + + +### Q4: What is the Advanced Actor-Critic Method (A2C)? +
+Solution + +The idea behind Actor-Critic is the following - we learn two function approximations: +1. A policy that controls how our agent acts (π) +2. A value function to assist the policy update by measuring how good the action taken is (q) + +Actor-Critic, step 2 + +
+ +### Q5: Which of the following statemets are True about the Actor-Critic Method? + + + + +### Q6: What is Advantege in the A2C method? +
+Solution + +Instead of using directly the Action-Value function of the Critic as it is, we calculate an Advantage function, the relative advantage of an action compared to the others possible at a state. +In other words: how taking that action at a state is better compared to the average value of the state + +Advantage in A2C + +
+ +Congrats on finishing this Quiz 🥳, if you missed some elements, take time to read the chapter again to reinforce (😏) your knowledge.