diff --git a/units/en/unit6/quiz.mdx b/units/en/unit6/quiz.mdx new file mode 100644 index 0000000..f9832a9 --- /dev/null +++ b/units/en/unit6/quiz.mdx @@ -0,0 +1,119 @@ +# Quiz + +The best way to learn and [to avoid the illusion of competence](https://www.coursera.org/lecture/learning-how-to-learn/illusions-of-competence-BuFzf) **is to test yourself.** This will help you to find **where you need to reinforce your knowledge**. + + +### Q1: What of the following interpretations of bias-variance tradeoff is the most accurate in the field of Reinforcement Learning? + + + +### Q2: Which of the following statements are correct? + + + +### Q3: Which of the following statements are true about Monte-carlo method? + + +### Q4: What is the Advanced Actor-Critic Method (A2C)? +
+Solution + +The idea behind Actor-Critic is the following - we learn two function approximations: +1. A policy that controls how our agent acts (π) +2. A value function to assist the policy update by measuring how good the action taken is (q) + +Actor-Critic, step 2 + +
+ +### Q5: Which of the following statemets are True about the Actor-Critic Method? + + + + +### Q6: What is Advantege in the A2C method? +
+Solution + +Instead of using directly the Action-Value function of the Critic as it is, we calculate an Advantage function, the relative advantage of an action compared to the others possible at a state. +In other words: how taking that action at a state is better compared to the average value of the state + +Advantage in A2C + +
+ +Congrats on finishing this Quiz 🥳, if you missed some elements, take time to read the chapter again to reinforce (😏) your knowledge.