mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-08 21:30:45 +08:00
Adds newline after ###
This commit is contained in:
@@ -21,6 +21,7 @@ The best way to learn and [to avoid the illusion of competence](https://www.cour
|
||||
/>
|
||||
|
||||
### Q2: Which of the following statements are True, when talking about models with bias and/or variance in RL?
|
||||
|
||||
<Question
|
||||
choices={[
|
||||
{
|
||||
@@ -48,6 +49,7 @@ The best way to learn and [to avoid the illusion of competence](https://www.cour
|
||||
|
||||
|
||||
### Q3: Which of the following statements are true about Monte-carlo method?
|
||||
|
||||
<Question
|
||||
choices={[
|
||||
{
|
||||
@@ -69,6 +71,7 @@ The best way to learn and [to avoid the illusion of competence](https://www.cour
|
||||
/>
|
||||
|
||||
### Q4: What is the Advanced Actor-Critic Method (A2C)?
|
||||
|
||||
<details>
|
||||
<summary>Solution</summary>
|
||||
|
||||
@@ -81,6 +84,7 @@ The idea behind Actor-Critic is that we learn two function approximations:
|
||||
</details>
|
||||
|
||||
### Q5: Which of the following statemets are True about the Actor-Critic Method?
|
||||
|
||||
<Question
|
||||
choices={[
|
||||
{
|
||||
@@ -104,6 +108,7 @@ The idea behind Actor-Critic is that we learn two function approximations:
|
||||
|
||||
|
||||
### Q6: What is `Advantage` in the A2C method?
|
||||
|
||||
<details>
|
||||
<summary>Solution</summary>
|
||||
|
||||
|
||||
Reference in New Issue
Block a user