Mirror of https://github.com/huggingface/deep-rl-class.git (synced 2026-04-13 18:00:45 +08:00)
Update advantage-actor-critic.mdx
@@ -1,4 +1,5 @@
-# Advantage Actor-Critic (A2C)
+# Advantage Actor-Critic (A2C) [[advantage-actor-critic]]
## Reducing variance with Actor-Critic methods
The solution to reducing the variance of the Reinforce algorithm and training our agent faster and better is to use a combination of Policy-Based and Value-Based methods: *the Actor-Critic method*.