1. First, the Bellman equation
2. Solving with Policy Iteration
3. Solving with Value Iteration
4. Q-Learning
Andrew Ng's Intuitive Explanation of Reinforcement Learning (Part 1)