4 Dynamic Programming

4.1 Policy Evaluation

Iterative policy evaluation

4.2 Policy Improvement

4.3 Policy Iteration

Example 4.2: Jack’s Car Rental

4.4 Value Iteration

Example 4.3: Gambler’s Problem

code

download

4.5 Asynchronous Dynamic Programming

4.6 Generalized Policy Iteration

4.7 Efficiency of Dynamic Programming

4.8 Summay