Optimal Control and Planning
How can we make decisions if we know the dynamics of the environment?
Stochastic optimization
Stochastic optimization for open-loop planning:
We wish to choose
Guess and Check
An extremely simple method, that’s parallelizable:
- pick
from some distribution - choose
based on .
Cross-entropy Method (CEM)
- pick
from some initial distribution - Evaluate
- pick the elites
with the highest value - fit distribution $P(A) to the elites
With continuous inputs, a multi-variate normal distribution is a common choice
for
Using Derivatives
- Differentiable Dynamic Programming (DDP)
- LQR