Control, Inference and Statistical Physics
Path Integral Control Theory
Path integral control is a method for solving stochastic optimal control problems by using a Monte Carlo (MC) approach to approximate the Hamilton-Jacobi-Bellman (HJB) equation. It transforms the value function into an expectation over all possible trajectories (a path integral) and then uses Monte Carlo simulations to find the optimal control. This approach can be applied to control problems with moderate to large dimensions by avoiding the need for a global grid of the HJB equation.
Presentations given by Dr. Bert Kappen:
Control as Variational Inference
Readings
- Todorov(2006) Linearly-solvable Markov decision problems, pdf
- Todorov(2008) General duality between optimal control and estimation, pdf
- Kappen(2009) Optimal control as a graphical model inference problem, pdf
- Ziebart(2010) Modeling Interaction via the Principle of Maximum Causal Entropy, pdf
- Rawlik, Toussaint, Vijayakumar (2013) On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference, pdf
- Nachum, Norouzi, Xu, Schuurmans(2017) Bridging the Gap Between Value and Policy Based Reinforcement Learning, pdf
- Schulman, Chen, Abbeel(2017) Equivalence Between Policy Gradients and Soft Q-Learning, pdf
- Haarnoja, Zhou, Abbeel, Levine (2018) Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, pdf
- Levine(2018) Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review, pdf