Lecture 1, 2025, Course overview_ RL and DP, AlphaZero, deterministic DP, exampl
Lecture 2, 2025, Stochastic finite and infinite horizon DP, approximation in val
Lecture 3, 2025, LQ Problems, Approximation in Value Space, VI, and PI, Newtons
Lecture 4, 2025, POMDP, Systems with Changing Parameters, Adaptive Control, Mode
Lecture 5, 2025, Deterministic Rollout and Animations.zh_en
Lecture 6, 2025, Multistep Approximation in Value Space, Constrained Rollout, Mu
Lecture 7, 2025, Case studies_ Multi-robot warehouse, data association.zh_en
Lecture 8, 2025; GPT, HMM, and Markov chains_ Rollout variants for most likely s
Abstract Dynamic Programming, Reinforcement Learning, Newtons Method, and Gradi
Lecture 11, 2025; Adversarial Problems, Minimax Rollout, Use of MPC Methods, Com
Lecture 12, 2025; Training of cost functions, approximation in policy space, pol