Jump to content

ELL729

From IITD Wiki
Revision as of 10:03, 4 March 2026 by Prashantt492 (talk | contribs) (Creating course page via bot)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
ELL729
Stochastic Control and Reinforcement Learning
Credits 3
Structure 3-0-0
Pre-requisites
Overlaps

ELL729 : Stochastic Control and Reinforcement Learning

[edit]

Basics of dynamic programming, Finite horizon MDP with quadratic cost, Optimal stopping problems, Partially observable MDP, Infinite horizon discounted cost problems, Stochastic shortest path problems, Undiscounted cost problems, Average cost problems, Semi-Markov decision process, Constrained MDP, Basics of stochastic approximation, Kiefer-Wolfowitz algorithm, Simultaneous perturbation stochastic approximation, Q learning and its convergence analysis.