ECE271C
Download as PDF
ECE 271C - Dynamic Programming and Reinforcement Learning
Electrical and Computer EngineeringCollege of Engineering
Full Course Title
Dynamic Programming and Reinforcement Learning
Instructor Name(s)
Bamieh; Marden
Course Description
This graduate course provides a comprehensive treatment of dynamic programming (DP) as the fundamental framework for staged optimization problems under uncertainty. Core topics include the principle of optimality, Bellman equations, value and policy iteration, deterministic and stochastic dynamic programming, and Markov decision processes. Applications span linear-quadratic optimal control, inventory control, asset management, hypothesis testing, the Viterbi algorithm, among others. The course concludes with modern reinforcement learning topics including Q-learning, temporal difference learning, and value function approximation, demonstrating how classical DP principles form the theoretical foundation for contemporary AI algorithms.
Unit Value
4
Maximum number of times course can be repeated for additional credit
99
Maximum Units
99
Prerequisites
Advisory Enrollment Comments
ECE 271C and ME 254 are cross-listed and are the same course.