ECE271C

Download as PDF

ECE 271C - Dynamic Programming and Reinforcement Learning

Electrical and Computer EngineeringCollege of Engineering

General

Full Course Title

Dynamic Programming and Reinforcement Learning

Instructor Name(s)

Bamieh; Marden

Course Description

This graduate course provides a comprehensive treatment of dynamic programming (DP) as the fundamental framework for staged optimization problems under uncertainty. Core topics include the principle of optimality, Bellman equations, value and policy iteration, deterministic and stochastic dynamic programming, and Markov decision processes. Applications span linear-quadratic optimal control, inventory control, asset management, hypothesis testing, the Viterbi algorithm, among others. The course concludes with modern reinforcement learning topics including Q-learning, temporal difference learning, and value function approximation, demonstrating how classical DP principles form the theoretical foundation for contemporary AI algorithms.

Units Fixed

Unit Value

Maximum number of times course can be repeated for additional credit

Maximum Units

Prerequisites

ECE 230A or ME 243A or equivalent

Advisory Enrollment Comments

ECE 271C and ME 254 are cross-listed and are the same course.