Date Lecture Readings Logistics
M 08/25 Lecture #1 :
Welcome and Introduction to the Class
[ slides | slides 2 ]

W 08/27 Lecture #2 :
Introduction to Reinforcement Learning
[ slides ]

F 08/29 Recitation #1:
Neural Nets, PyTorch, Gymnasium
[ slides | notes ]

M 09/01 No Class, Labor Day

W 09/03 Lecture #3 :
Policy Gradient Methods
[ slides ]

HW1 out

F 09/05 Recitation #2:
MDPs, Policy Gradients, & HW1
[ slides ]

M 09/08 Lecture #4 :
Actor-Critic Methods (cont.) Evolutionary Methods for Policy Search
[ slides | slides 2 ]

W 09/10 Lecture #5 :
Value-based Methods
[ slides | slides 2 ]

F 09/12 Recitation #3:
Value-Based Methods and Actor-Critic Methods
[ slides ]

HW1 Due, HW2 out

M 09/15 Lecture #6 :
Value based methods (Cont.)
[ slides | slides 2 ]

W 09/17 Lecture #7 :
Advanced Policy Gradient Methods
[ slides ]

F 09/19 Recitation #4:
Lecture 7b Advanced Policy Gradient Methods (Cont.)
[ slides | slides 2 ]

HW2 Due,
Video Recitation HW3

M 09/22 Lecture #8 :
Advanced Policy Gradient Methods (Cont.)
[ slides | slides 2 ]

HW3 Out

W 09/24 Lecture #9 :
Advanced Policy Gradient Methods (Cont.) MaxEntropy RL
[ slides | slides 2 ]

F 09/26 Recitation #5:
Midterm Review and HW4
[ slides | slides 2 | video ]

M 09/29 Lecture #10 :
Imitation Learning
[ slides ]

T 09/30

Project Description Out (tentative)

W 10/01 Lecture #11 :
Imitation Learning(Cont.)
[ slides ]

HW4 Out

F 10/03 Midterm

M 10/06 Lecture #12 :
Model-based Methods
[ slides ]

W 10/08 Lecture #13 :
Model-based Methods(Cont.)
[ slides ]

HW3 Due

F 10/10 Lecture #14 :
Model-based Methods(Cont.), Learning and Tree-Search Planning
[ slides ]

F 10/10 Recitation #6:
IL Diffusion Policies and HW5 (Video)
[ slides ]

S 10/12

HW4 Due, HW5 Out

M 10/13 Fall Break - No Classes

W 10/15 Fall Break - No Classes

F 10/17 Fall Break - No Classes

M 10/20 Lecture #15 :
Offline RL
[ slides ]

W 10/22 Lecture #16 :
Offline RL (Cont.)
[ slides ]

F 10/24 Recitation #7:
Midterm Solution Session
[ slides ]

Project Team Formation and Project Proposal Due (tentative)

M 10/27 Lecture #17 :
Model-based Methods for offline RL
[ slides ]

HW5 Due, HW6 Out (tentative)

W 10/29 Lecture #18 :
Exploration
[ slides ]

F 10/31 Recitation #8:
Offline RL and HW6
[ slides ]

M 11/03 Lecture #19 :
Exploration (Cont.)
[ slides ]

W 11/05 Lecture #20 :
Sim2Real Learning
[ slides ]

F 11/07 Recitation #9:
Exploration
[ slides ]

M 11/10 Lecture #21 :
Sim2Real Learning (Cont.)
[ slides ]

HW6 Due, HW7 Out (tentative)

W 11/12 Lecture #22 :
RL with Foundation Models
[ slides ]

F 11/14 Recitation #10:
Sim2Real and HW7
[ slides ]

Project Midway Report Due (tentative)

M 11/17 Lecture #23 :
RL with Foundation Models (Cont.)
[ slides ]

W 11/19 Lecture #24 :
Generative Models for RL
[ slides ]

F 11/21 Recitation #11:
RL with Foundation Models
[ slides ]

M 11/24 Lecture #25 :
Generative Models for RL (Cont.)
[ slides ]

HW7 Due (tentative)

W 11/26 Thanksgiving Break - No Classes

F 11/28 Thanksgiving Break - No Recitation

M 12/01 Lecture #26 :
Guest Lecture
[ slides ]

W 12/03 Lecture #27 :
Course Review
[ slides ]

F 12/05 Recitation #12:
Generative Models for RL
[ slides ]

M 12/08 Project Final Presentation, 8:30AM-11:30AM

Project Final Report due