Schedule
Date | Lecture | Readings | Logistics | |
---|---|---|---|---|
M 08/25 |
Lecture #1
:
Welcome and Introduction to the Class [ slides | slides 2 ] |
|
||
W 08/27 |
Lecture #2
:
Introduction to Reinforcement Learning [ slides ] |
|
||
F 08/29 |
Recitation #1:
Neural Nets, PyTorch, Gymnasium [ slides | notes ] |
|
||
M 09/01 | No Class, Labor Day | |||
W 09/03 |
Lecture #3
:
Policy Gradient Methods [ slides ] |
HW1 out |
||
F 09/05 |
Recitation #2:
MDPs, Policy Gradients, & HW1 [ slides ] |
|||
M 09/08 |
Lecture #4
:
Actor-Critic Methods (cont.) Evolutionary Methods for Policy Search [ slides | slides 2 ] |
|||
W 09/10 |
Lecture #5
:
Value-based Methods [ slides | slides 2 ] |
|
||
F 09/12 |
Recitation #3:
Value-Based Methods and Actor-Critic Methods [ slides ] |
HW1 Due, HW2 out |
||
M 09/15 |
Lecture #6
:
Value based methods (Cont.) [ slides | slides 2 ] |
|
||
W 09/17 |
Lecture #7
:
Advanced Policy Gradient Methods [ slides ] |
|||
F 09/19 |
Recitation #4:
Lecture 7b Advanced Policy Gradient Methods (Cont.) [ slides | slides 2 ] |
HW2 Due, |
||
M 09/22 |
Lecture #8
:
Advanced Policy Gradient Methods (Cont.) [ slides | slides 2 ] |
HW3 Out |
||
W 09/24 |
Lecture #9
:
Advanced Policy Gradient Methods (Cont.) MaxEntropy RL [ slides | slides 2 ] |
|||
F 09/26 |
Recitation #5:
Midterm Review and HW4 [ slides | slides 2 | video ] |
|||
M 09/29 |
Lecture #10
:
Imitation Learning [ slides ] |
|
||
T 09/30 |
Project Description Out (tentative) |
|||
W 10/01 |
Lecture #11
:
Imitation Learning(Cont.) [ slides ] |
|
HW4 Out |
|
F 10/03 | Midterm | |||
M 10/06 |
Lecture #12
:
Model-based Methods [ slides ] |
|||
W 10/08 |
Lecture #13
:
Model-based Methods(Cont.) [ slides ] |
|
HW3 Due |
|
F 10/10 |
Lecture #14
:
Model-based Methods(Cont.), Learning and Tree-Search Planning [ slides ] |
|
||
F 10/10 |
Recitation #6:
IL Diffusion Policies and HW5 (Video) [ slides ] |
|||
S 10/12 |
HW4 Due, HW5 Out |
|||
M 10/13 | Fall Break - No Classes | |||
W 10/15 | Fall Break - No Classes | |||
F 10/17 | Fall Break - No Classes | |||
M 10/20 |
Lecture #15
:
Offline RL [ slides ] |
|
||
W 10/22 |
Lecture #16
:
Offline RL (Cont.) [ slides ] |
|
||
F 10/24 |
Recitation #7:
Midterm Solution Session [ slides ] |
Project Team Formation and Project Proposal Due (tentative) |
||
M 10/27 |
Lecture #17
:
Model-based Methods for offline RL [ slides ] |
|
HW5 Due, HW6 Out (tentative) |
|
W 10/29 |
Lecture #18
:
Exploration [ slides ] |
|
||
F 10/31 |
Recitation #8:
Offline RL and HW6 [ slides ] |
|||
M 11/03 |
Lecture #19
:
Exploration (Cont.) [ slides ] |
|
||
W 11/05 |
Lecture #20
:
Sim2Real Learning [ slides ] |
|||
F 11/07 |
Recitation #9:
Exploration [ slides ] |
|||
M 11/10 |
Lecture #21
:
Sim2Real Learning (Cont.) [ slides ] |
HW6 Due, HW7 Out (tentative) |
||
W 11/12 |
Lecture #22
:
RL with Foundation Models [ slides ] |
|||
F 11/14 |
Recitation #10:
Sim2Real and HW7 [ slides ] |
Project Midway Report Due (tentative) |
||
M 11/17 |
Lecture #23
:
RL with Foundation Models (Cont.) [ slides ] |
|||
W 11/19 |
Lecture #24
:
Generative Models for RL [ slides ] |
|||
F 11/21 |
Recitation #11:
RL with Foundation Models [ slides ] |
|||
M 11/24 |
Lecture #25
:
Generative Models for RL (Cont.) [ slides ] |
HW7 Due (tentative) |
||
W 11/26 | Thanksgiving Break - No Classes | |||
F 11/28 | Thanksgiving Break - No Recitation | |||
M 12/01 |
Lecture #26
:
Guest Lecture [ slides ] |
|||
W 12/03 |
Lecture #27
:
Course Review [ slides ] |
|||
F 12/05 |
Recitation #12:
Generative Models for RL [ slides ] |
|||
M 12/08 | Project Final Presentation, 8:30AM-11:30AM
Project Final Report due |