Schedule
| Date | Lecture | Readings | Logistics | |
|---|---|---|---|---|
| M 08/25 |
Lecture #1
:
Welcome and Introduction to the Class [ slides | slides 2 ] |
|
||
| W 08/27 |
Lecture #2
:
Introduction to Reinforcement Learning [ slides ] |
|
||
| F 08/29 |
Recitation #1:
Neural Nets, PyTorch, Gymnasium [ slides | notes ] |
|
||
| M 09/01 | No Class, Labor Day | |||
| W 09/03 |
Lecture #3
:
Policy Gradient Methods [ slides ] |
HW1 out |
||
| F 09/05 |
Recitation #2:
MDPs, Policy Gradients, & HW1 [ slides ] |
|||
| M 09/08 |
Lecture #4
:
Actor-Critic Methods (cont.) Evolutionary Methods for Policy Search [ slides | slides 2 ] |
|||
| W 09/10 |
Lecture #5
:
Value-based Methods [ slides | slides 2 ] |
|
||
| F 09/12 |
Recitation #3:
Value-Based Methods and Actor-Critic Methods [ slides ] |
HW1 Due, HW2 out |
||
| M 09/15 |
Lecture #6
:
Value based methods (Cont.) [ slides | slides 2 ] |
|
||
| W 09/17 |
Lecture #7
:
Advanced Policy Gradient Methods [ slides ] |
|||
| F 09/19 |
Recitation #4:
Lecture 7b Advanced Policy Gradient Methods (Cont.) [ slides | slides 2 ] |
HW2 Due, |
||
| M 09/22 |
Lecture #8
:
Advanced Policy Gradient Methods (Cont.) [ slides | slides 2 ] |
HW3 Out |
||
| W 09/24 |
Lecture #9
:
Advanced Policy Gradient Methods (Cont.) MaxEntropy RL [ slides | slides 2 ] |
|||
| F 09/26 |
Recitation #5:
Midterm Review and HW4 [ slides | slides 2 | video ] |
|||
| M 09/29 |
Lecture #10
:
Imitation Learning [ slides ] |
|
||
| T 09/30 |
Project Description Out (tentative) |
|||
| W 10/01 |
Lecture #11
:
Imitation Learning(Cont.) [ slides ] |
|
HW4 Out |
|
| F 10/03 | Midterm | |||
| M 10/06 |
Lecture #12
:
Model-based Methods [ slides ] |
|||
| W 10/08 |
Lecture #13
:
Model-based Methods(Cont.) [ slides ] |
|
HW3 Due |
|
| F 10/10 |
Lecture #14
:
Model-based Methods(Cont.), Learning and Tree-Search Planning [ slides ] |
|
||
| F 10/10 |
Recitation #6:
IL Diffusion Policies and HW5 (Video) [ slides | video ] |
|||
| S 10/12 |
HW4 Due, HW5 Out |
|||
| M 10/13 | Fall Break - No Classes | |||
| W 10/15 | Fall Break - No Classes | |||
| F 10/17 | Fall Break - No Classes | |||
| M 10/20 |
Lecture #15
:
Offline RL [ slides ] |
|
||
| W 10/22 |
Lecture #16
:
Offline RL (Cont.) [ slides ] |
|
||
| F 10/24 |
Recitation #7:
Midterm Solution Session [ slides ] |
Project Team Formation and Project Proposal Due (tentative) |
||
| M 10/27 |
Lecture #17
:
Model-based Methods for offline RL [ slides | slides 2 ] |
|
HW5 Due |
|
| W 10/29 |
Lecture #18
:
Guided Diffusion [ slides ] |
HW6 Out |
||
| F 10/31 |
Recitation #8:
Offline RL and HW6 [ slides ] |
|||
| M 11/03 |
Lecture #19
:
Exploration [ slides ] |
|
||
| W 11/05 |
Lecture #20
:
Sim2Real Learning [ slides ] |
|||
| F 11/07 |
Recitation #9:
Exploration [ slides ] |
|||
| M 11/10 |
Lecture #21
:
Sim2Real Learning (Cont.) [ slides ] |
|||
| W 11/12 |
Lecture #22
:
RL with Foundation Models [ slides ] |
|||
| F 11/14 |
Recitation #10:
Sim2Real and HW7 [ slides ] |
HW6 Due, HW7 Out (tentative), Project Midway Report Due (tentative) |
||
| M 11/17 |
Lecture #23
:
RL with Foundation Models (Cont.) [ slides ] |
|||
| W 11/19 |
Lecture #24
:
Generative Models for RL [ slides ] |
|||
| F 11/21 |
Recitation #11:
RL with Foundation Models [ slides ] |
|||
| M 11/24 |
Lecture #25
:
Generative Models for RL (Cont.) [ slides ] |
|||
| W 11/26 | Thanksgiving Break - No Classes | |||
| F 11/28 | Thanksgiving Break - No Recitation
HW7 Due (tentative) |
|||
| M 12/01 |
Lecture #26
:
Guest Lecture [ slides ] |
|||
| W 12/03 |
Lecture #27
:
Course Review [ slides ] |
|||
| F 12/05 |
Recitation #12:
Generative Models for RL [ slides ] |
|||
| M 12/08 | Project Final Presentation, 8:30AM-11:30AM
Project Final Report due |
|||