Schedule
Date | Lecture | Readings | Logistics | |
---|---|---|---|---|
M 08/25 | Lecture #1: Welcome and Introduction to the Class [ slides | slides 2 ] | | ||
W 08/27 | Lecture #2: Introduction to Reinforcement Learning [ slides ] | | ||
F 08/29 | Recitation #1: Neural Nets, PyTorch, Gymnasium [ slides | notes ] | | ||
M 09/01 | No Class, Labor Day | |||
W 09/03 | Lecture #3: Policy Gradient Methods [ slides ] | HW1 Out (tentative) | ||
F 09/05 | Recitation #2: MDPs, Policy Gradients, & HW1 [ slides ] | |||
M 09/08 | Lecture #4: Actor-Critic Methods [ slides ] | |||
W 09/10 | Lecture #5: Value-based Methods [ slides ] | | ||
F 09/12 | Recitation #3: Value-based Methods and Actor-Critic Methods [ slides ] | HW1 Due, HW2 Out (tentative) | ||
M 09/15 | Lecture #6: Value-based Methods (Cont.) [ slides ] | |||
W 09/17 | Lecture #7: Advanced Policy Gradient Methods [ slides ] | | ||
F 09/19 | Recitation #4: Value-based Methods and Policy Gradient Methods [ slides ] | HW2 Due, HW3 Out (tentative) | ||
M 09/22 | Lecture #8: Advanced Policy Gradient Methods (Cont.) [ slides ] | | ||
W 09/24 | Lecture #9: Model-based Methods [ slides ] | | ||
F 09/26 | Recitation #5: TBD [ slides ] | |||
M 09/29 | Lecture #10: Model-based Methods (Cont.) [ slides ] | | HW3 Due, HW4 Out (tentative) | |
T 09/30 | Lecture #11: [ slides ] | Project Description Out (tentative) | ||
W 10/01 | Lecture #12: Imitation Learning [ slides ] | | ||
F 10/03 | Recitation #6: Midterm [ slides ] | |||
M 10/06 | Lecture #13: Imitation Learning (Cont.) [ slides ] | |||
W 10/08 | Buffer | |||
F 10/10 | Recitation #7: TBD [ slides ] | HW4 Due (tentative), HW5 Out (tentative) | ||
M 10/13 | Fall Break - No Classes | |||
W 10/15 | Fall Break - No Classes | |||
F 10/17 | Fall Break - No Classes | |||
M 10/20 | Buffer | |||
W 10/22 | Lecture #14: Offline RL [ slides ] | | ||
F 10/24 | Recitation #8: TBD [ slides ] | Project Team Formation and Project Proposal Due (tentative) | ||
M 10/27 | Lecture #15: Offline RL (Cont.) [ slides ] | | HW5 Due, HW6 Out (tentative) | |
W 10/29 | Lecture #16: Exploration [ slides ] | | ||
F 10/31 | Recitation #9: TBD [ slides ] | |||
M 11/03 | Lecture #17: Exploration (Cont.) [ slides ] | | ||
W 11/05 | Lecture #18: Sim2Real Learning [ slides ] | |||
F 11/07 | Recitation #10: TBD [ slides ] | |||
M 11/10 | Lecture #19: Sim2Real Learning (Cont.) [ slides ] | HW6 Due, HW7 Out (tentative) | ||
W 11/12 | Lecture #20: RL with Foundation Models [ slides ] | |||
F 11/14 | Recitation #11: TBD [ slides ] | Project Midway Report Due (tentative) | ||
M 11/17 | Lecture #21: RL with Foundation Models (Cont.) [ slides ] | |||
W 11/19 | Lecture #22: Generative Models for RL [ slides ] | |||
F 11/21 | Recitation #12: TBD [ slides ] | |||
M 11/24 | Recitation #13: Generative Models for RL (Cont.) [ slides ] | HW7 Due (tentative) | ||
W 11/26 | Thanksgiving Break - No Classes | |||
F 11/28 | Recitation #14: Thanksgiving Break - No Recitation [ slides ] | |||
M 12/01 | Lecture #23: Guest Lecture [ slides ] | |||
W 12/03 | Lecture #24: Course Review [ slides ] | |||
F 12/05 | Recitation #15: TBD [ slides ] | |||
Finals Week | [ slides ] | Project Final Poster/Report Due |