M 01/13 |
Lecture #1
:
Introduction to Reinforcement and Representation Learning
[
slides
]
|
|
|
W 01/15 |
Lecture #2
:
Multi-armed Bandits
[
slides
]
|
|
|
F 01/17 |
Recitation #1:
Neural Nets, PyTorch, OpenAI Gym, Bandits
[
slides
]
|
|
|
M 01/20 |
No Class, MLK Jr Day
|
W 01/22 |
Lecture #3
:
Value-based Methods
[
slides
]
|
|
|
F 01/24 |
Recitation #2:
Bandits, MDPs
[
slides
]
|
|
|
M 01/27 |
Lecture #4
:
Value-based Methods (cont.)
[
slides
]
|
|
|
W 01/29 |
Lecture #5
:
Value based methods cont. (DQN, MCTS)
[
slides
| slides 2
]
|
|
|
F 01/31 |
No Recitation
|
M 02/03 |
Lecture #6
:
Actor-Critic Methods
[
slides
]
|
|
HW1 out (tentative)
|
W 02/05 |
Recitation #3:
HW1
[
slides
]
|
|
|
F 02/07 |
Lecture #7
:
Actor Critic Methods (cont.)
[
slides
]
|
|
|
M 02/10 |
Lecture #8
:
Trust Region Methods
[
slides
]
|
|
|
W 02/12 |
Lecture #9
:
Trust Region methods
[
slides
]
|
|
|
F 02/14 |
Recitation #4:
Quiz 1 Review
[
slides
]
|
|
|
M 02/17 |
Lecture #10
:
Trust Region Methods
[
slides
]
|
|
HW1 due 11:59PM
|
W 02/19 |
Lecture #11
:
Behavior Cloning, Generative Adversarial Imitation Learning
[
slides
]
|
|
|
F 02/21 |
Quiz 1
|
M 02/24 |
Lecture #12
:
Multimodel Policies, Diffusion Policies
[
slides
]
|
|
|
W 02/26 |
Lecture #13
:
Diffusion Policies (cont.) Evolutionary Methods for Policy Search
[
slides
]
|
|
HW2 out (tentative)
|
F 02/28 |
Recitation #5:
Solutions to Quiz 1
[
slides
]
|
|
|
M 03/03 |
Spring Break - No Classes
|
W 03/05 |
Spring Break - No Classes
|
F 03/07 |
Spring Break - No Classes
|
M 03/10 |
Lecture #14
:
Maximum Entropy RL, SAC, DDPG
[
slides
]
|
|
|
W 03/12 |
Lecture #15
:
Maximum Entropy RL, SAC, DDPG
[
slides
]
|
|
HW2 due Thursday 3/13 11:59PM
|
F 03/14 |
Recitation #6:
Diffusion policies (cont.)
[
slides
]
|
|
|
M 03/17 |
Lecture #16
:
Introduction to Model-Based Reinforcement Learning
[
slides
]
|
|
|
W 03/19 |
Lecture #17
:
AlphaGo, AlphaGoZero, AlphaZero
[
slides
]
|
|
HW3 out
|
F 03/21 |
Recitation #7:
HW3
[
slides
]
|
|
|
M 03/24 |
Lecture #18
:
MBRL from sensory input
[
slides
]
|
|
|
W 03/26 |
Lecture #19
:
MBRL (cont.)
[
slides
]
|
|
|
F 03/28 |
Lecture #20
:
Visual Imitation / Quiz 2 Review
[
slides
]
|
|
|
M 03/31 |
Lecture #21
:
Multigoal Reinforcement Learning, MBRL with multimodal dynamics
[
slides
| slides 2
]
|
|
|
W 04/02 |
Quiz 2
|
F 04/04 |
No Class, Spring Carnvial
|
M 04/07 |
Lecture #22
:
Offline RL 1: going beyond imitation, problem statement, challenges in doing offline RL, policy gradient methods / policy constraints
[
slides
]
|
|
|
W 04/09 |
Lecture #23
:
Offline RL 2: conservative methods, model-based approaches, modern model-free algorithms
[
slides
]
|
|
HW4 out HW3 due 11:59pm
|
F 04/11 |
Recitation #8:
HW 4
[
slides
]
|
|
|
M 04/14 |
Lecture #24
:
Intelligent Exploration
[
slides
]
|
|
|
W 04/16 |
Lecture #25
:
Intelligent Exploration (cont.), Sim2Real Policy Learning
[
slides
| slides 2
]
|
|
|
F 04/18 |
Recitation #9:
Sim2Real Policy Learning (cont.), Quiz 2 Solutions
[
slides
]
|
|
|
M 04/21 |
Lecture #26
:
Foundation Models for RL
[
slides
]
|
|