Schedule
Date | Lecture | Readings | Logistics | |
---|---|---|---|---|
T 02/02 |
Lecture #1
:
Introduction [ slides | video ] |
|
||
Th 02/04 |
Lecture #2
:
Evolutionary methods for policy search [ slides | video ] |
|
||
F 02/05 |
Recitation #1:
Neural Nets, TensorFlow & Keras, OpenAI Gym [ slides | slides 2 | video ] |
|
||
T 02/09 |
Lecture #3
:
Evolutionary methods for policy search (cont.) [ slides | video ] |
|
||
Th 02/11 |
Lecture #4
:
Multi-armed bandits [ slides | video ] |
|
HW1 out |
|
F 02/12 |
Recitation #2:
Gaussian Processes [ slides | video ] |
|||
T 02/16 |
Lecture #5
:
Bayesian Optimization with Gaussian Processes [ slides | video ] |
|
||
Th 02/18 |
Lecture #6
:
Bayesian Optimization with Gaussian Processes (cont.) [ slides | video ] |
|
||
F 02/19 |
Recitation #3:
HW1 [ slides | video ] |
|||
T 02/23 | Break day - No classes | |||
Th 02/25 |
Lecture #7
:
Imitation learning with behavior cloning [ slides | video ] |
|
||
F 02/26 |
Recitation #4:
REC: Review of MDP, Value Functions, Bellman Equation [ slides | video ] |
HW1 due 02/26 11:59pm |
||
T 03/02 |
Lecture #8
:
Markov Decision Processes, Value Iteration, Policy Iteration [ slides | video ] |
|
||
Th 03/04 |
Lecture #9
:
Monte Carlo Learning and Temporal Difference Learning [ slides | video ] |
|
||
F 03/05 |
Recitation #5:
A comparative overview of DP vs. MC vs. TD [ slides | video ] |
HW2 out |
||
T 03/09 |
Lecture #10
:
Monte Carlo Learning and Temporal Difference Learning (cont.) [ slides | video ] |
|
||
Th 03/11 |
Lecture #11
:
Function approximation in prediction and control [ slides | video ] |
|
||
F 03/12 | Quiz 1 (online) | |||
T 03/16 |
Lecture #12
:
Monte Carlo Tree search [ slides | video ] |
|
||
Th 03/18 |
Lecture #13
:
REINFORCE, Actor-Critic methods [ slides | video ] |
|
||
F 03/19 | Mid-semester break - No classes
HW2 due 03/19 11:59pm |
|||
T 03/23 |
Lecture #14
:
REINFORCE, Actor-Critic methods (cont.) [ slides | video ] |
|
||
Th 03/25 |
Lecture #15
:
REINFORCE, Actor-Critic methods (cont.), Natural PG [ slides | slides 2 | video ] |
|
||
F 03/26 |
Recitation #6:
Quiz 1 Review [ slides ] |
|||
T 03/30 |
Lecture #16
:
Natural PG (cont.), Deterministic PG, Re-parametrized PG [ slides | slides 2 | video ] |
|
||
Th 04/01 |
Lecture #17
:
MCTS with prior knowledge [ slides | video ] |
|
||
F 04/02 |
Recitation #7:
Quiz 2 Review [ slides | video ] |
HW3 due 04/02 11:59PM |
||
T 04/06 |
Lecture #18
:
Model-based RL [ slides | video ] |
|
||
Th 04/08 |
Lecture #19
:
Model-based RL II: graph neural networks, iLQR [ slides | video ] |
|
||
F 04/09 | Quiz 2 (online)
HW4 out |
|||
T 04/13 |
Lecture #20
:
MBRL with latent dynamic models [ slides | video ] |
|||
Th 04/15 | No classes | |||
F 04/16 | Spring Carnival - No classes | |||
T 04/20 |
Lecture #21
:
Exploration using curiosity and episodic memory [ slides | video ] |
|
||
Th 04/22 |
Lecture #22
:
Learning from demonstrations and task rewards, off-policy RL, adversarial imitation learning [ slides | video ] |
|||
F 04/23 |
Recitation #8:
Distributional RL [ slides | video ] |
HW4 due 04/23 11:59pm |
||
T 04/27 |
Lecture #23
:
Visual Imitation Learning [ slides | video ] |
|||
Th 04/29 |
Lecture #24
:
Visual Imitation Learning (cont.), Sim2Real transfer [ slides | video ] |
|
||
T 05/04 |
Lecture #25
:
State representation learning for RL [ slides | video ] |
|||
Th 05/06 |
Lecture #26
:
RL and generalization: A closer look to state representations for generalization in model free and model based RL [ slides | video ] |
|
||
F 05/07 |
Recitation #9:
Quiz 3 Review [ slides | video ] |
|||
Mo 05/10 | Quiz 3
8:30AM - 11:30AM |