Date Lecture Readings Logistics
T 02/02 Lecture #1 :
Introduction
[ slides | video ]

Th 02/04 Lecture #2 :
Evolutionary methods for policy search
[ slides | video ]

F 02/05 Recitation #1:
Neural Nets, TensorFlow & Keras, OpenAI Gym
[ slides | slides 2 | video ]

T 02/09 Lecture #3 :
Evolutionary methods for policy search (cont.)
[ slides | video ]

Th 02/11 Lecture #4 :
Multi-armed bandits
[ slides | video ]

HW1 out

F 02/12 Recitation #2:
Gaussian Processes
[ slides | video ]

T 02/16 Lecture #5 :
Bayesian Optimization with Gaussian Processes
[ slides | video ]

Th 02/18 Lecture #6 :
Bayesian Optimization with Gaussian Processes (cont.)
[ slides | video ]

F 02/19 Recitation #3:
HW1
[ slides | video ]

T 02/23 Break day - No classes

Th 02/25 Lecture #7 :
Imitation learning with behavior cloning
[ slides | video ]

F 02/26 Recitation #4:
REC: Review of MDP, Value Functions, Bellman Equation
[ slides | video ]

HW1 due 02/26 11:59pm

T 03/02 Lecture #8 :
Markov Decision Processes, Value Iteration, Policy Iteration
[ slides | video ]

Th 03/04 Lecture #9 :
Monte Carlo Learning and Temporal Difference Learning
[ slides | video ]

F 03/05 Recitation #5:
A comparative overview of DP vs. MC vs. TD
[ slides | video ]

HW2 out

T 03/09 Lecture #10 :
Monte Carlo Learning and Temporal Difference Learning (cont.)
[ slides | video ]

Th 03/11 Lecture #11 :
Function approximation in prediction and control
[ slides | video ]

F 03/12 Quiz 1 (online)

T 03/16 Lecture #12 :
Monte Carlo Tree search
[ slides | video ]

Th 03/18 Lecture #13 :
REINFORCE, Actor-Critic methods
[ slides | video ]

F 03/19 Mid-semester break - No classes

HW2 due 03/19 11:59pm
HW3 out

T 03/23 Lecture #14 :
REINFORCE, Actor-Critic methods (cont.)
[ slides | video ]

Th 03/25 Lecture #15 :
REINFORCE, Actor-Critic methods (cont.), Natural PG
[ slides | slides 2 | video ]

F 03/26 Recitation #6:
Quiz 1 Review
[ slides ]

T 03/30 Lecture #16 :
Natural PG (cont.), Deterministic PG, Re-parametrized PG
[ slides | slides 2 | video ]

Th 04/01 Lecture #17 :
MCTS with prior knowledge
[ slides | video ]

F 04/02 Recitation #7:
Quiz 2 Review
[ slides | video ]

HW3 due 04/02 11:59PM

T 04/06 Lecture #18 :
Model-based RL
[ slides | video ]

Th 04/08 Lecture #19 :
Model-based RL II: graph neural networks, iLQR
[ slides | video ]

F 04/09 Quiz 2 (online)

HW4 out

T 04/13 Lecture #20 :
MBRL with latent dynamic models
[ slides | video ]

Th 04/15 No classes

F 04/16 Spring Carnival - No classes

T 04/20 Lecture #21 :
Exploration using curiosity and episodic memory
[ slides | video ]

Th 04/22 Lecture #22 :
Learning from demonstrations and task rewards, off-policy RL, adversarial imitation learning
[ slides | video ]

F 04/23 Recitation #8:
Distributional RL
[ slides | video ]

HW4 due 04/23 11:59pm

T 04/27 Lecture #23 :
Visual Imitation Learning
[ slides | video ]

Th 04/29 Lecture #24 :
Visual Imitation Learning (cont.), Sim2Real transfer
[ slides | video ]

T 05/04 Lecture #25 :
State representation learning for RL
[ slides | video ]

Th 05/06 Lecture #26 :
RL and generalization: A closer look to state representations for generalization in model free and model based RL
[ slides | video ]

F 05/07 Recitation #9:
Quiz 3 Review
[ slides | video ]

Mo 05/10 Quiz 3

8:30AM - 11:30AM