Summary of MAP670C - Reinforcement Learning (M2DS) (2022-2023)

MAP670C - Reinforcement Learning (M2DS) (2022-2023)

Syllabus: This 20-hour course provides an introduction to reinforcement learning. It is based on the new edition of the book "Reinforcement Learning: An Introduction" by R. Sutton and A. Barto. Barto (available online at http://incompleteideas.net/book/the-book-2nd.html).

Outline:

Introduction to reinforcement learning and Markov decision processes
The bandit case
Tabular methods: prediction by dynamic programming, Monte Carlo method and TD Learning
Planning and learning for tabular methods
approximate methods: prediction, planning and learning

Grading: Project based on a research article

Teaching coordinator: Le Pennec Erwan