Mathematical Foundation of Reinforcement Learning

Published: February 09, 2026

Mathematical Foundation of Reinforcement Learning

Course Website

Course Website

Course Overview

I take this course together with the David Silver’s “Reinforcement Learning” course on Coursera to deepen my understanding of reinforcement learning. This course focuses on the mathematical foundations underlying reinforcement learning techniques.

However, the organization of some chapters in this course differs from that of David Silver’s and this course lacks some important topics of the RL. But I still find it a good complement to David Silver’s course.

My Learning Journey

2026.1.29 L1-Basic concepts
2026.1.29 L2-Bellman equation
2026.1.31 L3-Bellman optimality equation
2026.2.4 L4-Value iteration and policy iteration
2026.2.4 L5-Monte Carlo methods
2026.2.5 L6-Stochastic approximation
2026.2.6 L7-Temporal-Difference Learning
2026.2.7 L8-Value function methods
2026.2.8 L9-Policy gradient methods
2026.2.9 L10-Actor Critic

Kai Cheng

Mathematical Foundation of Reinforcement Learning

Course Website

Course Overview

My Learning Journey