Tutorial on Offline RL Theory

This was part of Reinforcement Learning Bootcamp

Nan Jiang, University of Illinois at Urbana-Champaign

Monday, March 9, 2026

Abstract: This tutorial will provide an introduction to the core ideas and results in offline RL theory, focusing on the setting of large state spaces and function approximation. Tentatively, the first part of the tutorial will establish the analyses of classic algorithms under the key assumptions on function approximation (such as Bellman-completeness) and the data distribution (i.e., coverage). The second part considers more advanced algorithms and analyses that rely on weaker or alternative assumptions, and extensions to novel settings beyond the standard single-agent MDPs. Participants are expected to be familiar with the theoretical foundation of MDPs (e.g., classic convergence analysis for tabular value iteration/policy iteration) and the basics of learning theory (e.g., concentration inequalities and union bound).