This event is part of Theoretical Advances in Reinforcement Learning and Control View Details

New Directions in Reinforcement Learning and Control

David Rubenstein Forum - 1201 E 60th St, Chicago, IL 60637

Description

Back to top

In recent years, many novel methods and directions have emerged in reinforcement learning and control. A particularly exciting development is the use of online optimization and statistical learning techniques in control theory. This has led to novel methods and guarantees in various contexts, including in stochastic and adversarial environments, system identification, iterative planning and sequence prediction. Other topics we will cover include new connections between control and both model-free and model-based reinforcement learning, as well as learning dynamical systems. We aim to bring together researchers to facilitate progress along these lines of investigation, and discuss important future directions in reinforcement learning, control, learning dynamical systems and applications to sequence prediction.

Poster Session and Lightning Talks

This workshop will include a poster session and lightning talks for early career researchers (including graduate students). In order to propose a poster or a lightning talk, you must first register for the workshop, and then submit a proposal using the form that will become available on this page after you register. You can request to do one, or both. The registration form should not be used to propose a poster or a lightning talk.

The deadline for proposing is Sunday, March 15, 2026. If your proposal is accepted, you should plan to attend the event in-person.

In-Person Registration

Seats are limited at the venue, which means that in-person registration may be capped prior to the workshop start date. If capacity is reached, a waitlist will be imposed, which the registration form will reflect. Early registration is strongly encouraged.

All in-person registrants must wait to receive an invitation to attend in-person from IMSI before traveling, which generally begin to be sent out 4-6 weeks in advance.

All registrants (online and in-person) will receive zoom links and are welcome to attend online.

Organizers

Back to top
E H
Elad Hazan Princeton University
X C
Xinyi Chen Google DeepMind
G B
Gon Buzaglo Princeton University

Speakers

Back to top
J A
Jacob Abernethy Georgia Tech
D B
Drew Bagnell Carnegie Mellon University
N C
Nadav Cohen Tel Aviv University
S D
Sarah Dean Cornell University
F D
Florian Dörfler ETH Zurich
D F
Dylan Foster Microsoft Research
G G
Gautam Goel University of California, Berkeley
N G
Noah Golowich Microsoft Research NYC
B H
Babak Hassibis Caltech
E H
Elad Hazan Princeton University
N L
Na Li Harvard University
A M
Annie Marsden Google DeepMind (GDM)
Z M
Zak Mhammedi Google Research
A O
Alex Olshevsky Boston University (BU)
N O
Necmiye Ozay University of Michigan
M R
Max Raginski University of Illinois Urbana-Champaign (UIUC)
D R
Daniel Russo Columbia University
D S
Dale Schuurmans University of Alberta and Google DeepMind
M S
Max Simchowitz Carnegie Mellon University
S T
Stephen Tu University of Southern California
V T
Vasileios Tzoumas University of Michigan
B V R
Ben Van Roy Stanford University

Schedule

Monday, May 11, 2026
8:30-8:50 CDT
Breakfast/Check-in
8:50-9:00 CDT
Welcome
9:00-9:45 CDT
TBA

Speaker: Elad Hazan (Princeton University)

9:45-10:00 CDT
Q&A
10:00-10:05 CDT
Tech Break
10:05-10:50 CDT
TBA

Speaker: Babak Hassibis (Caltech)

10:50-11:05 CDT
Q&A
11:05-11:35 CDT
Coffee Break
11:35-12:20 CDT
Understanding the foundation model pipeline through coverage

Speaker: Dylan Foster (Microsoft Research)

12:20-12:35 CDT
Q&A
12:35-13:35 CDT
Lunch break
13:35-14:20 CDT
TBA

Speaker: Drew Bagnell (Carnegie Mellon University and Aurora)

14:20-14:35 CDT
Q&A
14:35-15:35 CDT
Lightning Talks
15:40-16:30 CDT
Poster Session & Social Hour
Tuesday, May 12, 2026
8:30-9:00 CDT
Breakfast/Check-in
9:00-9:45 CDT
Sequences of Logits and The Low Rank Structure of Language Models

Speaker: Noah Golowich (Microsoft Research NYC)

9:45-10:00 CDT
Q&A
10:00-10:05 CDT
Tech Break
10:05-10:50 CDT
The Power of Universal Sequence Preconditioning

Speaker: Annie Marsden (Google Deepmind)

10:50-11:05 CDT
Q&A
11:05-11:35 CDT
Coffee Break
11:35-12:20 CDT
Learning Pipelines for Adaptive Control

Speaker: Florian Dorfler (University of Pennsylvania)

12:20-12:35 CDT
Q&A
12:35-13:35 CDT
Lunch break
13:35-14:20 CDT
TBA

Speaker: Vasileios Tzoumas (University of Michigan)

14:20-14:35 CDT
Q&A
14:35-15:00 CDT
Coffee Break
15:00-15:45 CDT
Success Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success

Speaker: Daniel Russo (Columbia University)

15:45-16:00 CDT
Q&A
Wednesday, May 13, 2026
8:30-9:00 CDT
Breakfast/Check-in
9:00-9:45 CDT
Learning and control in the presence of observer effects

Speaker: Sarah Dean (Cornell University)

9:45-10:00 CDT
Q&A
10:00-10:05 CDT
Tech Break
10:05-10:50 CDT
Controlled dynamical systems on the space of probability measures

Speaker: Max Raginsky (University of Illinois at Urbana-Champaign)

10:50-11:05 CDT
Q&A
11:05-11:35 CDT
Coffee Break
11:35-12:20 CDT
Large Language Models and Computation

Speaker: Dale Schuurmans (University of Alberta)

12:20-12:35 CDT
Q&A
12:35-13:35 CDT
Lunch break
13:35-14:20 CDT
Latent Representations for Control Design with Provable Stability and Safety Guarantees

Speaker: Stephen Tu (University of Southern California (USC))

14:20-14:35 CDT
Q&A
14:35-15:00 CDT
Coffee Break
15:00-15:45 CDT
Some fundamental limitations of learning for dynamics and control

Speaker: Necmiye Ozay (University of Michigan)

15:45-16:00 CDT
Q&A
Thursday, May 14, 2026
8:30-9:00 CDT
Breakfast/Check-in
9:00-9:45 CDT
A mathematical basis for Moravec’s paradox

Speaker: Max Simchowitz (Carnegie Mellon University)

9:45-10:00 CDT
Q&A
10:00-10:05 CDT
Tech Break
10:05-10:50 CDT
TBA

Speaker: Zak Mhammedi (Google Reasearch)

10:50-11:05 CDT
Q&A
11:05-11:35 CDT
Coffee Break
11:35-12:20 CDT
Self-Attention for Online Decision-Making and Control

Speaker: Gautam Goel (University of California, Berkeley (UC Berkeley)

12:20-12:35 CDT
Q&A
12:35-13:35 CDT
Lunch break
13:35-14:20 CDT
From Generative Models to Control: Representation-based Reinforcement Learning in Physical Systems

Speaker: Na Li (Harvard)

14:20-14:35 CDT
Q&A
14:35-15:00 CDT
Coffee Break
15:00-15:45 CDT
TBA

Speaker: Jacob Abernethy (Georgia Tech)

15:45-16:00 CDT
Q&A
Friday, May 15, 2026
8:30-9:00 CDT
Breakfast/Check-in
9:00-9:45 CDT
TBA

Speaker: Nadav Cohen (Tel Aviv University)

9:45-10:00 CDT
Q&A
10:00-10:30 CDT
Coffee Break
10:30-11:15 CDT
TBA

Speaker: Ben Van Roy (Stanford University)

11:15-11:30 CDT
Q&A
11:30-12:15 CDT
Can Gradient Descent Beat Ricatti?

Speaker: Alex Olshvesky (Boston University)

12:15-12:30 CDT
Q&A
12:30-12:45 CDT
Workshop Survey and Closing Remarks

Registration

IMSI is committed to making all of our programs and events inclusive and accessible. Contact [email protected] to request disability-related accommodations.

In order to register for this workshop, you must have an IMSI account and be logged in.