This event is part of Data-Driven Materials Informatics View Details

Learning Collective Variables and Coarse Grained Models

Description

Back to top

The dynamical behavior of molecular systems of relevance to chemistry, biophysics and materials science can be numerically simulated using deterministic molecular dynamics algorithms or stochastic algorithms such as Langevin dynamics. Although the systems of interest are composed of large numbers of atoms, collective interactions mean that the long-time evolution is in fact typically dictated by the variations of a small number of collective modes, known as collective variables or reaction coordinates. Intense efforts have recently been invested in automating the definition of collective variables from molecular simulation data using a variety of machine learning techniques. A key mathematical question is to characterize the quality of the dimensionality reduction, for instance by a priori or, even better, a posteriori estimates on the error committed by integrating the dynamics associated with the coarse-grained reduced model. Another important issue is how to incorporate various constraints into the discovery process, such as symmetries (permutation, rotation, translation). This workshop will focus on recent advances in the data driven learning and validation of collective modes and their applications in coarse-grained simulations and enhanced sampling.

This workshop will include lightning talks and a poster session for early career researchers (including graduate students). If accepted, you will be asked to do both. In order to propose a lightning session talk and a poster, you must first register for the workshop, and then submit a proposal using the form that will become available on this page after you register. The registration form should not be used to propose a lightning session talk or poster.

The deadline for proposing is April 2, 2024. If your proposal is accepted, you should plan to attend the event in-person.

Organizers

Back to top
A F
Andrew Ferguson University of Chicago
B K
Bettina Keller Freie Universität Berlin
M M
Marina Meila University of Washington
J R
Jutta Rogal New York University
G S
Gabriel Stoltz Ecole des Ponts and Inria Paris

Speakers

Back to top
D A
David Aristoff Colorado State University
M A
Marloes Arts Genmab
P B
Peter Bolhuis University of Amsterdam
K B
Klara Bonneau Freie University Berlin
C C
Christophe Chipot CNRS and Université de Nancy
P C
Pilar Cossio Flatiron Institute
R C
Roberto Covino Frankfurt Institute for Advanced Studies
B E
Bernd Ensing University of Amsterdam
A F
Andrew Ferguson University of Chicago
L F
Laura Filion University of Utrecht
P G
Paraskevi Gkeka Sanofi France
R G
Rafael Gomez-Bombarelli Massachusetts Institute of Technology (MIT)
N J
Nick Jackson University of Illinois
T L
Tony Lelièvre l’École des Ponts ParisTech
M M
Marina Meila University of Washington
K P
Karen Palacio-Rodriguez MPI Biophysics, Frankfurt
G R
Grant Rotskoff University of Stanford
Y W
Yusu Wang University of California, San Diego (UCSD)
R Y
Rose Yu University of California, San Diego (UCSD)
H Z
Hanyu Zhang TikTok
M Z
Ming Zhong Illinois Institute of Technology

Schedule

Back to top
Monday, April 22, 2024
9:00-10:30 CDT
Dimension reduction from the user’s perspective

Speaker: Marina Meila (University of Washington)

10:30-11:00 CDT
Coffee Break
11:00-11:55 CDT
Learning the committor, free energy profile, and rates with ML-driven path sampling simulations

Speaker: Roberto Covino (Frankfurt Institute for Advanced Studies)

11:55-12:50 CDT
Towards quantitative prediction of kinetic rates and optimal reaction coordinates from the projected dynamics of transition paths

Speaker: Karen Palacio-Rodriguez (MPI Biophysics, Frankfurt)

12:50-13:45 CDT
Lunch Break
13:45-14:30 CDT
A high-bias, low-variance review of data-driven collective variable discovery and enhanced sampling

Speaker: Andrew Ferguson (University of Chicago)

14:45-15:30 CDT
Hands-on 
15:30-16:30 CDT
Social Hour
Tuesday, April 23, 2024
9:15-10:45 CDT
Mathematical aspects of coarse graining: functional inequalities and quasi-stationary distributions 

Speaker: Tony Lelièvre (l’École des Ponts ParisTech)

10:45-11:20 CDT
Coffee Break
11:20-11:50 CDT
Lightning Talks
  • Luke Evans, Model Misspecification and Collective Variables in Simulation-based inference for cryo-EM
  • Matteo Carli, Nonparametric free energy estimation in high-dimensional CV spaces
  • John Maier, Designing target-optimized coarse-grained representations for soft materials using attentive message-passing
  • Xiaoou Cheng, The surprising efficiency of temporal difference learning for rare event prediction
  • Shashank Sule, Learning collective variables for quantifiably good representations of molecular dynamics
  • Eric Beyerle, Learning Thermodynamics and Reaction Coordinates for Colloidal Phase Transitions
  • Andrea Guljas, Machine Learned Molecular Coarse-Graining of Protein Folding and Aggregation
  • Carl Henning Hansen, Combining Molecular Simulations and Time-Resolved Experimental Data using the Fokker-Planck Equation and Maximum Entropy Reweighting
  • Mike Jones, A Flow-matching Approach for Generative Backmapping of Biomolecules
  • Juno Nam, Revealing hidden alchemical degrees of freedom in machine learning interatomic potentials
11:50-13:15 CDT
Poster Session with Lunch Break
13:15-14:10 CDT
End-to-end differentiation for reversible coarse-graining and rare-event sampling

Speaker: Rafael Gomez-Bombarelli (Massachusetts Institute of Technology (MIT))

14:10-14:40 CDT
Coffee Break
14:40-15:35 CDT
Quantum-Chemical Predictions from Coarse-Grained Molecular Representations

Speaker: Nick Jackson (University of Illinois)

15:35-16:30 CDT
Denoising Force Fields

Speaker: Marloes Arts (Genmab)

Wednesday, April 24, 2024
9:00-9:55 CDT
From coarse-grained models to conformational ensembles 

Speaker: Grant Rotskoff (University of Stanford)

9:55-10:25 CDT
Coffee Break
10:25-11:20 CDT
Autoencoders for dimensionality reduction in molecular dynamics: Collective variable dimension, biasing, and transition states

Speaker: Paraskevi Gkeka (Sanofi France)

11:20-11:50 CDT
Lightning Talks
  • Adolfo Poma, GōMartini 3: A Computational Tool to Elucidate Conformational Changes in Proteins
  • Chatipat Lorpaiboon, Dimensionality reduction for transient and metastable systems using VAMPnets with boundary conditions
  • Abhik Ghosh Moulick, Comparative Exploration of nucleosome Dynamics using All-Atom and Coarse-Grain Approaches
  • Thomas Pigeon, Estimating committor function using rare event sampling
  • Suemin Lee, Unraveling Protein-Ligand Dissociation Kinetics Through State Predictive Information Bottleneck based Enhanced Sampling
  • Yaoyi Chen, Enhancing Data Efficiency in Coarse-Grained Force Field Parameterization with Machine Learning
  • Spencer Guo, Inexact iterative numerical linear algebra for neural network-based spectral estimation and rare-event prediction
  • Rutika Patel, Conformational dynamics of nucleosomal histone H2B tails revealed by molecular dynamics and Markov state model
  • Soojung Yang, Learning Collective Variables for Protein Folding with Labeled Data Augmentation through Geodesic Interpolation
  • Subarna Sasmal, Reaction Coordinates for Conformational Transitions Using Linear Discriminant Analysis on Positions
11:50-13:15 CDT
Poster Session with Lunch Break
13:15-14:10 CDT
Learning reaction coordinates and finding optimal model parameters from sampled trajectory ensembles

Speaker: Peter Bolhuis (University of Amsterdam)

14:10-14:40 CDT
Coffee Break
14:40-15:35 CDT
Interpretable iterative learning of committors and Cvs 

Speaker: David Aristoff (Colorado State University)

15:35-16:30 CDT
Automatic Symmetry Discovery from Data 

Speaker: Rose Yu (University of California, San Diego (UCSD))

Thursday, April 25, 2024
9:00-9:55 CDT
Learning a Neural Free-Energy Functional from Pair-Correlation Functions

Speaker: Bernd Ensing (University of Amsterdam)

9:55-10:25 CDT
Coffee Break
10:25-11:20 CDT
Learning interactions in crowded environments 

Speaker: Laura Filion (University of Utrecht)

11:20-12:15 CDT
Manifold coordinates with physical meaning, and applications in MDS data

Speaker: Hanyu Zhang (TikTok)

12:15-13:15 CDT
Lunch Break
13:15-14:10 CDT
Active learning of Boltzmann samplers and potential energies with quantum mechanical accuracy

Speaker: Pilar Cossio (Flatiron Institute)

14:10-14:40 CDT
Coffee Break
14:40-15:35 CDT
Graph learning models: theoretical understanding, limitations and enhancements

Speaker: Yusu Wang (University of California, San Diego (UCSD))

15:35-16:30 CDT
Discussion session
Friday, April 26, 2024
9:00-9:55 CDT
Designing protein models with machine learning and experimental data

Speaker: Klara Bonneau (Freie University Berlin)

9:55-10:25 CDT
Coffee Break
10:25-11:20 CDT
Learning Collective Variables and Kernels from Observations 

Speaker: Ming Zhong (Illinois Institute of Technology)

11:20-12:15 CDT
Modeling rare events. Discovering reaction pathways, slow variables, and committor probabilities with machine learning

Speaker: Christophe Chipot (CNRS and Université de Nancy)

12:15-12:30 CDT
Workshop Survey

Poster Session

Back to top

The posters that have been submitted in advance for the poster session are available on the poster session page.