Description
Random permutation, a particularly interesting type of stochasticity, has been a fundamental object of interest in two branches of statistics: causal inference, which focuses on drawing causal conclusions from randomized and quasi-randomized experiments, and distribution-free methods, which focus on constructing and studying the stochastic structure of functionals of a distribution-free nature. The two fields have each witnessed explosive development in recent years. Notably, as randomization, rerandomization, and multiple permutation tests have boomed in causal inference over the last ten years, conformal prediction, knockoffs, rank statistics, graph-based statistics, optimal transport, combinatorial inference, and Stein’s method have simultaneously received increasing attention in the world of distribution-free methods.
Researchers working in these two areas are now, more than ever, realizing the foundational connection between them: they are faced with similar data analysis challenges and need similar technical tools. This workshop will bring experts from these two distinct worlds together, to communicate, to learn from each other, and to stimulate conversations and collaborations.
Organizers
Speakers
Schedule
Speaker: Holger Dette (Ruhr-Universität Bochum)
We propose a reproducing kernel Hilbert space approach for statistical inference regarding the slope in a function-on-function linear regression via penalized least squares, regularized by the thin-plate spline smoothness penalty. We derive a Bahadur expansion for the slope surface estimator and prove its weak convergence as a process in the space of all continuous functions. As a consequence of these results, we construct minimax optimal estimates, simultaneous confidence regions for the slope surface, and simultaneous prediction bands. Moreover, we derive new tests for the hypothesis that the maximum deviation between the “true” slope surface and a given surface is less than or equal to a given threshold. In other words, we are not trying to test for exact equality (because in many applications this hypothesis is hard to justify), but rather for pre-specified deviations under the null hypothesis. To ensure practicability, non-standard bootstrap procedures are developed addressing particular features that arise in these testing problems. We also demonstrate that the new methods have good finite sample properties by means of a simulation study and illustrate their practicability by analyzing a data example.
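For readers unfamiliar with the setup, here is a minimal sketch of the estimation step only, assuming discretized curves and a plain ridge penalty standing in for the thin-plate spline penalty (so this is an illustration, not the authors' implementation):

```python
# Sketch: function-on-function regression Y_i(t) = \int X_i(s) beta(s, t) ds + eps_i(t),
# fit by penalized least squares on discretized grids. A simple ridge penalty is used
# here purely for illustration in place of the thin-plate spline smoothness penalty.
import numpy as np

def fit_slope_surface(X, Y, s_grid, lam=1e-2):
    """X: (n, p) covariate curves on s_grid; Y: (n, q) response curves on a t-grid.
    Returns beta_hat of shape (p, q), the estimated slope surface on the grid."""
    n, p = X.shape
    ds = s_grid[1] - s_grid[0]                        # quadrature weight for the integral
    A = (ds ** 2) * (X.T @ X) / n + lam * np.eye(p)   # penalized normal equations
    b = ds * (X.T @ Y) / n
    return np.linalg.solve(A, b)

# toy usage
rng = np.random.default_rng(0)
s, t = np.linspace(0, 1, 50), np.linspace(0, 1, 40)
beta_true = np.outer(np.sin(np.pi * s), np.cos(np.pi * t))
X = rng.normal(size=(200, 50))
Y = X @ beta_true * (s[1] - s[0]) + 0.1 * rng.normal(size=(200, 40))
beta_hat = fit_slope_surface(X, Y, s)                 # shape (50, 40)
```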
Speaker: Sam Pimentel (University of California, Berkeley)
Speaker: Lihua Lei (Stanford University)
Speaker: Jingshen Wang (University of California, Berkeley)
Speaker: Anqi Zhao (National University of Singapore)
Randomized experiments balance all covariates on average and provide the gold standard for estimating treatment effects. Chance imbalances nevertheless exist to varying degrees in realized treatment allocations, complicating the interpretation of experimental results. To inform readers of the comparability of treatment groups at baseline, modern scientific publications often report covariate balance tables with not only covariate means by treatment group but also the associated p-values from significance tests of their differences. The practical need to avoid small p-values as indicators of poor balance motivates balance checks and rerandomization based on these p-values from covariate balance tests (ReP) as an attractive tool for improving covariate balance in randomized experiments. Despite the intuitiveness of such a strategy and its possibly already widespread use in practice, the existing literature lacks results about its implications for subsequent inference, subjecting many effectively rerandomized experiments to possibly inefficient analyses. To fill this gap, we examine a variety of potentially useful schemes for ReP and quantify their impact on subsequent inference. Specifically, we focus on three estimators of the average treatment effect from the unadjusted, additive, and fully interacted linear regressions of the outcome on treatment, respectively, and derive their asymptotic sampling properties under ReP. The main findings are threefold. First, ReP improves covariate balance between treatment groups, thereby strengthening the causal conclusions that can be drawn from experimental data. In addition to increasing the comparability of treatment groups, the improved balance also reduces the asymptotic conditional biases of the estimators and ensures more coherent inferences between covariate-adjusted and unadjusted analyses. Second, the estimator from the fully interacted regression is asymptotically the most efficient under all ReP schemes examined and permits convenient regression-assisted inference identical to that under complete randomization. Third, ReP improves the asymptotic efficiency of the estimators from the unadjusted and additive regressions. The corresponding standard regression analyses are accordingly still valid but in general overconservative. As a result, the combination of ReP for treatment allocation and fully interacted regression for analysis ensures both covariate balance and convenient, efficient inference. Importantly, our theory is design-based and holds regardless of how well the models involved in both the rerandomization and analysis stages represent the true data-generating processes.
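To make the ReP idea concrete, here is a minimal hypothetical sketch (not the paper's code): the assignment is redrawn until every covariate's two-sample t-test p-value exceeds a pre-specified threshold.

```python
# Hypothetical sketch of rerandomization based on covariate balance test p-values (ReP):
# redraw the treatment assignment until no covariate shows a small balance-test p-value.
import numpy as np
from scipy import stats

def rep_assignment(X, n_treat, alpha=0.15, max_draws=10_000, seed=0):
    """X: (n, k) baseline covariates. Returns a 0/1 assignment vector whose
    per-covariate two-sample t-test p-values all exceed alpha."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    for _ in range(max_draws):
        z = np.zeros(n, dtype=int)
        z[rng.choice(n, size=n_treat, replace=False)] = 1
        pvals = [stats.ttest_ind(X[z == 1, j], X[z == 0, j]).pvalue
                 for j in range(X.shape[1])]
        if min(pvals) > alpha:       # acceptable balance: no small p-values
            return z
    raise RuntimeError("no acceptable assignment found; lower alpha")

# toy usage: 100 units, 3 covariates, half treated
X = np.random.default_rng(1).normal(size=(100, 3))
z = rep_assignment(X, n_treat=50)
```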
Speaker: Tirthankar Dasgupta (Rutgers University)
Speaker: Mona Azadkia (ETH Zürich and London School of Economics)
Speaker: EunYi Chung (University of Illinois at Urbana-Champaign)
This paper studies permutation tests for dependent data. Under a weak dependence structure, we prove the asymptotic validity of block-wise permutation tests using studentization and self-normalized test statistics, where the block size is not a function of the sample size. Monte Carlo simulation exercises confirm that both the studentized and the self-normalized block-wise permutation tests have the correct test sizes with moderate degrees of dependence.
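As a hedged illustration of the block-wise idea, under a simplified two-sample setup of my own choosing (not necessarily the paper's), fixed-length blocks of the pooled series are permuted and a studentized difference-in-means statistic is recomputed:

```python
# Illustrative block-wise permutation test for equality of means of two weakly
# dependent series, with a studentized statistic and fixed block length.
import numpy as np

def studentized_diff(x, y):
    return (x.mean() - y.mean()) / np.sqrt(x.var(ddof=1)/len(x) + y.var(ddof=1)/len(y))

def block_permutation_test(x, y, block=5, n_perm=2000, seed=0):
    rng = np.random.default_rng(seed)
    obs = abs(studentized_diff(x, y))
    pooled = np.concatenate([x, y])
    # split the pooled series into consecutive blocks of fixed length
    blocks = pooled[: len(pooled) // block * block].reshape(-1, block)
    nx = len(x) // block * block
    count = 0
    for _ in range(n_perm):
        perm = blocks[rng.permutation(len(blocks))].ravel()
        count += abs(studentized_diff(perm[:nx], perm[nx:])) >= obs
    return (count + 1) / (n_perm + 1)

# toy usage with two independent series of length 200
rng = np.random.default_rng(1)
print(block_permutation_test(rng.normal(size=200), rng.normal(size=200) + 0.3))
```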
Speaker: Adrian Roellin (National University of Singapore)
Speaker: Panos Toulis (University of Chicago)
Speaker: Bodhi Sen (Columbia University)
Speaker: Philip Stark (University of California, Berkeley)
Speaker: Colin Fogarty (University of Michigan)
Competing approaches to inference in randomized experiments differ primarily in (1) which notion of “no treatment effect” is being tested; and (2) whether or not stochasticity is assumed in the potential outcomes and covariates. Recommended hypothesis tests in a given paradigm may be invalid even asymptotically when applied in other frameworks, creating the risk of misinterpretation by practitioners when a given method is deployed. We develop a general framework for ensuring validity across competing modes of inference. We first describe a nested collection of bootstrap resampling schemes providing valid inference for average treatment effects at differing levels of assumed stochasticity, ranging from superpopulation models, which assume random potential outcomes and covariates, to finite population inference, where only the assignment is viewed as random. To provide exact inference for stronger notions of no effect (such as Fisher’s sharp null), we then employ permutation tests based upon prepivoted test statistics, wherein a test statistic is first transformed by a particular bootstrap cumulative distribution function and its permutation distribution is then enumerated. This provides a single mode of inference which is exact for sharp nulls and asymptotically valid for average treatment effects at the specified level of stochasticity, with higher-order improvements for inference in superpopulation models.
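A rough sketch of the prepivoting step, under simplifications of my own (a plain unit-resampling bootstrap and a difference-in-means statistic, rather than the nested schemes of the talk):

```python
# Sketch of a prepivoted permutation test for a binary treatment z and outcome y:
# each relabeled statistic is passed through a bootstrap estimate of its null CDF
# before the permutation distribution is formed.
import numpy as np

def diff_in_means(y, z):
    return y[z == 1].mean() - y[z == 0].mean()

def bootstrap_cdf_at(y, z, t, n_boot=200, rng=None):
    """Estimate P(T <= t) under the null by centered resampling of units."""
    rng = rng or np.random.default_rng()
    obs = diff_in_means(y, z)
    draws = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y), len(y))
        if 0 < z[idx].sum() < len(y):                 # need both groups present
            draws.append(diff_in_means(y[idx], z[idx]) - obs)
    return np.mean(np.array(draws) <= t)

def prepivoted_permutation_pvalue(y, z, n_perm=500, seed=0):
    rng = np.random.default_rng(seed)
    obs = bootstrap_cdf_at(y, z, diff_in_means(y, z), rng=rng)
    exceed = 0
    for _ in range(n_perm):
        zp = rng.permutation(z)                       # relabel treatment
        exceed += bootstrap_cdf_at(y, zp, diff_in_means(y, zp), rng=rng) >= obs
    return (exceed + 1) / (n_perm + 1)
```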
Speaker: Lei Shi (University of California, Berkeley)
Speaker: Nianqiao Phyllis Ju (Purdue University)
We consider a vector of N independent binary variables, each with a different probability of success. The distribution of the vector conditional on its sum is known as the conditional Bernoulli distribution. Assuming that N goes to infinity and that the sum is proportional to N, exact sampling costs order N^2, while a simple Markov chain Monte Carlo algorithm using ‘swaps’ has constant cost per iteration. We provide conditions under which this Markov chain converges in order N log N iterations. Our proof relies on couplings and an auxiliary Markov chain defined on a partition of the space into favorable and unfavorable pairs. This talk is based on joint work with Jeremy Heng and Pierre Jacob.
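A minimal sketch of such a swap chain, assuming a uniform proposal over (selected, unselected) pairs and a Metropolis-Hastings acceptance step:

```python
# Sketch of a 'swap' MCMC sampler for the conditional Bernoulli distribution:
# given success probabilities p and a fixed sum, each iteration proposes swapping
# one selected index with one unselected index.
import numpy as np

def swap_chain(p, total, n_iter=10_000, seed=0):
    rng = np.random.default_rng(seed)
    N = len(p)
    x = np.zeros(N, dtype=int)
    x[rng.choice(N, size=total, replace=False)] = 1    # arbitrary starting state
    odds = p / (1 - p)
    for _ in range(n_iter):
        i = rng.choice(np.flatnonzero(x == 1))          # currently selected index
        j = rng.choice(np.flatnonzero(x == 0))          # currently unselected index
        # swapping i out and j in changes the unnormalized probability by odds[j] / odds[i]
        if rng.random() < min(1.0, odds[j] / odds[i]):
            x[i], x[j] = 0, 1
    return x

# toy usage: N = 100 heterogeneous probabilities, conditioned on the sum being 30
p = np.random.default_rng(1).uniform(0.05, 0.95, size=100)
sample = swap_chain(p, total=30)
```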
Speaker: Lester Mackey (Microsoft New England)
This talk will introduce two new tools for summarizing a probability distribution more effectively than independent sampling or standard Markov chain Monte Carlo thinning:
- Given an initial n point summary (for example, from independent sampling or a Markov chain), kernel thinning finds a subset of only square-root n points with comparable worst-case integration error across a reproducing kernel Hilbert space.
- If the initial summary suffers from biases due to off-target sampling, tempering, or burn-in, Stein thinning simultaneously compresses the summary and improves the accuracy by correcting for these biases.
These tools are especially well-suited for tasks that incur substantial downstream computation costs per summary point, such as organ and tissue modeling, in which each simulation consumes thousands of CPU hours.
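Not the kernel thinning or Stein thinning algorithms of the references below, but as a toy stand-in for the same compression goal, greedy kernel herding with a Gaussian kernel picks a small subset whose kernel mean stays close to that of the full sample:

```python
# Greedy kernel herding (without replacement) as a simple stand-in for
# distribution compression: keep the subset's kernel mean close to the
# full sample's kernel mean.
import numpy as np

def gaussian_kernel(A, B, bw=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * bw ** 2))

def herding_thin(X, m, bw=1.0):
    """Greedily pick m of the n rows of X."""
    K = gaussian_kernel(X, X, bw)
    target = K.mean(axis=1)          # full-sample kernel mean at each candidate
    chosen, running = [], np.zeros(len(X))
    for t in range(m):
        scores = target - running / (t + 1)
        scores[chosen] = -np.inf     # select without replacement
        j = int(np.argmax(scores))
        chosen.append(j)
        running += K[:, j]
    return X[chosen]

# compress a 1,000-point sample down to roughly sqrt(n) points
X = np.random.default_rng(0).normal(size=(1000, 2))
coreset = herding_thin(X, m=32)
```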
Based on joint work with Raaz Dwivedi, Marina Riabiz, Wilson Ye Chen, Jon Cockayne, Pawel Swietach, Steven A. Niederer, Chris. J. Oates, Abhishek Shetty, and Carles Domingo-Enrich:
- Kernel Thinning (arXiv:2105.05842)
- Optimal Thinning of MCMC Output (arXiv:2005.03952)
- Generalized Kernel Thinning (arXiv:2110.01593)
- Distribution Compression in Near-linear Time (arXiv:2111.07941)
- Compress Then Test: Powerful Kernel Testing in Near-linear Time (arXiv:2301.05974)
Speaker: Jingshu Wang (University of Chicago)
Speaker: Yaniv Romano (Technion – Israel Institute of Technology)
Speaker: Xinran Li (University of Illinois at Urbana-Champaign)