This was part of Recent Advances in Random Networks

On differential privacy of U statistics and applications to random networks

Purna Sarkar, University of Texas, Austin

Thursday, January 15, 2026



Slides
Abstract: We consider the problem of privately estimating a parameter 𝔼[h(X1,…,Xk)], where X1, X2, …, Xk are i.i.d. data from some distribution and h is a permutation-invariant function. Without privacy constraints, standard estimators are U-statistics, which commonly arise in a wide range of problems, including subgraph counts in random networks, nonparametric signed rank tests, symmetry testing, uniformity testing,  and can be shown to be minimum variance unbiased estimators under mild conditions. Despite the recent surge in interest in private mean estimation, the privatizing of U-statistics has received relatively little attention. While existing private mean estimation algorithms can be applied to obtain confidence intervals, we show that they can lead to suboptimal private error, e.g., constant-factor inflation in the leading term, or even Θ(1/n) rather than O(1/n^2) in degenerate settings. To remedy this, we propose a new thresholding-based approach using local Hájek projections to reweight different subsets of the data. This leads to nearly optimal private error for non-degenerate U-statistics and a strong indication of near-optimality for degenerate U-statistics.