This was part of
Statistics Meets Tensors
A latent space model for text networks
Emma Zhang, Emory University
Wednesday, May 7, 2025
Abstract: Network data enriched with textual information, referred to as text networks, arise in a wide range of applications, including email communications, scientific collaborations, and legal contracts. In such settings, both the structure of interactions (i.e., who connects with whom) and their content (i.e., what is communicated) are useful for understanding network relations. Traditional network analyses often focus only on the structure of the network and discard the rich textual information, resulting in an incomplete or inaccurate view of interactions. In this paper, we introduce a new modeling approach that incorporates texts into the analysis of networks using topic-aware text embedding, representing the text network as a generalized multi-layer network where each layer corresponds to a topic extracted from the data. We develop a new and flexible latent space network model that captures how node-topic preferences directly modulate edge formation and establish identifiability conditions for the proposed model. We tackle model estimation with a projected gradient descent algorithm and further discuss its theoretical properties. The efficacy of our proposed method is demonstrated through simulations and an analysis of an email network.