Statistics & Data Science Talk

Statistics and Data Science Talk

Event Date

Location
Physical and Data Sciences Bldg (PDSB) 1025
Speaker: Kai Zhang, Professor of Statistics and Operations Research, University of North Carolina, Chapel Hill
 
Title: BET and BELIEF
 
Abstract: We study the problem of distribution-free dependence detection and modeling through the new framework of binary expansion statistics (BEStat). The binary expansion testing (BET) avoids the problem of non-uniform consistency and improves upon a wide class of commonly used methods (a) by achieving the minimax rate in sample size requirement for reliable power and (b) by providing clear interpretations of global relationships upon rejection of independence. The binary expansion approach also connects the symmetry statistics with the current computing system to facilitate efficient bitwise implementation. Modeling with the binary expansion linear effect (BELIEF) is motivated by the fact that two linearly uncorrelated binary variables must be also independent. Inferences from BELIEF are easily interpretable because they describe the association of binary variables in the language of linear models, yielding convenient theoretical insight and striking parallels with the Gaussian world. With BELIEF, one may study generalized linear models (GLM) through transparent linear models, providing insight into how modeling is affected by the choice of link. We explore these phenomena and provide a host of related theoretical results. This is joint work with Benjamin Brown and Xiao-Li Meng.
 
References:
a.    Zhang, K.  (2019). BET on Independence. Journal of the American Statistical Association, 114, 1620-1637.
b.    Brown, B., Zhang, K., and Meng, X-L. (2025). BELIEF in Dependence: Leveraging Atomic Linearity in Data Bits for Rethinking Generalized Linear Models. The Annals of Statistics, 53, 1068-1094.
 
Faculty webspage (links to UNC): https://zhangk.web.unc.edu/ 

Tags