Sparse generalized eigenvalue problem (GEP) plays a pivotal role in a large family of high-dimensional learning tasks, including sparse Fisher’s discriminant analysis, canonical correlation analysis, and sufficient dimension reduction. Most of the existing methods and theory in the context of specific statistical models that are special cases of sparse GEP require restrictive structural assumptions on the input matrices. This talk will focus on a two-stage computational framework for solving the non-convex optimization problem resulting from the sparse GEP. At the first stage, we solve a convex relaxation of the sparse GEP. Taking the solution as an initial value, we then exploit a non-convex optimization perspective and propose the truncated Rayleigh flow method (Rifle) to estimate the leading generalized eigenvector, and show that it converges to a solution with the optimal statistical rate of convergence. Theoretically, our method significantly improves upon the existing literature by eliminating the structural assumptions on the input matrices. Numerical studies in the context of several statistical models are provided to validate the theoretical results. We then apply the proposed method to an electrocorticography data to understand how human brains recall and mentally rehearse word sequences.
Sparse Generalized Eigenvalue Problem and Its Application to Multivariate Statistics
Tuesday, December 4, 2018 - 11:00 to 12:00
Kean Ming Tan, Assistant Professor, University of Minnesota, School of Statistics
Room 4192, Earth Sciences Building (2207 Main Mall)