Ohio State is in the process of revising websites and program materials to accurately reflect compliance with the law. While this work occurs, language referencing protected class status or other activities prohibited by Ohio Senate Bill 1 may still appear in some places. However, all programs and activities are being administered in compliance with federal and state law.

Seminar: Vince Vu

Department of Statistics Seminar Series
February 9, 2012
All Day
209 W Eighteenth Ave (EA), Room 170

Title

Sparse Principal Subspace Estimation

Speaker

Vince Vu, Carnegie Mellon University

Abstract

Principal components analysis (PCA) is an important and widely used technique for dimension reduction in the visualization and analysis of multivariate data. Its main idea is to capture the principal modes of variation of the data within a subspace spanned by the leading eigenvectors of a population covariance matrix. Unfortunately, it may not provide consistent estimates of the population eigenvectors when applied to high-dimensional data.

In this talk I introduce a new method, sparse principal subspace estimation, that is more suitable for the high-dimensional regime. The method builds on the main idea of PCA by imposing the following constraint: the principal subspace of variation is spanned mostly by a smaller number of variables. This sparsity constraint is reasonable when the signal of interest has sparse representation in some basis, and importantly, it can make estimation feasible and enhance interpretability in high-dimensions. I will discuss theoretical properties, algorithmic issues, and examples in data visualization and the analysis of neuroimaging data.