Open Access from IS Review: A Computational Perspective on Projection Pursuit in High Dimensions: Feasible or Infeasible Feature Extraction

Each week, we select a recently published Open Access article to feature. This week’s article comes from the International Statistical Review and assesses the challenges and feasibility of projection pursuit in analyzing large data sets. 

The article’s abstract is given below, with the full article available to read here.

Zhang, C.Ye, J., and Wang, X. (2022A Computational Perspective on Projection Pursuit in High Dimensions: Feasible or Infeasible Feature ExtractionInternational Statistical Reviewhttps://doi.org/10.1111/insr.12517.

Finding a suitable representation of multivariate data is fundamental in many scientific disciplines. Projection pursuit ( PP) aims to extract interesting ‘non-Gaussian’ features from multivariate data, and tends to be computationally intensive even when applied to data of low dimension. In high-dimensional settings, a recent work (Bickel et al., 2018) on PP addresses asymptotic characterization and conjectures of the feasible projections as the dimension grows with sample size. To gain practical utility of and learn theoretical insights into PP in an integral way, data analytic tools needed to evaluate the behaviour of PP in high dimensions become increasingly desirable but are less explored in the literature. This paper focuses on developing computationally fast and effective approaches central to finite sample studies for (i) visualizing the feasibility of PP in extracting features from high-dimensional data, as compared with alternative methods like PCA and ICA, and (ii) assessing the plausibility of PP in cases where asymptotic studies are lacking or unavailable, with the goal of better understanding the practicality, limitation and challenge of PP in the analysis of large data sets.

More Details