摘要
Principal component analysis (PCA) has been widely used in exploratory data analysis. Contrastive PCA (Abid et al.), a generalized method of PCA, is a new tool used to capture features of a target dataset relative to a background dataset while preserving the maximum amount of information contained in the data. With high dimensional data, contrastive PCA becomes impractical due to its high computational requirement of forming the contrastive covariance matrix and associated eigenvalue decomposition for extracting leading components. In this article, we propose a geometric curvilinear-search method to solve this problem and provide a convergence analysis. Our approach offers significant computational efficiencies. Specifically, it reduces the time complexity from (Formula presented.) to a more manageable (Formula presented.), where n, m are the sample sizes of the target data and background data, respectively, p is the data dimension and r is the number of leading components. Additionally, we streamline the space complexity from (Formula presented.), necessary for storing the contrastive covariance matrix, to a more economical (Formula presented.), sufficient for storing the data alone. Numerical examples are presented to show the merits of the proposed algorithm. Supplementary materials for this article are available online.
原文 | ???core.languages.en_GB??? |
---|---|
頁(從 - 到) | 909-916 |
頁數 | 8 |
期刊 | Journal of Computational and Graphical Statistics |
卷 | 33 |
發行號 | 3 |
DOIs | |
出版狀態 | 已出版 - 2024 |
指紋
深入研究「A Geometric Algorithm for Contrastive Principal Component Analysis in High Dimension」主題。共同形成了獨特的指紋。資料集
-
A Geometric Algorithm for Contrastive Principal Component Analysis in High Dimension
Lu, R.-S. (???dataset.roles.dataset.creator???), Wang, S.-H. (???dataset.roles.dataset.creator???) & Huang, S.-Y. (???dataset.roles.dataset.creator???), Taylor & Francis, 2023
DOI: 10.6084/m9.figshare.24712600.v1, https://tandf.figshare.com/articles/dataset/A_Geometric_Algorithm_for_Contrastive_Principal_Component_Analysis_in_High_Dimension/24712600/1
資料集
-
A Geometric Algorithm for Contrastive Principal Component Analysis in High Dimension
Lu, R.-S. (???dataset.roles.dataset.creator???), Wang, S.-H. (???dataset.roles.dataset.creator???) & Huang, S.-Y. (???dataset.roles.dataset.creator???), Taylor & Francis, 2023
DOI: 10.6084/m9.figshare.24712600, https://tandf.figshare.com/articles/dataset/A_Geometric_Algorithm_for_Contrastive_Principal_Component_Analysis_in_High_Dimension/24712600
資料集