Using the Kriging Correlation for unsupervised feature selection problems

Cheng Han Chua, Meihui Guo, Shih Feng Huang

研究成果: 雜誌貢獻期刊論文同行評審

摘要

This paper proposes a KC Score to measure feature importance in clustering analysis of high-dimensional data. The KC Score evaluates the contribution of features based on the correlation between the original features and the reconstructed features in the low dimensional latent space. A KC Score-based feature selection strategy is further developed for clustering analysis. We investigate the performance of the proposed strategy by conducting a study of four single-cell RNA sequencing (scRNA-seq) datasets. The results show that our strategy effectively selects important features for clustering. In particular, in three datasets, our proposed strategy selected less than 5% of the features and achieved the same or better clustering performance than when using all of the features.

原文???core.languages.en_GB???
文章編號11522
期刊Scientific Reports
12
發行號1
DOIs
出版狀態已出版 - 12月 2022

指紋

深入研究「Using the Kriging Correlation for unsupervised feature selection problems」主題。共同形成了獨特的指紋。

引用此