Abstract
This paper proposes the KC Score, a measure of feature importance for clustering analysis of high-dimensional data. The KC Score evaluates each feature's contribution from the correlation between the original features and the features reconstructed from a low-dimensional latent space. A KC Score-based feature selection strategy is further developed for clustering analysis. We investigate the performance of the proposed strategy on four single-cell RNA sequencing (scRNA-seq) datasets. The results show that the strategy effectively selects features that are important for clustering. In particular, in three of the datasets it selected fewer than 5% of the features and achieved clustering performance equal to or better than that obtained with all of the features.
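The abstract does not spell out how the latent space or the reconstruction is computed, so the following is only a rough sketch of the general idea (score each feature by how well a low-dimensional reconstruction correlates with it, then keep the top-scoring fraction). It is not the paper's KC Score: a truncated SVD (PCA) is swapped in as a stand-in latent space, and the function names, `n_components`, and `fraction` are hypothetical choices made for illustration.

```python
import numpy as np


def reconstruction_correlation_scores(X, n_components=2):
    """Per-feature Pearson correlation between each original column of X
    and its reconstruction from a rank-`n_components` PCA.

    ASSUMPTION: PCA is a stand-in latent space, not the paper's method.
    X has shape (n_samples, n_features).
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each feature

    # Truncated SVD gives the low-dimensional latent representation.
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    k = n_components
    X_hat = (U[:, :k] * s[:k]) @ Vt[:k]  # rank-k reconstruction
    X_hat = X_hat - X_hat.mean(axis=0)

    # Column-wise Pearson correlation; constant columns get a score of 0.
    num = (Xc * X_hat).sum(axis=0)
    den = np.sqrt((Xc ** 2).sum(axis=0) * (X_hat ** 2).sum(axis=0))
    safe_den = np.where(den > 0, den, 1.0)
    return np.where(den > 0, num / safe_den, 0.0)


def select_top_features(scores, fraction=0.05):
    """Return the sorted indices of the top `fraction` of features by score."""
    scores = np.asarray(scores)
    n_keep = max(1, int(round(fraction * scores.size)))
    top = np.argsort(scores)[::-1][:n_keep]
    return np.sort(top)
```

The selected column indices could then be passed to any clustering routine in place of the full feature matrix; whether this reproduces the reported behaviour depends on details of the KC Score that the abstract does not give.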
| Original language | English |
|---|---|
| Article number | 11522 |
| Journal | Scientific Reports |
| Volume | 12 |
| Issue number | 1 |
| DOIs | |
| State | Published - Dec 2022 |
| Title | Using the Kriging Correlation for unsupervised feature selection problems |