Speaker Identification Using Discriminative Features and Sparse Representation

Yu Hao Chin, Jia Ching Wang, Chien Lin Huang, Kuang Yao Wang, Chung Hsien Wu

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Speaker identification is an important topic with relevance to various disciplines. This paper proposes a novel speaker identification system, which consists of two major components-feature extraction and sparse representation classifier (SRC). Although SRC has been utilized for many classification purposes, few studies have provided insight into the link between the commonly used speaker identification feature, i-vector, and SRC. To combine i-vector and SRC sufficiently, we use probabilistic principal component analysis and Bartlett test to extract high-quality i-vector to construct a discriminative dictionary in SRC, supporting effective speaker identification. Besides improving dictionary from the i-vector aspect, we also utilize dictionary learning to further enhance the content of the dictionary. Two learning methods are proposed-robust principal component analysis dictionary and SVD-dictionary. Furthermore, we propose constructing a noise dictionary and combine it with the original dictionary to absorb and suppress noise when implementing the sparse coding. Various coding methods are utilized and analyzed. A comparison to the methods for speaker identification reveals that the proposed method outperforms the baselines and confirms its feasibility.

Original languageEnglish
Article number7872470
Pages (from-to)1979-1987
Number of pages9
JournalIEEE Transactions on Information Forensics and Security
Volume12
Issue number8
DOIs
StatePublished - Aug 2017

Keywords

  • Sparse representation classifier (SRC)
  • speaker identification

Fingerprint

Dive into the research topics of 'Speaker Identification Using Discriminative Features and Sparse Representation'. Together they form a unique fingerprint.

Cite this