Ensemble based speaker recognition using unsupervised data selection

Chien Lin Huang, Jia Ching Wang, Bin Ma

Research output: Contribution to journal › Review article › peer-review

2 Scopus citations

Abstract

This paper presents an ensemble-based speaker recognition method that uses unsupervised data selection. Ensemble learning is a type of machine learning that combines several weak learners to achieve better performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained on these non-overlapping subsets of speech data to improve recognition accuracy. This approach has two advantages. First, without any auxiliary information, the ensemble classifiers based on unsupervised data selection make use of the different acoustic characteristics of speech data. Second, the ensemble classifiers apply a divide-and-conquer strategy to avoid the local optima that can arise when training a single classifier. Our experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.
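The pipeline outlined in the abstract (split frames into non-overlapping subsets, train one classifier per subset, combine their scores) can be illustrated with a minimal sketch. This is not the authors' implementation. Every concrete choice is an assumption made for illustration: k-means as the unsupervised data selection method, one diagonal Gaussian per speaker and subset as the weak learner, mean log-likelihood as the fusion rule, and synthetic frames in place of real acoustic features. All function names (`assign`, `kmeans`, `train_ensemble`, `score`, `identify`, `synth_speaker`) are hypothetical.

```python
import numpy as np


def assign(frames, centers):
    """Return the index of the nearest center for every frame."""
    dists = ((frames[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def kmeans(frames, k, iters=20, seed=0):
    """Plain k-means; an assumed stand-in for unsupervised data selection."""
    rng = np.random.default_rng(seed)
    centers = frames[rng.choice(len(frames), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = assign(frames, centers)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = frames[labels == j].mean(axis=0)
    return centers


def fit_gaussian(x):
    """Diagonal Gaussian (mean, variance) with a small variance floor."""
    return x.mean(axis=0), x.var(axis=0) + 1e-3


def log_lik(x, mean, var):
    """Average per-frame log-likelihood under a diagonal Gaussian."""
    per_frame = -0.5 * (np.log(2 * np.pi * var) + (x - mean) ** 2 / var).sum(axis=1)
    return per_frame.mean()


def train_ensemble(train, k):
    """train maps speaker -> (n_frames, dim) array. One model per speaker and subset."""
    centers = kmeans(np.vstack(list(train.values())), k)
    models = {}
    for spk, frames in train.items():
        labels = assign(frames, centers)
        # Subsets are non-overlapping: each frame belongs to exactly one cluster.
        models[spk] = {
            j: fit_gaussian(frames[labels == j])
            for j in range(k)
            if np.sum(labels == j) >= 2
        }
    return centers, models


def score(frames, centers, spk_models):
    """Fuse the subset classifiers by averaging their log-likelihoods."""
    labels = assign(frames, centers)
    scores = [
        log_lik(frames[labels == j], *spk_models[j])
        for j in spk_models
        if np.any(labels == j)
    ]
    return float(np.mean(scores)) if scores else float("-inf")


def identify(frames, centers, models):
    """Pick the speaker whose ensemble gives the highest fused score."""
    return max(models, key=lambda spk: score(frames, centers, models[spk]))


def synth_speaker(rng, offset, n=200, dim=3):
    """Synthetic frames: two shared acoustic classes, shifted by a speaker offset."""
    classes = rng.choice([-5.0, 5.0], size=(n, 1))
    return classes + offset + 0.5 * rng.standard_normal((n, dim))


rng = np.random.default_rng(1)
train = {"spk_a": synth_speaker(rng, 0.5), "spk_b": synth_speaker(rng, -0.5)}
centers, models = train_ensemble(train, k=2)
predicted = identify(synth_speaker(rng, 0.5), centers, models)
```

In this toy setup the clusters are meant to capture the shared acoustic classes, so each subset model only has to separate speakers within one class. A real system would presumably use stronger per-subset classifiers and a calibrated fusion step, neither of which is specified in the abstract.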

Original language: English
Article number: e10
Journal: APSIPA Transactions on Signal and Information Processing
Volume: 5
State: Published - 10 May 2016

Keywords

  • Ensemble classifier
  • Speaker recognition
  • Unsupervised data selection
