Prosody modeling and eigen-prosody analysis for robust speaker recognition

Zi He Chen, Yuan Fu Liao, Yau Tarng Juang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

9 引文 斯高帕斯(Scopus)

摘要

Unseen handset mismatch and limited training/test data are the major source of performance degradation for speaker identification in telecommunication environment. In this paper, a vector quantization (VQ)-based prosody modeling and an eigen-prosody analysis (EPA) is integrated to transform the close-set speaker identification problem into a full text document retrieval-similar task. The prosody modeling labels the prosodic feature contours of a speaker's speech into sequences of prosody states. EPA then constructs a compact eigen-prosody space to represent the constellation of speakers. Furthermore, EPA is fused with a lower-level a priori knowledge interpolation (AKI) handset distortion compensator to complement each other. Experimental results on the HTIMIT database had shown that about 41.0% and 32.8% relative error rate reduction for seen and unseen handsets, respectively, was achieved comparing with the maximum a priori-adapted Gaussian mixture model/cepstral mean subtraction (MAP-GMM/CMS) baseline.

原文???core.languages.en_GB???
主出版物標題2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
發行者Institute of Electrical and Electronics Engineers Inc.
頁面I185-I188
ISBN(列印)0780388747, 9780780388741
DOIs
出版狀態已出版 - 2005
事件2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
持續時間: 18 3月 200523 3月 2005

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
I
ISSN(列印)1520-6149

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
國家/地區United States
城市Philadelphia, PA
期間18/03/0523/03/05

指紋

深入研究「Prosody modeling and eigen-prosody analysis for robust speaker recognition」主題。共同形成了獨特的指紋。

引用此