Joint prosodic and spectral modeling for robust speaker verification

  • Yuan Fu Liao
  • Wen Chieh Chang
  • Zong You Xie
  • Ding Yun Zeng
  • Yau Tarng Juang

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

In this paper, a joint prosodic and spectral modeling framework is proposed as an alternative to traditional score-domain fusion approaches, to alleviate the problem of mismatched channel, handset, and ambient noise conditions. The basic idea is to embed the hierarchical structure of speech prosody into an ergodic HMM (EHMM), and to model the prosodic state transitions and the prosodic/spectral features by the EHMM's states, state transition probabilities, and state-dependent observation distributions, respectively. Experimental results on the standard single-speaker detection task of the NIST 2001 Speaker Recognition Evaluation (NIST-SRE 2001) showed that the proposed approach not only outperformed the spectral feature-based baseline (an equal error rate, EER, of 8.04% vs. 8.64%) but also performed slightly better than the score-domain fusion approach (8.44%).
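The EHMM idea in the abstract can be illustrated with a minimal sketch of the forward algorithm for a fully connected (ergodic) HMM. Everything specific here is an assumption made for illustration and is not taken from the paper: each frame is a hypothetical joint vector of prosodic and spectral features, each state uses a single diagonal-covariance Gaussian as its observation distribution, and the function names are invented.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(values)
    if m == float("-inf"):
        return m
    return m + math.log(sum(math.exp(v - m) for v in values))


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - mu) ** 2 / v)
        for xi, mu, v in zip(x, mean, var)
    )


def ehmm_log_likelihood(frames, log_init, log_trans, means, variances):
    """Forward algorithm for an ergodic HMM (every state can reach every state).

    frames      : list of joint feature vectors (assumed prosodic + spectral)
    log_init[i] : log prior probability of starting in state i
    log_trans[i][j] : log probability of moving from state i to state j
    means[j], variances[j] : Gaussian parameters of state j (an assumption;
                             the abstract does not specify the distribution)
    """
    n = len(log_init)
    # Initialisation: prior times the emission of the first frame.
    alpha = [
        log_init[j] + log_gaussian_diag(frames[0], means[j], variances[j])
        for j in range(n)
    ]
    # Recursion: sum over all predecessor states, then emit the next frame.
    for x in frames[1:]:
        alpha = [
            log_sum_exp([alpha[i] + log_trans[i][j] for i in range(n)])
            + log_gaussian_diag(x, means[j], variances[j])
            for j in range(n)
        ]
    # Termination: total log-likelihood of the frame sequence.
    return log_sum_exp(alpha)
```

In a verification setting, a score could then be formed, for example, as the difference between the log-likelihood under a claimed speaker's model and under a background model. That scoring rule is a common convention assumed here, not something the abstract states.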

Original language: English
Title of host publication: Proceedings of the 4th International Conference on Speech Prosody, SP 2008
Publisher: International Speech Communications Association
Pages: 143-146
Number of pages: 4
ISBN (Print): 9780616220030
Publication status: Published - 2008
Event: 4th International Conference on Speech Prosody 2008, SP 2008 - Campinas, Brazil
Duration: 6 May 2008 to 9 May 2008

Publication series

Name: Proceedings of the 4th International Conference on Speech Prosody, SP 2008

Conference

Conference: 4th International Conference on Speech Prosody 2008, SP 2008
Country/Territory: Brazil
City: Campinas
Period: 6/05/08 to 9/05/08
