Abstract
A novel eigen-prosody analysis approach is proposed for robust speaker recognition under a mismatch handset environment. The idea is to convert the prosodic contours of a speaker's speech into sequences of prosody symbols, and thereby transform the speaker recognition problem into a task similar to full-text document retrieval. Experimental results on the HTIMIT corpus show that, even though only a small amount of training/test data is available, a relative error rate reduction of about 32.2% can be achieved compared with the conventional Gaussian mixture model/cepstral mean subtraction approach.
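The symbol-sequence idea in the abstract can be illustrated with a toy sketch. This is not the authors' method: the three-symbol alphabet (`R`, `F`, `L`), the 2 Hz threshold, the bigram "terms", the toy pitch values, and all function names are assumptions made for illustration. The eigen-decomposition step implied by "eigen-prosody" is omitted, and plain cosine similarity between bigram count vectors stands in for the retrieval step.

```python
import math
from collections import Counter


def symbolize(contour, threshold=2.0):
    """Map a pitch contour (Hz per frame) to R (rise), F (fall), L (level) symbols.

    The alphabet and the threshold are hypothetical choices for this sketch.
    """
    symbols = []
    for prev, cur in zip(contour, contour[1:]):
        delta = cur - prev
        if delta > threshold:
            symbols.append("R")
        elif delta < -threshold:
            symbols.append("F")
        else:
            symbols.append("L")
    return "".join(symbols)


def bigram_counts(symbols):
    """Treat each pair of adjacent symbols as a 'term', as in text retrieval."""
    return Counter(symbols[i:i + 2] for i in range(len(symbols) - 1))


def cosine(a, b):
    """Cosine similarity between two Counters (missing keys count as 0)."""
    dot = sum(a[key] * b[key] for key in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def identify(test_contour, enrolled):
    """Return the enrolled speaker whose 'prosody document' best matches the query."""
    query = bigram_counts(symbolize(test_contour))
    return max(enrolled, key=lambda name: cosine(query, enrolled[name]))


# Toy enrollment data (invented values, not taken from HTIMIT).
enrolled = {
    "speaker_a": bigram_counts(
        symbolize([100, 110, 120, 110, 100, 110, 120, 110, 100])),
    "speaker_b": bigram_counts(
        symbolize([100, 101, 100, 101, 100, 120, 121, 120, 121])),
}

best = identify([105, 118, 128, 115, 104, 116, 127, 116], enrolled)
print(best)
```

Because the symbols encode only the direction of pitch movement, a representation of this kind is plausibly less sensitive to handset-dependent spectral distortion than cepstral features, which is the motivation the abstract gives for working with prosody.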
| Original language | English |
|---|---|
| Pages (from - to) | 1233-1235 |
| Number of pages | 3 |
| Journal | Electronics Letters |
| Volume | 40 |
| Issue number | 19 |
| DOIs | |
| Publication status | Published - 16 Sep 2004 |
Title: Eigen-prosody analysis for robust speaker recognition under mismatch handset environment