A maximum entropy approach to Chinese grapheme-to-phoneme conversion

研究成果: 書貢獻/報告類型會議論文篇章同行評審

2 引文 斯高帕斯(Scopus)

摘要

Grapheme-to-phoneme (G2P) conversion plays an important role in speech synthesis. The main difficulty facing Chinese G2P conversion is that many Chinese characters are polyphonic, having more than one pronunciation. A Chinese G2P system must be able to pick the correct pronunciation from among several candidates. Contextual information on neighboring characters such as character n-grams, phonetic information, or position of the polyphone in a word or sentence is the key to correct prediction. Most previous works employed rule-based or rule-learning methods, which often suffered from data sparseness. In this paper, we propose a novel G2P approach to avoid data sparseness. Our method uses the maximum entropy (ME) model framework to represent contextual information as ME features. Our system achieves a top accuracy of 99.84%, which is significantly higher than other state-of-the-art rule-based and rule-learning methods. In addition, our approach consistently improves accuracy regardless of a character's main pronunciation ratio. Further analysis also shows that the ME model is fast and efficient, requiring much less training and labeling time.

原文???core.languages.en_GB???
主出版物標題2009 IEEE International Conference on Information Reuse and Integration, IRI 2009
頁面411-416
頁數6
DOIs
出版狀態已出版 - 2009
事件2009 IEEE International Conference on Information Reuse and Integration, IRI 2009 - Las Vegas, NV, United States
持續時間: 10 8月 200912 8月 2009

出版系列

名字2009 IEEE International Conference on Information Reuse and Integration, IRI 2009

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2009 IEEE International Conference on Information Reuse and Integration, IRI 2009
國家/地區United States
城市Las Vegas, NV
期間10/08/0912/08/09

指紋

深入研究「A maximum entropy approach to Chinese grapheme-to-phoneme conversion」主題。共同形成了獨特的指紋。

引用此