Fast Gated Recurrent Network for Speech Synthesis

Bima Prihasto, Tzu Chiang Tai, Pao Chi Chang, Jia Ching Wang

研究成果: 雜誌貢獻期刊論文同行評審

摘要

The recurrent neural network (RNN) has been used in audio and speech processing, such as language translation and speech recognition. Although RNN-based architecture can be applied to speech synthesis, the long computing time is still the primary concern. This research proposes a fast gated recurrent neural network, a fast RNN-based architecture, for speech synthesis based on the minimal gated unit (MGU). Our architecture removes the unit state history from some equations in MGU. OurMGU-based architecture is about twice faster, with equally good sound quality than the other MGU-based architectures.

原文???core.languages.en_GB???
頁(從 - 到)1634-1638
頁數5
期刊IEICE Transactions on Information and Systems
E105D
發行號9
DOIs
出版狀態已出版 - 9月 2022

指紋

深入研究「Fast Gated Recurrent Network for Speech Synthesis」主題。共同形成了獨特的指紋。

引用此