Self-supervised Learning and Masked Language Model for Code-switching Automatic Speech Recognition

Po Kai Chen, Li Yeh Fu, Cheng Kai Chen, Yi Xing Lin, Chih Ping Chen, Chien Lin Huang, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

摘要

Code-switching (CS) is a common linguistic phenomenon that poses significant challenges for automatic speech recognition systems due to the lack of corpus. In this paper, we propose a novel approach to address this challenge by leveraging self-supervised learning (SSL) and the masked language model (MLM) in speech recognition. Specifically, we use the wav2vec 2.0 pre-trained model to reduce the dependency on CS labeled data, and the MLM to rerank sentences generated using beam search decoding. Our proposed method is evaluated on the SEAME dataset, and experimental results show that it outperforms state-of-the-art CS speech recognition approaches by 15.6% and 19.9% in terms of token error rates (TER). Moreover, the proposed method is generalizable and can be extended to other CS languages. These results demonstrate the effectiveness of our approach and its potential for future research in the field of CS speech recognition.

原文???core.languages.en_GB???
主出版物標題ICCE 2024 - 2024 IEEE 10th International Conference on Communications and Electronics
編輯Seong Ho Jeong, Ho Dac Loc, Serge Fdida, Tho Le-Ngoc
發行者Institute of Electrical and Electronics Engineers Inc.
頁面387-391
頁數5
ISBN(電子)9798350379785
DOIs
出版狀態已出版 - 2024
事件10th IEEE International Conference on Communications and Electronics, ICCE 2024 - Da Nang City, Viet Nam
持續時間: 31 7月 20242 8月 2024

出版系列

名字ICCE 2024 - 2024 IEEE 10th International Conference on Communications and Electronics

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???10th IEEE International Conference on Communications and Electronics, ICCE 2024
國家/地區Viet Nam
城市Da Nang City
期間31/07/242/08/24

指紋

深入研究「Self-supervised Learning and Masked Language Model for Code-switching Automatic Speech Recognition」主題。共同形成了獨特的指紋。

引用此