Single-Channel Target Speaker Extraction System with Attention Enhancement

Yen Ting Lai, Yi En Lin, Pao Chi Chang, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

摘要

In this paper, we propose a system for single-channel target speaker extraction. We adopt a Temporal Convolutional Network (TCN) architecture as speech extraction model. We also import an attention enhancement to provide system more rich and efficient information. This can improve the extraction model to better estimate the mask of the target. With the better mask, the quality of the target speaker extraction is noticeably improved.

原文???core.languages.en_GB???
主出版物標題Proceedings - 2022 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2022
發行者Institute of Electrical and Electronics Engineers Inc.
頁面433-434
頁數2
ISBN(電子)9781665470506
DOIs
出版狀態已出版 - 2022
事件2022 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2022 - Taipei, Taiwan
持續時間: 6 7月 20228 7月 2022

出版系列

名字Proceedings - 2022 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2022

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2022 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2022
國家/地區Taiwan
城市Taipei
期間6/07/228/07/22

指紋

深入研究「Single-Channel Target Speaker Extraction System with Attention Enhancement」主題。共同形成了獨特的指紋。

引用此