Lightweight End-To-End Deep Learning Model For Music Source Separation

Yao Ting Wang, Yi Xing Lin, Kai Wen Liang, Tzu Chiang Tai, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

摘要

In this work, we propose a lightweight end-to-end music source separation deep learning model. Deep learning models for audio source separation based on time-domain have been proposed for end-to-end processing. However, the proposed models are complex and difficult to use when the computing resources of the device are limited. Additionally, long delays may be expected since long-term inputs are required to obtain adequate results for separation, making the models unsuitable for applications that require low latency. In the proposed model, Atrous Spatial Pyramid Pooling is used to reduce the number of parameters, and the receptive field preserving decoder is utilized to enhance the result of separation while the input context length is limited. The experimental results show that the proposed method obtains better results than previous methods while using 10% or fewer parameters.

原文???core.languages.en_GB???
主出版物標題2022 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022
編輯Kong Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong
發行者Institute of Electrical and Electronics Engineers Inc.
頁面315-318
頁數4
ISBN(電子)9798350397963
DOIs
出版狀態已出版 - 2022
事件13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022 - Singapore, Singapore
持續時間: 11 12月 202214 12月 2022

出版系列

名字2022 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022
國家/地區Singapore
城市Singapore
期間11/12/2214/12/22

指紋

深入研究「Lightweight End-To-End Deep Learning Model For Music Source Separation」主題。共同形成了獨特的指紋。

引用此