Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction

An Dang, Toan H. Vu, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

17 引文 斯高帕斯(Scopus)

摘要

Audio scenes are often composed of a variety of sound events from different sources. Their content exhibits wide variations in both frequency and time domain. Convolutional neural networks (CNNs) provide an effective way to extract spatial information of multidimensional data such as image, audio, and video; they have the ability to learn hierarchical representation from time-frequency features of audio signals. In this paper, we develop a convolutional neural network and employ a multi-scale multi-feature extraction methods for acoustic scene classification. We conduct experiments on the TUT Acoustic Scenes 2016 dataset. Experimental results show that the use of multi-scale multi-feature extraction methods improves significantly the performance of the system. Our proposed approach obtains a high accuracy of 85.9% that outperforms the baseline approach by a large margin of 8.7%.

原文???core.languages.en_GB???
主出版物標題2018 IEEE International Conference on Consumer Electronics, ICCE 2018
編輯Saraju P. Mohanty, Peter Corcoran, Hai Li, Anirban Sengupta, Jong-Hyouk Lee
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1-4
頁數4
ISBN(電子)9781538630259
DOIs
出版狀態已出版 - 26 3月 2018
事件2018 IEEE International Conference on Consumer Electronics, ICCE 2018 - Las Vegas, United States
持續時間: 12 1月 201814 1月 2018

出版系列

名字2018 IEEE International Conference on Consumer Electronics, ICCE 2018
2018-January

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2018 IEEE International Conference on Consumer Electronics, ICCE 2018
國家/地區United States
城市Las Vegas
期間12/01/1814/01/18

指紋

深入研究「Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction」主題。共同形成了獨特的指紋。

引用此