Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks

Chien Yao Wang, Tzu Chiang Tai, Jia Ching Wang, Andri Santoso, Seksan Mathulaprangsan, Chin Chin Chiang, Chung Hsien Wu

研究成果: 雜誌貢獻期刊論文同行評審

13 引文 斯高帕斯(Scopus)

摘要

This article proposes two novel deep convolutional neural networks (CNN), which are called the sparse coding convolutional neural network (SC-CNN) and the multi-convolutional-channel SC-CNN (MSC-CNN), to address the sound event recognition and retrieval problem. Unlike the general framework of a CNN, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorization process in the human brain, including encoding, storage, and recollection. In particular, the MSC-CNN is designed to recognize multiple sound events that occur simultaneously. The experimental results indicate that the proposed SC-CNN and MSC-CNN outperforms the state-of-the-art systems in sound event recognition and retrieval.

原文???core.languages.en_GB???
文章編號8952659
頁(從 - 到)1875-1887
頁數13
期刊IEEE/ACM Transactions on Audio Speech and Language Processing
28
DOIs
出版狀態已出版 - 2020

指紋

深入研究「Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks」主題。共同形成了獨特的指紋。

引用此