Abstract
This article proposes two novel deep convolutional neural networks (CNNs), the sparse coding convolutional neural network (SC-CNN) and the multi-convolutional-channel SC-CNN (MSC-CNN), to address the sound event recognition and retrieval problem. Unlike the general CNN framework, in which feature learning is performed hierarchically, the proposed framework models the whole memorization process in the human brain, including encoding, storage, and recollection. In particular, the MSC-CNN is designed to recognize multiple sound events that occur simultaneously. The experimental results indicate that the proposed SC-CNN and MSC-CNN outperform state-of-the-art systems in sound event recognition and retrieval.
Original language | English |
---|---|
Article number | 8952659 |
Pages (from-to) | 1875-1887 |
Number of pages | 13 |
Journal | IEEE/ACM Transactions on Audio Speech and Language Processing |
Volume | 28 |
DOIs | |
State | Published - 2020 |
Keywords
- Sound event recognition
- deep learning
- sound event retrieval
- sparse coding convolutional neural network
Fingerprint
Dive into the research topics of 'Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks'. Together they form a unique fingerprint.
Projects
- 2 Finished
Deep Intelligence Based Spoken Language Processing (II)
Wang, J.-C. (PI)
1/01/19 → 31/12/19
Project: Research