Kernel weighted Fisher sparse analysis on multiple maps for audio event recognition

Yu Hao Chin, Bo Wei Chen, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

摘要

This work presents a novel approach for audio event recognition. The approach develops a weighted kernel fisher sparse analysis method based on multiple maps. The proposed method consists of maps extraction and kernel weighted Fisher sparse analysis. Two maps are firstly extracted from each audio file, i.e. scale-frequency map and damping-frequency map. The scale and frequency of the Gabor atoms are extracted to construct a scale-frequency map. On the other hand, the damping-frequency map is generated according to the frequency and damping factor of damped atoms. Gabor atoms can be utilized to model human auditory perception, and the damped atoms can be used to model commonly observed damped oscillations in natural signals. This work fuses the advantages of these two dictionaries to improve the performance of the system. During the recognition stage, this work constructs a kernel sparse representation-based classifier via the proposed kernel weighted Fisher sparse analysis to enhance separability. The proposed kernel weighted Fisher sparse analysis combines sparse representation with heteroscedastic kernel weighted discriminant analysis (HKWDA), which is useful for providing a discriminative recognition of audio events because a weighted pairwise Chernoff criterion is utilized in the kernel space. Experiments on a 20-class audio event database indicate that the proposed approach can achieve an accuracy rate of 82.70%. Also, integrating the scale-frequency map with MFCCs increases the accuracy rate to 87.70%.

原文???core.languages.en_GB???
主出版物標題2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面6010-6014
頁數5
ISBN(電子)9781509041176
DOIs
出版狀態已出版 - 16 6月 2017
事件2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - New Orleans, United States
持續時間: 5 3月 20179 3月 2017

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(列印)1520-6149

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017
國家/地區United States
城市New Orleans
期間5/03/179/03/17

指紋

深入研究「Kernel weighted Fisher sparse analysis on multiple maps for audio event recognition」主題。共同形成了獨特的指紋。

引用此