摘要
This work presents an access control system, which is a speaker identification system based on whispered speech. Speaker identification is a main function of an access control system. Hence, a novel speaker identification system using instantaneous frequencies is proposed. The input speech signals pass through both signal independent and signal dependent filters firstly. Then, we derive the signal's instantaneous frequencies by applying the Hilbert transform. The analyzed instantaneous frequencies are proceeded to be modeled as probability density models. We use these probability density models as the feature in the proposed speaker identification system. In this work, we compare the use of parametric and nonparametric probability density estimation for instantaneous frequency modeling. Furthermore, we propose an approximated probability product kernel support vector machine (APPKSVM). In the APPKSVM, Riemann sum is applied in approximating the probability product kernel. The whisper sounds from the CHAIN speech corpus were used in the experiments. Results of the experiments show the superiority of the proposed speaker identification system.
原文 | ???core.languages.en_GB??? |
---|---|
文章編號 | 7229351 |
頁(從 - 到) | 1191-1199 |
頁數 | 9 |
期刊 | IEEE Transactions on Automation Science and Engineering |
卷 | 12 |
發行號 | 4 |
DOIs | |
出版狀態 | 已出版 - 10月 2015 |