摘要
In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from psycho-acoustic model and incorporate it in the subspace-based enhancement approach. This filterbank is created through a five-level wavelet packet decomposition. The masking properties of the human auditory system are then derived based on the perceptual filterbank. Finally, the prior SNR and the masking threshold of each critical band are taken to decide the attenuation factor of the optimal linear estimator. Five different types of in-car noises in TAICAR database were used in our evaluation. The experimental results demonstrated that our approach outperformed conventional subspace and spectral subtraction methods.
原文 | ???core.languages.en_GB??? |
---|---|
頁(從 - 到) | 1055-1062 |
頁數 | 8 |
期刊 | IEICE Transactions on Information and Systems |
卷 | E90-D |
發行號 | 7 |
DOIs | |
出版狀態 | 已出版 - 7月 2007 |