Critical band subspace-based speech enhancement using SNR and auditory masking aware technique

Jia Ching Wang, Hsiao Ping Lee, Jhing Fa Wang, Chung Hsien Yang

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from psycho-acoustic model and incorporate it in the subspace-based enhancement approach. This filterbank is created through a five-level wavelet packet decomposition. The masking properties of the human auditory system are then derived based on the perceptual filterbank. Finally, the prior SNR and the masking threshold of each critical band are taken to decide the attenuation factor of the optimal linear estimator. Five different types of in-car noises in TAICAR database were used in our evaluation. The experimental results demonstrated that our approach outperformed conventional subspace and spectral subtraction methods.

Original languageEnglish
Pages (from-to)1055-1062
Number of pages8
JournalIEICE Transactions on Information and Systems
VolumeE90-D
Issue number7
DOIs
StatePublished - Jul 2007

Keywords

  • Human auditory system
  • In-car noise
  • Karhunen-loeve transform (KLT)
  • Perceptual filterbank
  • Signal subspace
  • Speech enhancement
  • Wavelet transform

Fingerprint

Dive into the research topics of 'Critical band subspace-based speech enhancement using SNR and auditory masking aware technique'. Together they form a unique fingerprint.

Cite this