Discriminative Vector Learning with Application to Single Channel Speech Separation

Ha Minh Tan, Kai Wen Liang, Jia Ching Wang

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

In this paper, we introduce a discriminative vector learning method and apply it to single-channel speech separation. First, speech samples are transformed into discriminative vectors using two backbone networks. These vectors are easily separated by simple clustering algorithms. Among them, vectors with lower similarity are separated into different clusters, while vectors in the same cluster have higher similarity. This property is very important in image segmentation, audio separation, and data clustering problems. In our work, we design the network architecture to improve the discriminativeness of vectors through learning, taking this task as spectrogram segmentation. Experiments show that our method significantly improves performance compared to other deep clustering methods for speech separation.

Fingerprint

Dive into the research topics of 'Discriminative Vector Learning with Application to Single Channel Speech Separation'. Together they form a unique fingerprint.

Cite this