Discriminative Vector Learning with Application to Single Channel Speech Separation

Ha Minh Tan, Kai Wen Liang, Jia Ching Wang

研究成果: 雜誌貢獻會議論文同行評審

1 引文 斯高帕斯(Scopus)


In this paper, we introduce a discriminative vector learning method and apply it to single-channel speech separation. First, speech samples are transformed into discriminative vectors using two backbone networks. These vectors are easily separated by simple clustering algorithms. Among them, vectors with lower similarity are separated into different clusters, while vectors in the same cluster have higher similarity. This property is very important in image segmentation, audio separation, and data clustering problems. In our work, we design the network architecture to improve the discriminativeness of vectors through learning, taking this task as spectrogram segmentation. Experiments show that our method significantly improves performance compared to other deep clustering methods for speech separation.


深入研究「Discriminative Vector Learning with Application to Single Channel Speech Separation」主題。共同形成了獨特的指紋。