TY - JOUR
T1 - Discriminative Vector Learning with Application to Single Channel Speech Separation
AU - Tan, Ha Minh
AU - Liang, Kai Wen
AU - Wang, Jia Ching
N1 - Publisher Copyright: © 2023 IEEE.
PY - 2023
Y1 - 2023
AB - In this paper, we introduce a discriminative vector learning method and apply it to single-channel speech separation. First, speech samples are transformed into discriminative vectors using two backbone networks. These vectors are easily separated by simple clustering algorithms. Among them, vectors with lower similarity are separated into different clusters, while vectors in the same cluster have higher similarity. This property is very important in image segmentation, audio separation, and data clustering problems. In our work, we design the network architecture to improve the discriminativeness of vectors through learning, taking this task as spectrogram segmentation. Experiments show that our method significantly improves performance compared to other deep clustering methods for speech separation.
UR - http://www.scopus.com/inward/record.url?scp=85180542328&partnerID=8YFLogxK
U2 - 10.1109/ICASSP49357.2023.10096181
DO - 10.1109/ICASSP49357.2023.10096181
M3 - Conference paper
AN - SCOPUS:85180542328
SN - 1520-6149
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
T2 - 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Y2 - 4 June 2023 through 10 June 2023
ER -