TY - GEN
T1 - Source separation using dictionary learning and deep recurrent neural network with locality preserving constraint
AU - Pham, Tuan
AU - Lee, Yuan Shan
AU - Mathulaprangsan, Seksan
AU - Wang, Jia Ching
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/8/28
Y1 - 2017/8/28
N2 - Deep learning is a popular method for monaural source separation, and especially for extracting a singing voice from a single-channel song. However, deep learning-based source separation ignores the geometrical structure of the input data. This work develops a novel approach to source separation that is based on non-negative matrix factorization (NMF) and deep recurrent neural networks (DRNN) with a locality-preserving constraint. First, NMF was used to learn patterns from training data. The learned patterns are linearly combined with the output of DRNN. Second, a locality-preserving constraint is developed to exploit the inner-structure of the input data in the DRNN learning process. Experimental results obtained using the MIR-1K dataset reveal that the proposed algorithm outperforms the baselines.
AB - Deep learning is a popular method for monaural source separation, and especially for extracting a singing voice from a single-channel song. However, deep learning-based source separation ignores the geometrical structure of the input data. This work develops a novel approach to source separation that is based on non-negative matrix factorization (NMF) and deep recurrent neural networks (DRNN) with a locality-preserving constraint. First, NMF was used to learn patterns from training data. The learned patterns are linearly combined with the output of DRNN. Second, a locality-preserving constraint is developed to exploit the inner-structure of the input data in the DRNN learning process. Experimental results obtained using the MIR-1K dataset reveal that the proposed algorithm outperforms the baselines.
KW - Deep recurrent neural network
KW - Locality preserving constraint
KW - Nonnegative matrix factorization
UR - http://www.scopus.com/inward/record.url?scp=85030235324&partnerID=8YFLogxK
U2 - 10.1109/ICME.2017.8019516
DO - 10.1109/ICME.2017.8019516
M3 - 會議論文篇章
AN - SCOPUS:85030235324
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
SP - 151
EP - 156
BT - 2017 IEEE International Conference on Multimedia and Expo, ICME 2017
PB - IEEE Computer Society
T2 - 2017 IEEE International Conference on Multimedia and Expo, ICME 2017
Y2 - 10 July 2017 through 14 July 2017
ER -