TY - GEN
T1 - Customized Wake-Up Word with Key Word Spotting using Convolutional Neural Network
AU - Tsai, Tsung Han
AU - Hao, Ping Cheng
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - In this paper, a customized wake-up word system combined with key word spotting using neural network was proposed. This system is divided into three phases: Training wake-up word phase, detecting wake-up word phase and key word spotting phase. In training phase, user can say any word in any language and system will automatically count how many syllable of this word. If several syllables are in the range, system will accept this customized wake-up word. Next, the word will be extracted the features by Mel-Frequency Cepstral Coefficients (MFCC) method. It can be used for speaker model, speech model and state sequence for next phase. In detecting phase, system detects an unknown voice segment and compares it with models. After these steps, system will determine to wake up or not. If user says the right wake-up word, system goes to next phase. In key word spotting phase, the command words are fixed. The system is designed using convolutional neural network for key word spotting model. Moreover, all processes are executed without Internet to protect user privacy. This system can give a good result with a very small amount of wake-up word training data, and run in real-Time.
AB - In this paper, a customized wake-up word system combined with key word spotting using neural network was proposed. This system is divided into three phases: Training wake-up word phase, detecting wake-up word phase and key word spotting phase. In training phase, user can say any word in any language and system will automatically count how many syllable of this word. If several syllables are in the range, system will accept this customized wake-up word. Next, the word will be extracted the features by Mel-Frequency Cepstral Coefficients (MFCC) method. It can be used for speaker model, speech model and state sequence for next phase. In detecting phase, system detects an unknown voice segment and compares it with models. After these steps, system will determine to wake up or not. If user says the right wake-up word, system goes to next phase. In key word spotting phase, the command words are fixed. The system is designed using convolutional neural network for key word spotting model. Moreover, all processes are executed without Internet to protect user privacy. This system can give a good result with a very small amount of wake-up word training data, and run in real-Time.
KW - convolutional neural network
KW - customized wake-up-word
KW - gaussian mixture model
KW - hidden markov model
KW - mel-frequency cepstral coefficients
UR - http://www.scopus.com/inward/record.url?scp=85082988543&partnerID=8YFLogxK
U2 - 10.1109/ISOCC47750.2019.9027708
DO - 10.1109/ISOCC47750.2019.9027708
M3 - 會議論文篇章
AN - SCOPUS:85082988543
T3 - Proceedings - 2019 International SoC Design Conference, ISOCC 2019
SP - 136
EP - 137
BT - Proceedings - 2019 International SoC Design Conference, ISOCC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International System-on-Chip Design Conference, ISOCC 2019
Y2 - 6 October 2019 through 9 October 2019
ER -