TY - JOUR
T1 - Vowel quality scoring on speech rehabilitation assistance
AU - Syauqy, Dahnial
AU - Wu, Chao Min
AU - Setyawati, Onny
N1 - Publisher Copyright:
© Research India Publications.
PY - 2014
Y1 - 2014
N2 - This paper attempted to develop a tool to assist speech therapy and rehabilitation which focused on vowel quality analysis of the speech. The tool was designed to extract the speech features information to determine the vowel quality of the patient and compare it with a normal speech recording. In order to help the assessment to be done by a basic user without particular knowledge of speech processing, the tool was designed in simple interface. However, the tool also provided deep analysis of the speech which can be useful for the speech therapist. Two speech features including pitch and formants were used as input information to classify the vowel of voiced speech segment which became the comparison quantity with another particular template of speech. Cepstrum based pitch tracking algorithm were used to estimate the pitch. Then, two popular classification methods, KNearest Neighbor (K-NN) and Multilayer Perceptron (MLP) were investigated and compared as the vowel classification algorithm. Finally, the vowel features similarity between both speeches was quantified and overall score was made. For the vowel classification algorithm, MLP method provided better accuracy (92.61% for men, 86.75% for women and 83.75% for children) compared to K-NN method (91.67%, 86.21% and 80.69%) and up to 5-times faster in the computation time. The overall result also indicated the advantage of the tool for both patient and therapist by using provided simple and professional mode.
AB - This paper attempted to develop a tool to assist speech therapy and rehabilitation which focused on vowel quality analysis of the speech. The tool was designed to extract the speech features information to determine the vowel quality of the patient and compare it with a normal speech recording. In order to help the assessment to be done by a basic user without particular knowledge of speech processing, the tool was designed in simple interface. However, the tool also provided deep analysis of the speech which can be useful for the speech therapist. Two speech features including pitch and formants were used as input information to classify the vowel of voiced speech segment which became the comparison quantity with another particular template of speech. Cepstrum based pitch tracking algorithm were used to estimate the pitch. Then, two popular classification methods, KNearest Neighbor (K-NN) and Multilayer Perceptron (MLP) were investigated and compared as the vowel classification algorithm. Finally, the vowel features similarity between both speeches was quantified and overall score was made. For the vowel classification algorithm, MLP method provided better accuracy (92.61% for men, 86.75% for women and 83.75% for children) compared to K-NN method (91.67%, 86.21% and 80.69%) and up to 5-times faster in the computation time. The overall result also indicated the advantage of the tool for both patient and therapist by using provided simple and professional mode.
KW - Computer assisted speech therapy
KW - Speech disorder
KW - Speech processing
KW - Vowel classification
UR - http://www.scopus.com/inward/record.url?scp=84941098455&partnerID=8YFLogxK
M3 - 期刊論文
AN - SCOPUS:84941098455
SN - 0973-4562
VL - 9
SP - 27199
EP - 27210
JO - International Journal of Applied Engineering Research
JF - International Journal of Applied Engineering Research
IS - 24
ER -