TY - GEN
T1 - Emotional Speech Analysis Based on Convolutional Neural Networks
AU - Kao, Yi Chin
AU - Li, Chung Ting
AU - Tai, Tzu Chiang
AU - Wang, Jia Ching
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - In recent studies, speech emotion recognition has been an intriguing and arduous area of research in human behavior analysis. The goal of this research area is to classify people's emotional states according to their speech tones. At present, the research area focuses on identifying the effectiveness of automatic classifiers of speech emotions to improve the classification efficiency in practical applications, e.g., for use in telecommunication services, identifying positive emotions (e.g., happy, surprise) and negative emotions (e.g., sad, angry, disgust, and fear), which can supply a large number of valid information for platform users and customers of telecommunication services.In this paper, the complex task of identifying positive and negative emotions in human voice data is investigated by using deep learning techniques. Five open emotion speech datasets are used to train multi-level models for positive and negative emotion recognition. The experimental results shows that our model can obtain good results for both positive and negative emotion speech data.
AB - In recent studies, speech emotion recognition has been an intriguing and arduous area of research in human behavior analysis. The goal of this research area is to classify people's emotional states according to their speech tones. At present, the research area focuses on identifying the effectiveness of automatic classifiers of speech emotions to improve the classification efficiency in practical applications, e.g., for use in telecommunication services, identifying positive emotions (e.g., happy, surprise) and negative emotions (e.g., sad, angry, disgust, and fear), which can supply a large number of valid information for platform users and customers of telecommunication services.In this paper, the complex task of identifying positive and negative emotions in human voice data is investigated by using deep learning techniques. Five open emotion speech datasets are used to train multi-level models for positive and negative emotion recognition. The experimental results shows that our model can obtain good results for both positive and negative emotion speech data.
KW - Convolutional Neural Network (CNN)
KW - Emotion classification
KW - Speech detection
UR - http://www.scopus.com/inward/record.url?scp=85126199247&partnerID=8YFLogxK
U2 - 10.1109/ICOT54518.2021.9680651
DO - 10.1109/ICOT54518.2021.9680651
M3 - 會議論文篇章
AN - SCOPUS:85126199247
T3 - 2021 9th International Conference on Orange Technology, ICOT 2021
BT - 2021 9th International Conference on Orange Technology, ICOT 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Orange Technology, ICOT 2021
Y2 - 16 December 2021 through 17 December 2021
ER -