Emotional Speech Analysis Based on Convolutional Neural Networks

Yi Chin Kao, Chung Ting Li, Tzu Chiang Tai, Jia Ching Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In recent studies, speech emotion recognition has been an intriguing and arduous area of research in human behavior analysis. The goal of this research area is to classify people's emotional states according to their speech tones. At present, the research area focuses on identifying the effectiveness of automatic classifiers of speech emotions to improve the classification efficiency in practical applications, e.g., for use in telecommunication services, identifying positive emotions (e.g., happy, surprise) and negative emotions (e.g., sad, angry, disgust, and fear), which can supply a large number of valid information for platform users and customers of telecommunication services.In this paper, the complex task of identifying positive and negative emotions in human voice data is investigated by using deep learning techniques. Five open emotion speech datasets are used to train multi-level models for positive and negative emotion recognition. The experimental results shows that our model can obtain good results for both positive and negative emotion speech data.

Original languageEnglish
Title of host publication2021 9th International Conference on Orange Technology, ICOT 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665478427
DOIs
StatePublished - 2021
Event9th International Conference on Orange Technology, ICOT 2021 - Tainan, Taiwan
Duration: 16 Dec 202117 Dec 2021

Publication series

Name2021 9th International Conference on Orange Technology, ICOT 2021

Conference

Conference9th International Conference on Orange Technology, ICOT 2021
Country/TerritoryTaiwan
CityTainan
Period16/12/2117/12/21

Keywords

  • Convolutional Neural Network (CNN)
  • Emotion classification
  • Speech detection

Fingerprint

Dive into the research topics of 'Emotional Speech Analysis Based on Convolutional Neural Networks'. Together they form a unique fingerprint.

Cite this