Predicting the Probability Density Function of Music Emotion Using Emotion Space Mapping

Yu Hao Chin, Jia Ching Wang, Ju Chiang Wang, Yi Hsuan Yang

研究成果: 雜誌貢獻期刊論文同行評審

6 引文 斯高帕斯(Scopus)


Computationally modeling the affective content of music has been intensively studied in recent years because of its wide applications in music retrieval and recommendation. Although significant progress has been made, this task remains challenging due to the difficulty in properly characterizing the emotion of a music piece. Music emotion perceived by people is subjective by nature and thus complicates the process of collecting the emotion annotations as well as developing the predictive model. Instead of assuming people can reach a consensus on the emotion of music, in this work we propose a novel machine learning approach that characterizes the music emotion as a probability distribution in the valence-Arousal (VA) emotion space, not only tackling the subjectivity but also precisely describing the emotions of a music piece. Specifically, we represent the emotion of a music piece as a probability density function (PDF) in the VA space via kernel density estimation from human annotations. To associate emotion with the audio features extracted from music pieces, we learn the combination coefficients by optimizing some objective functions of audio features, and then predict the emotion of an unseen piece by linearly combining the PDFs of the training pieces with the coefficients. Several algorithms for learning the coefficients are studied. Evaluations on the NTUMIR and MediaEval 2013 datasets validate the effectiveness of the proposed methods in predicting the probability distributions of emotion from audio features. We also demonstrate how to use the proposed approach in emotion-based music retrieval.

頁(從 - 到)541-549
期刊IEEE Transactions on Affective Computing
出版狀態已出版 - 1 10月 2018


深入研究「Predicting the Probability Density Function of Music Emotion Using Emotion Space Mapping」主題。共同形成了獨特的指紋。