LAMP, a lyrics and audio mandopop dataset for music mood estimation: Dataset compilation, system construction, and testing

Wei Rong Chu, Richard Tzong Han Tsai, Ying Shian Wu, Hui Hsin Wu, Hung Yi Chen, Jane Yung Jen Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Music mood estimation (MME) is an emerging subfield in music information retrieval research. Whereas most MME research focuses on audio analysis, exploring the significance of lyrics in predicting song emotion has been receiving more attention in recent years. One major impediment to MME research is the lack of clearly-labeled and publicly-available datasets of separately annotated lyrics and audio. In the first section of this paper, we describe the creation of the LAMP dataset, containing 492 mandarin pop songs with separate mood annotations for lyrics text and audio music. Our second contribution is to demonstrate with statistical analysis on the LAMP dataset how lyrics and audio contribute individually to a song's overall mood. Our analysis suggests that lyrics can serve as a valid measure for music mood estimation, especially in song valence, and provide supplementary mood information to audio. Thirdly, we propose the Sentiment Score Approach for extracting affective words from lyrics text and show that it is the most effective individual method for improving MME accuracy while reducing the number of features. Lastly, we combine our best lyrical feature configuration with audio features in an MME system for estimating song valence. This configuration outperforms audio-features-only by 16.517% and lyrical-features-only by 1.5%, suggesting strongly that lyrical features can be an important source of supplementary information for audio-music features when predicting song valence.

Original languageEnglish
Title of host publicationProceedings - International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2010
Pages53-59
Number of pages7
DOIs
StatePublished - 2010
Event2010 15th Conference on Technologies and Applications of Artificial Intelligence, TAAI 2010 - Hsinchu, Taiwan
Duration: 18 Nov 201020 Nov 2010

Publication series

NameProceedings - International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2010

Conference

Conference2010 15th Conference on Technologies and Applications of Artificial Intelligence, TAAI 2010
Country/TerritoryTaiwan
CityHsinchu
Period18/11/1020/11/10

Keywords

  • Lyrics
  • Mandarin pop song
  • Music information retrieval
  • Music mood estimation
  • Valence

Fingerprint

Dive into the research topics of 'LAMP, a lyrics and audio mandopop dataset for music mood estimation: Dataset compilation, system construction, and testing'. Together they form a unique fingerprint.

Cite this