Chinese EmoBank: Building Valence-Arousal Resources for Dimensional Sentiment Analysis

Lung Hao Lee, Jian Hong Li, Liang Chih Yu

研究成果: 雜誌貢獻期刊論文同行評審

摘要

An increasing amount of research has recently focused on dimensional sentiment analysis that represents affective states as continuous numerical values on multiple dimensions, such as valence-Arousal (VA) space. Compared to the categorical approach that represents affective states as distinct classes (e.g., positive and negative), the dimensional approach can provide more fine-grained (real-valued) sentiment analysis. However, dimensional sentiment resources with valence-Arousal ratings are very rare, especially for the Chinese language. Therefore, this study aims to: (1) Build a Chinese valence-Arousal resource called Chinese EmoBank, the first Chinese dimensional sentiment resource featuring various levels of text granularity including 5,512 single words, 2,998 multi-word phrases, 2,582 single sentences, and 2,969 multi-sentence texts. The valence-Arousal ratings are annotated by crowdsourcing based on the Self-Assessment Manikin (SAM) rating scale. A corpus cleanup procedure is then performed to improve annotation quality by removing outlier ratings and improper texts. (2) Evaluate the proposed resource using different categories of classifiers such as lexicon-based, regression-based, and neural-network-based methods, and comparing their performance to a similar evaluation of an English dimensional sentiment resource.

原文???core.languages.en_GB???
文章編號65
期刊ACM Transactions on Asian and Low-Resource Language Information Processing
21
發行號4
DOIs
出版狀態已出版 - 7月 2022

指紋

深入研究「Chinese EmoBank: Building Valence-Arousal Resources for Dimensional Sentiment Analysis」主題。共同形成了獨特的指紋。

引用此