Construction of a blog emotion corpus for Chinese emotional expression analysis
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
A blog emotion corpus for emotional expression analysis in Chinese
Computer Speech and Language
Hi-index | 0.00 |
Mood classification for blogs is useful in helping user-to-agent interaction for a variety of applications involving the web, such as user modeling, recommendation systems, and user interface fields. It is challenging at the same time because of the diversity of the characteristics of bloggers, their experiences, and the way moods are expressed. As an attempt to handle the diversity, we combine multiple sources of evidence for a mood type. Support Vector Machine based Mood Classifier (SVMMC) is integrated with Mood Flow Analyzer (MFA) that incorporates commonsense knowledge obtained from the general public (i.e. ConceptNet), the Affective Norms English Words (ANEW) list, and mood transitions. In combining the two different approaches, we employ a statistically weighted voting scheme based on the Support Vector Machine (SVM). For evaluation, we have built a mood corpus consisting of manually annotated blogs, which amounts to over 4000 blogs. Our proposed method outperforms SVMMC by 5.68% in precision. The improvement is attributed to the strategy of choosing more trustable classification results in an interleaving fashion between the SVMMC and our MFA.