Reliable measures for aligning Japanese-English news articles and sentences
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
A fully-lexicalized probabilistic model for Japanese syntactic and case structure analysis
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Annotating attributions and private states
CorpusAnno '05 Proceedings of the Workshop on Frontiers in Corpus Annotations II: Pie in the Sky
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
A blog emotion corpus for emotional expression analysis in Chinese
Computer Speech and Language
Identifying expressions of emotion in text
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Statistical analysis of a Japanese emotion corpus for natural language processing
ICIC'06 Proceedings of the 2006 international conference on Intelligent computing: Part II
Affect analysis in context of characters in narratives
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
This paper presents our research on automatic annotation of a five-billion-word corpus of Japanese blogs with information on affect and sentiment. We first perform a study in emotion blog corpora to discover that there has been no large scale emotion corpus available for the Japanese language. We choose the largest blog corpus for the language and annotate it with the use of two systems for affect analysis: ML-Ask for word- and sentence-level affect analysis and CAO for detailed analysis of emoticons. The annotated information includes affective features like sentence subjectivity (emotive/non-emotive) or emotion classes (joy, sadness, etc.), useful in affect analysis. The annotations are also generalized on a 2-dimensional model of affect to obtain information on sentence valence/polarity (positive/negative) useful in sentiment analysis. The annotations are evaluated in several ways. Firstly, on a test set of a thousand sentences extracted randomly and evaluated by over forty respondents. Secondly, the statistics of annotations are compared to other existing emotion blog corpora. Finally, the corpus is applied in several tasks, such as generation of emotion object ontology or retrieval of emotional and moral consequences of actions.