Experience Mining: Building a Large-Scale Database of Personal Experiences and Opinions from Web Documents

Authors:
Kentaro Inui;Shuya Abe;Kazuo Hara;Hiraku Morita;Chitose Sao;Megumi Eguchi;Asuka Sumida;Koji Murakami;Suguru Matsuyoshi
Affiliations:
-;-;-;-;-;-;-;-;-
Venue:
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Year:
2008

Citing 11
Cited 14

Mining the peanut gallery: opinion extraction and semantic classification of product reviews

WWW '03 Proceedings of the 12th international conference on World Wide Web
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Large Margin Methods for Structured and Interdependent Output Variables

The Journal of Machine Learning Research
A temporal constraint structure for extracting temporal information from clinical narrative

Journal of Biomedical Informatics
A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Extracting semantic orientations of words using spin model

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Deeper sentiment analysis using machine translation technology

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Extracting product features and opinions from reviews

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Determining Modality and Factuality for Text Entailment

ICSC '07 Proceedings of the International Conference on Semantic Computing
Joint extraction of entities and relations for opinion recognition

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing

Statement map: assisting information crediblity analysis by visualizing arguments

Proceedings of the 3rd workshop on Information credibility on the web
TEXT2TABLE: medical text summarization system based on named entity recognition and modality identification

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Detecting experiences from weblogs

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Mining personal experiences and opinions from Web documents

Web Intelligence and Agent Systems
Extracting hidden information based on comparing web with UGC

WISS'10 Proceedings of the 2010 international conference on Web information systems engineering
Learning opinions in user-generated web content

Natural Language Engineering
Solution mining for specific contextualised problems: towards an approach for experience mining

Proceedings of the 21st international conference companion on World Wide Web
Extracting tip information from social media

Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
Spatio-Temporal Web Sensors by Social Network Analysis

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Automatic organization of human task goals for web-scale problem solving knowledge

Proceedings of the seventh international conference on Knowledge capture
Normalizing Complex Functional Expressions in Japanese Predicates: Linguistically-Directed Rule-Based Paraphrasing and Its Application

ACM Transactions on Asian Language Information Processing (TALIP)
Toward advice mining: conditional random fields for extracting advice-revealing text units

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Two Phase Extraction Method for Multi-label Classification of Real Life Tweets

Proceedings of International Conference on Information Integration and Web-based Applications & Services
External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a new UGC-oriented language technology application, which we call experience mining. Experience mining aims at automatically collecting instances of personal experiences as well as opinions from an explosive number of user generated contents (UGCs) such as weblog and forum posts and storing them in an experience database with semantically rich indices. After arguing the technical issues of this new task, we focus on the central problem, factuality analysis, among others and propose a machine learning-based solution as well as the task definition itself. Our empirical evaluation indicates that our factuality analysis task is sufficiently well-defined to achieve a high inter-annotator agreement and our Factorial CRF-based model considerably outperforms the baseline. We also present an application system, which currently stores over 30M experience instances extracted from 150M Japanese blog posts with semantic indices and is scheduled to start serving as an experience search engine for unrestricted users in October.