Storing and analysing voice of the market data in the corporate data warehouse

Authors:
Lisette García-Moya;Shahad Kudama;María José Aramburu;Rafael Berlanga
Affiliations:
Temporal Knowledge Bases Group, Universitat Jaume I, Castellón, Spain;Temporal Knowledge Bases Group, Universitat Jaume I, Castellón, Spain;Temporal Knowledge Bases Group, Universitat Jaume I, Castellón, Spain;Temporal Knowledge Bases Group, Universitat Jaume I, Castellón, Spain
Venue:
Information Systems Frontiers
Year:
2013

Citing 25
Cited 1

Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Annotea: an open RDF infrastructure for shared Web annotations

Proceedings of the 10th international conference on World Wide Web
Mining and summarizing customer reviews

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Opinion observer: analyzing and comparing opinions on the Web

WWW '05 Proceedings of the 14th international conference on World Wide Web
Building the Data Warehouse

Building the Data Warehouse
Duplicate Record Detection: A Survey

IEEE Transactions on Knowledge and Data Engineering
ARSA: a sentiment-aware model for predicting sales performance using blogs

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Show me the money!: deriving the pricing power of product features by mining consumer reviews

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Wikify!: linking documents to encyclopedic knowledge

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Survey of Text Mining II: Clustering, Classification, and Retrieval

Survey of Text Mining II: Clustering, Classification, and Retrieval
Contextualizing data warehouses with documents

Decision Support Systems
Open information extraction from the web

Communications of the ACM - Surviving the data deluge
Opinion analysis for business intelligence applications

OBI '08 Proceedings of the first international workshop on Ontology-supported business intelligence
Towards a Data Warehouse Contextualized with Web Opinions

ICEBE '08 Proceedings of the 2008 IEEE International Conference on e-Business Engineering
Generating complex ontology instances from documents

Journal of Algorithms
Enhanced Business Intelligence using EROCS

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
A generalized Co-HITS algorithm and its application to bipartite graphs

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Semantic annotation, indexing, and retrieval

Web Semantics: Science, Services and Agents on the World Wide Web
Semantic annotation for knowledge management: Requirements and a survey of the state of the art

Web Semantics: Science, Services and Agents on the World Wide Web
Latent aspect rating analysis on review text data: a rating regression approach

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Supporting natural language processing with background knowledge: coreference resolution case

ISWC'10 Proceedings of the 9th international semantic web conference on The semantic web - Volume Part I
Extracting and ranking product features in opinion documents

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Automatic construction of a context-aware sentiment lexicon: an optimization approach

Proceedings of the 20th international conference on World wide web
Probabilistic ranking of product features from customer reviews

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Towards Tailored Semantic Annotation Systems from Wikipedia

DEXA '11 Proceedings of the 2011 22nd International Workshop on Database and Expert Systems Applications

Business Intelligence and the Web

Information Systems Frontiers

Quantified Score

Hi-index	0.00

Visualization

Abstract

Web opinion feeds have become one of the most popular information sources users consult before buying products or contracting services. Negative opinions about a product can have a high impact in its sales figures. As a consequence, companies are more and more concerned about how to integrate opinion data in their business intelligence models so that they can predict sales figures or define new strategic goals. After analysing the requirements of this new application, this paper proposes a multidimensional data model to integrate sentiment data extracted from opinion posts in a traditional corporate data warehouse. Then, a new sentiment data extraction method that applies semantic annotation as a means to facilitate the integration of both types of data is presented. In this method, Wikipedia is used as the main knowledge resource, together with some well-known lexicons of opinion words and other corporate data and metadata stores describing the company products like, for example, technical specifications and user manuals. The resulting information system allows users to perform new analysis tasks by using the traditional OLAP-based data warehouse operators. We have developed a case study over a set of real opinions about digital devices which are offered by a wholesale dealer. Over this case study, the quality of the extracted sentiment data is evaluated, and some query examples that illustrate the potential uses of the integrated model are provided.