Preferences in Wikipedia abstracts: Empirical findings and implications for automatic entity summarization

Authors:
Danyun Xu;Gong Cheng;Yuzhong Qu
Venue:
Information Processing and Management: an International Journal
Year:
2014


Abstract

The volume of entity-centric structured data grows rapidly on the Web. The description of an entity, composed of property-value pairs (a.k.a. features), has become very large in many applications. To avoid information overload, efforts have been made to automatically select a limited number of features to be shown to the user based on certain criteria, which is called automatic entity summarization. However, to the best of our knowledge, there is a lack of extensive studies on how humans rank and select features in practice, which can provide empirical support and inspire future research. In this article, we present a large-scale statistical analysis of the descriptions of entities provided by DBpedia and the abstracts of their corresponding Wikipedia articles, to empirically study, along several different dimensions, which kinds of features are preferable when humans summarize. Implications for automatic entity summarization are drawn from the findings.