The notion of diversity in graphical entity summarisation on semantic knowledge graphs

Authors:
Marcin Sydow;Mariusz Pikuła;Ralf Schenkel
Affiliations:
Web Mining Lab, Polish-Japanese Institute of Information Technology, Warsaw, Poland and Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland;Web Mining Lab, Polish-Japanese Institute of Information Technology, Warsaw, Poland;Saarland University and MPI for Informatics, Saarbrücken, Germany
Venue:
Journal of Intelligent Information Systems
Year:
2013


Abstract

Given an entity represented by a single node q in a semantic knowledge graph D, the Graphical Entity Summarisation (GES) problem consists of selecting from D a very small surrounding graph S that serves as a generic summary of the information about q, subject to a given limit on the size of S. This article examines the role of diversity in this relatively new problem. It gives an overview of the diversity concept in information retrieval and proposes how to adapt it to GES. A measure of diversity for GES, called ALC, is defined, and two algorithms are presented: the baseline, diversity-oblivious PRECIS and the diversity-aware DIVERSUM. An experiment shows that DIVERSUM indeed achieves higher values of the ALC diversity measure than PRECIS. Next, an objective evaluation experiment demonstrates that the diversity-aware algorithm is superior to the diversity-oblivious one in terms of fact selection. More precisely, DIVERSUM clearly achieves higher recall than PRECIS on ground-truth reference entity summaries extracted from Wikipedia. We also report another intrinsic experiment, in which the output of the diversity-aware algorithm is significantly preferred by human expert evaluators. Importantly, the user feedback clearly indicates that the notion of diversity is the key reason for this preference. In addition, the experiment is repeated twice on an anonymous sample drawn from a broad population of Internet users via a crowdsourcing platform, which further confirms these results.
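
To make the problem statement concrete, the following toy sketch selects a connected summary of at most k edges around q from a graph given as (subject, label, object) triples. It is an illustration only and does not reproduce PRECIS, DIVERSUM, or the ALC measure, none of which are specified in this abstract. The function names, the hop-distance tie-breaking, and the "prefer unused edge labels" rule are all assumptions made for the example.

```python
# Toy illustration of the GES setting. NOT the paper's PRECIS or DIVERSUM.
# All names and selection rules here are hypothetical assumptions.

Triple = tuple[str, str, str]  # (subject, edge label, object)


def toy_summary(triples: list[Triple], q: str, k: int, diverse: bool) -> list[Triple]:
    """Greedily grow a connected summary of at most k edges around node q.

    Each step picks an unselected edge that touches a node already in the
    summary. Closer edges (fewer hops from q) are preferred. If `diverse`
    is True, edges whose label is not yet used are preferred first.
    """
    reached = {q: 0}              # node -> hop distance from q inside the summary
    chosen: list[Triple] = []
    used_labels: set[str] = set()

    while len(chosen) < k:
        best = None
        for idx, (s, p, o) in enumerate(triples):
            if (s, p, o) in chosen:
                continue
            ends = [n for n in (s, o) if n in reached]
            if not ends:
                continue          # edge is not attached to the summary yet
            d = min(reached[n] for n in ends)
            repeated = diverse and p in used_labels
            key = (repeated, d, idx)   # False sorts before True
            if best is None or key < best[0]:
                best = (key, (s, p, o))
        if best is None:
            break                 # no attachable edges remain
        (_, d, _), (s, p, o) = best
        chosen.append((s, p, o))
        used_labels.add(p)
        for n in (s, o):
            reached.setdefault(n, d + 1)
    return chosen


def distinct_labels(summary: list[Triple]) -> int:
    """Crude diversity proxy (an assumption, not the ALC measure)."""
    return len({p for _, p, _ in summary})


EXAMPLE: list[Triple] = [
    ("actor", "actedIn", "film1"),
    ("actor", "actedIn", "film2"),
    ("actor", "actedIn", "film3"),
    ("actor", "bornIn", "city1"),
    ("actor", "marriedTo", "person1"),
    ("film1", "directedBy", "person2"),
]

if __name__ == "__main__":
    for flag in (False, True):
        summary = toy_summary(EXAMPLE, "actor", k=3, diverse=flag)
        print(flag, distinct_labels(summary), summary)
```

On this small example the diversity-oblivious variant fills its budget with three edges of the same label, while the diversity-aware variant spends the same budget on three different labels. This is the kind of contrast the abstract describes, although the paper's own algorithms and measure may differ substantially from this sketch.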