Enhanced web document summarization using hyperlinks

Authors:
J.-Y. Delort;B. Bouchon-Meunier;M. Rifqi
Affiliations:
University Paris 6, Paris, France;University Paris 6, Paris, France;University Paris 6, Paris, France
Venue:
Proceedings of the fourteenth ACM conference on Hypertext and hypermedia
Year:
2003

Citing 10
Cited 29

Towards general measures of comparison of objects

Fuzzy Sets and Systems - Special issue dedicated to the memory of Professor Arnold Kaufmann
A Web navigation tool for the blind

Assets '98 Proceedings of the third international ACM conference on Assistive technologies
Automatic resource compilation by analyzing hyperlink structure and associated text

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Summarizing text documents: sentence selection and evaluation metrics

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
OCELOT: a system for summarizing Web pages

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Topical locality in the Web

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Automatically summarising Web sites: is there a way around it?

Proceedings of the ninth international conference on Information and knowledge management
A vector space model for automatic indexing

Communications of the ACM
Seeing the whole in parts: text summarization for web browsing on handheld devices

Proceedings of the 10th international conference on World Wide Web
Interactive Document Summarisation Using Automatically Extracted Keyphrases

HICSS '02 Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 4 - Volume 4

World wide web site summarization

Web Intelligence and Agent Systems
Web-page summarization using clickthrough data

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Gist summaries for visually impaired surfers

Proceedings of the 7th international ACM SIGACCESS conference on Computers and accessibility
Strategies for automatic LOM metadata generating in a web-based CSCL tool

WebMedia '05 Proceedings of the 11th Brazilian Symposium on Multimedia and the web
Identifying commented passages of documents using implicit hyperlinks

Proceedings of the seventeenth conference on Hypertext and hypermedia
A Novel Partitioning-Based Clustering Method and Generic Document Summarization

WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
From social bookmarking to social summarization: an experiment in community-based summary generation

Proceedings of the 12th international conference on Intelligent user interfaces
Temporal multi-page summarization

Web Intelligence and Agent Systems
Csurf: a context-driven non-visual web-browser

Proceedings of the 16th international conference on World Wide Web
Context browsing with mobiles - when less is more

Proceedings of the 5th international conference on Mobile systems, applications and services
Learning query-biased web page summarization

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
CLBCRA-Approach for Combination of Content-Based and Link-Based Ranking in Web Search

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Web content summarization using social bookmarks: a new approach for social summarization

Proceedings of the 10th ACM workshop on Web information and data management
Bridging the Web Accessibility Divide

Electronic Notes in Theoretical Computer Science (ENTCS)
Automated construction of web accessibility models from transaction click-streams

Proceedings of the 18th international conference on World wide web
A Document Descriptor Extractor Based on Relevant Expressions

EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Query-topic focused web pages summarization

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Extraction of anchor-related text and its evaluation by user studies

Proceedings of the 2007 conference on Human interface: Part I
Adaptive focused crawling

The adaptive web
Web news summarization via soft clustering algorithm

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Social summarization in collaborative web search

Information Processing and Management: an International Journal
Mixture model based label association techniques for web accessibility

UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Towards automatic building of document keywords

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Extracting the gist of social network services using Wikipedia

Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
A hierarchical model of web summaries

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Social context summarization

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
NewPR-Combining TFIDF with pagerank

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
Why read if you can skim: towards enabling faster screen reading

Proceedings of the International Cross-Disciplinary Conference on Web Accessibility
Addressing Challenges in Web Accessibility for the Blind and Visually Impaired

International Journal of Distance Education Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the issue of Web document summarization. As textual content of Web documents is often scarce or irrelevant and existing summarization techniques are based on it, many Web pages and websites cannot be suitably summarized. We consider the context of a Web document by the textual content of all the documents linking to it. To summarize a target Web document, a context-based summarizer has to perform a preprocessing task, during which it will be decided which pieces of information in the source documents are relevant to the content of the target. Then a context-based summarizer faces two issues: first, the selected elements may partially deal with the topic of the target, second they may be related to the target and yet not contain any clues about the content of the target.In this paper we put forward two new summarization by context algorithms. The first one uses both the content and the context of the document and the second one is based only on the elements of the context. It is shown that summaries taking into account the context are usually much more relevant than those made only from the content of the target document. Optimal conditions of the proposed algorithms with respect to the sizes of the content and the context of the document to summarize are studied.