Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
HYPERTEXT '89 Proceedings of the second annual ACM conference on Hypertext
Practical approach to the stereo matching of urban imagery
Image and Vision Computing
Approaches to passage retrieval in full text information systems
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Subtopic structuring for full-length document access
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Passage-level evidence in document retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic text decomposition and structuring
Information Processing and Management: an International Journal
Context and structure in automated full-text information access
Context and structure in automated full-text information access
TileBars: visualization of term distribution information in full text information access
CHI '95 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Automatic text decomposition using text segments and text themes
Proceedings of the the seventh ACM conference on Hypertext
Labeled, typed links as cues when reading hypertext documents
Journal of the American Society for Information Science
Building hypertext using information retrieval
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A language modelling approach to relevance profiling for document browsing
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Handbook of Logic and Language
Handbook of Logic and Language
ECDL '97 Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries
A network-based approach to text handling for the on-line scientific community
A network-based approach to text handling for the on-line scientific community
Topic segmentation: algorithms and applications
Topic segmentation: algorithms and applications
Advances in domain independent linear text segmentation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Multi-paragraph segmentation of expository text
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Understanding Digital Libraries, Second Edition (The Morgan Kaufmann Series in Multimedia and Information Systems)
Using topic shifts for focussed access to XML repositories
ECIR'07 Proceedings of the 29th European conference on IR research
Progress in information retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Supporting orientation during search result examination
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Hi-index | 0.00 |
When presented with a retrieved document, users of a search engine are usually left with the task of pinning down the relevant information inside the document. Often this is done by a time-consuming combination of skimming, scrolling and Ctrl+F. In the setting of a digital library for scientific literature the issue is especially urgent when dealing with reference works, such as surveys and handbooks, as these typically contain long documents. Our aim is to develop methods for providing a “go-read-here” type of retrieval functionality, which points the user to a segment where she can best start reading to find out about her topic of interest. We examine multiple query-independent ways of segmenting texts into coherent chunks that can be returned in response to a query. Most (experienced) authors use paragraph breaks to indicate topic shifts, thus providing us with one way of segmenting documents. We compare this structural method with semantic text segmentation methods, both with respect to topical focus and relevancy. Our experimental evidence is based on manually segmented scientific documents and a set of queries against this corpus. Structural segmentation based on contiguous blocks of relevant paragraphs is shown to be a viable solution for our intended application of providing “go-read-here” functionality.