Retrieval performance in Ferret a conceptual information retrieval system
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Effective retrieval of structured documents
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Integrating automatic genre analysis into digital libraries
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
HICSS '98 Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences - Volume 2
Towards Automatic Web Genre Identification
HICSS '02 Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 4 - Volume 4
Genre as Interface Metaphor: Exploiting Form and Function in Digital Environments
HICSS '99 Proceedings of the Thirty-Second Annual Hawaii International Conference on System Sciences-Volume 2 - Volume 2
Structural features in content oriented XML retrieval
Proceedings of the 14th ACM international conference on Information and knowledge management
Learning to summarise XML documents using content and structure
Proceedings of the 14th ACM international conference on Information and knowledge management
Effects of web document evolution on genre classification
Proceedings of the 14th ACM international conference on Information and knowledge management
The SMART Retrieval System—Experiments in Automatic Document Processing
The SMART Retrieval System—Experiments in Automatic Document Processing
The form is the substance: classification of genres in text
HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Journal of the American Society for Information Science and Technology
Classifying XML Documents by Using Genre Features
DEXA '07 Proceedings of the 18th International Conference on Database and Expert Systems Applications
Structured text retrieval by means of affordances and genre
FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
Automatic genre identification: towards a flexible classification scheme
FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
Structured text retrieval by means of affordances and genre
FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
Hi-index | 0.00 |
This paper offers a proposal for some preliminary research on the retrieval of structured text, such as extensible mark-up language (XML). We believe that capturing the way in which a reader perceives the meaning of documents, especially genres of text, may have implications for information retrieval (IR) and in particular, for cognitive IR and relevance. Previous research on 'shallow' features of structured text has shown that categorization by form is possible. Gibson's theory of 'affordances' and genre offer the reader the meaning and purpose - through structure - of a text, before the reader has even begun to read it, and should therefore provide a good basis for the 'deep' skimming and categorization of texts. We believe that Gibson's 'affordances' will aid the user to locate, examine and utilize shallow or deep features of genres and retrieve relevant output. Our proposal puts forward two hypotheses, with a list of research questions to test them, and culminates in experiments involving the studies of human categorization behaviour when viewing the structures of emails and web documents. Finally, we will examine the effectiveness of adding structural layout cues to a Yahoo discussion forum (currently only a bag-of-words), which is rich in structure, but only searchable through a Boolean search engine.