The effort around EXTIRP 2004 focused on the heterogeneity of XML document collections. The subcollections of the heterogeneous track (het-track) did not offer us a suitable testbed, but we successfully applied methods independent of any document type to the original INEX test collection. By ignoring the element names defined in the DTD, we created comparable runs and observed an improvement in the results. This was the evidence we had anticipated for our hypothesis that we do not need to know the element names when indexing the collection or when returning full-text answers to Content-Only queries. Some problematic areas were also identified. One of them is score combination, which lets us merge elements of any size into one ranked list of results, provided that we have the relevance scores of the leaf-level elements. However, finding a suitable score combination method remains part of our future work.
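
To make the idea of score combination concrete, the following is a minimal sketch, not the method used in EXTIRP (the abstract leaves the choice of method open). It assumes, purely for illustration, that an inner element's score is the size-weighted mean of its children's scores. The `Element` class, the `combine` and `rank` functions, the positional paths, and the sample sizes and scores are all hypothetical. Elements are identified by position only, so no element names are needed.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """A node in an XML-like tree, identified by position, not by tag name."""
    path: str            # positional path such as "/1/2" (hypothetical scheme)
    size: int = 0        # text length of a leaf element (made-up units)
    score: float = 0.0   # relevance score of a leaf element (made-up values)
    children: list = field(default_factory=list)


def combine(element, ranked):
    """Fill in size and score bottom-up and collect every element in `ranked`.

    Assumption: an inner element's score is the size-weighted mean of its
    children's scores. This is an illustrative choice only.
    """
    if element.children:
        total_size = 0
        weighted_sum = 0.0
        for child in element.children:
            child_size, child_score = combine(child, ranked)
            total_size += child_size
            weighted_sum += child_size * child_score
        element.size = total_size
        element.score = weighted_sum / total_size if total_size else 0.0
    ranked.append(element)
    return element.size, element.score


def rank(root):
    """Return all elements of the tree as one list, best score first."""
    ranked = []
    combine(root, ranked)
    return sorted(ranked, key=lambda e: e.score, reverse=True)


# Two scored leaves under one parent; only leaf scores are given as input.
root = Element("/1", children=[
    Element("/1/1", size=100, score=0.9),
    Element("/1/2", size=300, score=0.1),
])
ranking = [e.path for e in rank(root)]
print(ranking)
```

With this weighting, a short, highly relevant leaf outranks its parent, and the parent outranks a long, weakly relevant sibling. Other combination rules (for example a maximum or a decayed sum) would give different rankings, which is why the choice of method matters.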