Nonlinear pattern matching in trees
Journal of the ACM (JACM)
Structured answers for a large structured document collection
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
DOLORES: a system for logic-based retrieval of multimedia objects
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Comparative analysis of five XML query languages
ACM SIGMOD Record
Extended Boolean information retrieval
Communications of the ACM
Integrating contents and structure in text retrieval
ACM SIGMOD Record
Modern Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
The SMART Retrieval System—Experiments in Automatic Document Processing
The SMART Retrieval System—Experiments in Automatic Document Processing
Journal of the American Society for Information Science and Technology - XML
XIRQL: An XML query language based on information retrieval concepts
ACM Transactions on Information Systems (TOIS)
The effectiveness of automatically structured queries in digital libraries
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Configurable indexing and ranking for XML information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Content and structure in indexing and ranking XML
Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004
Searching structured documents
Information Processing and Management: an International Journal
Processing content-oriented XPath queries
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Choosing document structure weights
Information Processing and Management: an International Journal
The SphereSearch engine for unified ranked retrieval of heterogeneous XML and web documents
VLDB '05 Proceedings of the 31st international conference on Very large data bases
An efficient and versatile query engine for TopX search
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Integrating document and data retrieval based on XML
The VLDB Journal — The International Journal on Very Large Data Bases
Dynamic element retrieval in a structured environment
ACM Transactions on Information Systems (TOIS)
An architecture for xml information retrieval in a peer-to-peer environment
Proceedings of the ACM first Ph.D. workshop in CIKM
Relevance measures for XML information retrieval
International Journal of Web and Grid Services
Phil: A Lazy Implementation of a Language for Approximate Filtering of XML Documents
Electronic Notes in Theoretical Computer Science (ENTCS)
RRSi: indexing XML data for proximity twig queries
Knowledge and Information Systems
Integrating Structure in the Probabilistic Model for Information Retrieval
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Query ranking in probabilistic XML data
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
A coherent query language for XML
Journal of Intelligent Information Systems
Matching subsequences in trees
Journal of Discrete Algorithms
Annotating wikipedia articles with semantic tags for structured retrieval
Proceedings of the 2nd ACM workshop on Social web search and mining
CPM'03 Proceedings of the 14th annual conference on Combinatorial pattern matching
Flexible document-query matching based on a probabilistic content and structure score combination
Proceedings of the 2010 ACM Symposium on Applied Computing
Toward approximate GML retrieval based on structural and semantic characteristics
ICWE'10 Proceedings of the 10th international conference on Web engineering
The tree inclusion problem: In linear space and faster
ACM Transactions on Algorithms (TALG)
Flexible querying of XML documents
ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
Field-weighted XML retrieval based on BM25
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Integrating text retrieval and image retrieval in XML document searching
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
The tree inclusion problem: in optimal space and faster
ICALP'05 Proceedings of the 32nd international conference on Automata, Languages and Programming
No tag, a little nesting, and great XML keyword search
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Matching subsequences in trees
CIAC'06 Proceedings of the 6th Italian conference on Algorithms and Complexity
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Web Semantics: Science, Services and Agents on the World Wide Web
Ranked retrieval of structured documents with the s-term vector space model
INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval
Interactive searching behavior with structured XML documents
INEX'04 Proceedings of the Third international conference on Initiative for the Evaluation of XML Retrieval
EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Context-Specific frequencies and discriminativeness for the retrieval of structured documents
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Survey: An overview on XML similarity: Background, current trends and future directions
Computer Science Review
Pay-as-You-Go ranking of schema mappings using query logs
DILS'12 Proceedings of the 8th international conference on Data Integration in the Life Sciences
Selection fusion in semi-structured retrieval
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
XML represents both content and structure of documents. Taking advantage of the document structure promises to greatly improve the retrieval precision. In this article, we present a retrieval technique that adopts the similarity measure of the vector space model, incorporates the document structure, and supports structured queries. Our query model is based on tree matching as a simple and elegant means to formulate queries without knowing the exact structure of the data. Using this query model we propose a logical document concept by deciding on the document boundaries at query time. We combine structured queries and term-based ranking by extending the term concept to structural terms that include substructures of queries and documents. The notions of term frequency and inverse document frequency are adapted to logical documents and structural terms. We introduce an efficient technique to calculate all necessary term frequencies and inverse document frequencies at query time. By adjusting parameters of the retrieval process we are able to model two contrary approaches: the classical vector space model, and the original tree matching approach.