XML repositories are usually queried on both structure and content. Because XML is structurally heterogeneous, queries are often interpreted approximately, and their answers are returned ranked by score. Computing answer scores in XML is an active area of research that oscillates between pure content scoring, such as the well-known tf*idf, and scoring that takes structure into account. However, none of the existing proposals fully accounts for structure and combines it with content to score query answers. We propose novel XML scoring methods that are inspired by tf*idf and that account for both structure and content while considering query relaxations. Twig scoring accounts for the most structure and content and is thus used as our reference method. Path scoring is an approximation that loosens the correlations between query nodes, thereby reducing the time required to manipulate scores during top-k query processing. We also propose efficient data structures that speed up ranked query processing. Extensive experiments validate our scoring methods and show that path scoring provides very high precision while improving score computation time.
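The contrast between twig scoring and path scoring can be illustrated with a minimal sketch. It is not the scoring method defined in the paper: the toy collection, the function names (`satisfies`, `idf`, `strictest`, `path_score`, `twig_score`), and the use of a plain `log(N / matches)` weight are all assumptions made for illustration, and the tf component is omitted. Each query leaf is matched first as a child of the root and, failing that, under the relaxed descendant axis.

```python
import math
import xml.etree.ElementTree as ET

# Toy collection (hypothetical data, not taken from the paper).
DOCS = [
    "<book><info><title>XML</title></info><author>Lee</author></book>",
    "<book><title>XML</title><author>Kim</author></book>",
    "<book><title>SQL</title></book>",
]
ROOTS = [ET.fromstring(d) for d in DOCS]


def satisfies(root, leaf, axis):
    """Check one structural predicate: leaf is a child or a descendant of root."""
    pattern = leaf if axis == "child" else ".//" + leaf
    return root.find(pattern) is not None


def idf(predicates):
    """idf-style weight of predicates that must hold together in one document."""
    hits = sum(
        all(satisfies(r, leaf, axis) for leaf, axis in predicates) for r in ROOTS
    )
    return math.log(len(ROOTS) / hits) if hits else 0.0


def strictest(root, leaf):
    """Return the strictest predicate on leaf that root satisfies, or None."""
    for axis in ("child", "descendant"):  # exact form first, then the relaxation
        if satisfies(root, leaf, axis):
            return (leaf, axis)
    return None


def matched_predicates(root, leaves):
    """Collect the strictest satisfied predicate for every query leaf."""
    found = (strictest(root, leaf) for leaf in leaves)
    return [p for p in found if p is not None]


def path_score(root, leaves):
    """Path-style score: each predicate is weighted independently, then summed."""
    return sum(idf([p]) for p in matched_predicates(root, leaves))


def twig_score(root, leaves):
    """Twig-style score: all matched predicates are weighted jointly."""
    preds = matched_predicates(root, leaves)
    return idf(preds) if preds else 0.0


if __name__ == "__main__":
    query = ["title", "author"]
    for i, root in enumerate(ROOTS):
        print(i, round(path_score(root, query), 3), round(twig_score(root, query), 3))
```

In this sketch the path-style score only needs one weight per predicate, which can be computed once and reused across documents, whereas the twig-style score needs a weight for every combination of predicates that some document satisfies. That is one way to read the trade-off between reference accuracy and score manipulation cost described above.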