Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Incremental updates of inverted lists for text document retrieval
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
The entity-relationship model—toward a unified view of data
ACM Transactions on Database Systems (TODS) - Special issue: papers from the international conference on very large data bases: September 22–24, 1975, Framingham, MA
Integrating contents and structure in text retrieval
ACM SIGMOD Record
Object-Relational DBMSs: Tracking the Next Great Wave
Object-Relational DBMSs: Tracking the Next Great Wave
Storing and querying ordered XML using a relational database system
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Oracle8i Index-Organized Table and Its Application to New Domains
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
An Efficient Indexing Technique for Full Text Databases
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Fast Incremental Indexing for Full-Text Information Retrieval
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
WWW '03 Proceedings of the 12th international conference on World Wide Web
Extensible Indexing: a Framework for Integrating Domain-Specific Indexing Schemes into Oracle8i
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
XRANK: ranked keyword search over XML documents
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
ACM SIGMOD Record
Native Xquery processing in oracle XMLDB
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Enhancing XML search with XQuery 1.0 and XPath 2.0 full-text
IBM Systems Journal
Making database systems usable
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Query rewrite for XML in Oracle XML DB
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Towards a physical XML independent XQuery/SQL/XML engine
Proceedings of the VLDB Endowment
Efficient keyword search over virtual XML views
The VLDB Journal — The International Journal on Very Large Data Bases
Query processing techniques for solid state drives
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Hi-index | 0.00 |
RDBMS provides best performance for querying structured data that starts out with a well-defined schema. However, such a 'schema first, data later' approach does not work for unstructured data or data without much structure. Therefore, RDBMS typically stores such data without any schema in LOB columns (for example, Character Large Object (CLOB) or Binary Large Object (BLOB) columns) and provides Information-Retrieval (IR) style, keyword-based search capability over these LOB columns. Lately, XML as a native datatype (XMLType) in RDBMS has been introduced via the SQL/XML standard. Semi-structured data with or without any schema can be stored into such XMLType columns, and XQuery provides query capability over them. In particular, XQuery full text specification provides the capability of searching keywords within document context. Such full context-aware text search capability is more powerful than pure keyword search, since the user can now provide fine-grained context in which the keywords should occur. However, XML with XQuery full text searching requires that the user first convert her text data into XML and store them into XMLType column. Such massive physical data migration with possible loss of document fidelity and its potential impact on existing production environments are often expensive enough that users are reluctant to adopt the XML/XQuery approach. In this paper, we propose a pay-as-you-go architecture to provide XML text view over LOB columns, so that user can take advantage of context-aware full-text search capability adaptively. This adaptive architecture includes a novel XML text index that can be created over the LOB column where the content is stored. The XML text index supports an XML text view over LOB data on top of which XQuery full-text search capability is feasible. Such an adaptive index/view approach provides least intrusion over existing data, as it requires no physical data migration. We describe the design and challenge of building such an adaptive XML text index. Furthermore, we advocate that the pay-as-you-go approach provides the integration bridge between the structured relational world and text oriented document world and fulfills the primary motivation of XML in the database.