NaLIX: A generic natural language search environment for XML data

  • Authors:
  • Yunyao Li;Huahai Yang;H. V. Jagadish

  • Affiliations:
  • IBM Almaden Research Center, San Jose, CA;University at Albany, State University of New York, Albany, NY;University of Michigan, Ann Arbor, MI

  • Venue:
  • ACM Transactions on Database Systems (TODS)
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe the construction of a generic natural language query interface to an XML database. Our interface can accept a large class of English sentences as a query, which can be quite complex and include aggregation, nesting, and value joins, among other things. This query is translated, potentially after reformulation, into an XQuery expression. The translation is based on mapping grammatical proximity of natural language parsed tokens in the parse tree of the query sentence to proximity of corresponding elements in the XML data to be retrieved. Iterative search in the form of followup queries is also supported. Our experimental assessment, through a user study, demonstrates that this type of natural language interface is good enough to be usable now, with no restrictions on the application domain.