Foundations of statistical natural language processing
Foundations of statistical natural language processing
Information Retrieval
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
XML Document Classification Using Extended VSM
Focused Access to XML Documents
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
From natural language to NEXI, an interface for INEX 2005 queries
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Journal of Information Science
Hi-index | 0.00 |
Users of information retrieval (IR) systems require an interface that is powerful and easy-to-use in order to fulfill their information requirement. In XML-IR systems this is a non-trivial task since users expect these systems to fulfill both their structural and content requirements. Most existing XML-IR systems accept queries formatted in formal query languages, however, these languages are difficult to use. This paper presents NLPX – an XML-IR system with a natural language interface that is user friendly enough so it can be used intuitively, but sophisticated enough to be able to handle complex structured queries. NLPX accepts English queries that contain both users' content and structural requirements. It uses a set of grammar templates to derive the structural and content requirements and translates them into a formal language (NEXI). The formal language queries can then be processed by many existing XML-IR systems. The system was developed for participation in the NLP Track of the INEX 2004 Workshop, and results indicated that natural language interfaces are able to capture users' structural and content requirements, but not as accurately as some formal language interfaces.