Syntactic Extraction Approach to Processing Local Document Collections

  • Authors:
  • Jolanta Mizera-Pietraszko

  • Affiliations:
  • Department of Information Systems, Institute of Informatics, Wroclaw University of Technology, Wroclaw, Poland 50-370

  • Venue:
  • FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Techniques of processing databases like free text searching, or proximity search are one of the key factors that influence efficiency of query answering. Since most users prefer querying systems in natural language, a correct answer formulation based on the electronic document content seems a real challenge. Processing queries in multilingual environment usually impedes the system responsiveness even more. This paper proposes an approach of overcoming these obstacles by implementation of syntactic information extraction. Some evaluation methodologies commonly used by TREC, NTCIR, SIGIR etc are studied in order to suggest that it is not only a system architecture itself, a translation model or the document format, but also other factors that determine the system performance. The shallow technique of the syntactic information extraction used appears to be a robust of the system described. In this light, it is possible to achieve comparable results when processing monolingual and cross-lingual collections.