Querying Web Data - The WebQA Approach

  • Authors:
  • Sunny K. S. Lam;M. Tamer Özsu

  • Affiliations:
  • -;-

  • Venue:
  • WISE '02 Proceedings of the 3rd International Conference on Web Information Systems Engineering
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The common paradigm of searching and retrieving informationon the Web is based on keyword-based search usingone or more search engines, and then browsing throughthe large number of returned URLs. This is significantlyweaker than the declarative querying that is supported byDBMSs. The lack of a schema and the high volatility ofWeb make "database-like" querying of Web data difficult.In this paper we report on our work in building a system,called WebQA, that provides a declarative query-based approachto Web data retrieval that uses question-answeringtechnology in extracting information from Web sites that areretrieved by search engines. The approach consists of firstusing meta-search techniques in an open environment togather candidate responses from search engines and otheron-line databases, and then using information extractiontechniques to find the answer to the specific question fromthese candidates. A prototype system has been developedto test this approach. Testing includes evaluation of itsperformance as a question-answering system using a well-knownevaluation system called TREC-9. Its accuracy usingTREC-9 data for simple questions is high and its retrievalperformance is good. The system employs an open systemarchitecture allowing for on-going improvements in variousaspects.