An agent-based system framework for multi-slot web information extraction

  • Authors:
  • Shudong Zhang;Ye Qin;Naiming Yao

  • Affiliations:
  • College of Information Engineering, College of Information Engineering, Beijing, China;College of Information Engineering, College of Information Engineering, Beijing, China;College of Information Engineering, College of Information Engineering, Beijing, China

  • Venue:
  • CAR'10 Proceedings of the 2nd international Asia conference on Informatics in control, automation and robotics - Volume 3
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

At present, the scale and diversity of Web information are immense. Acquiring Web information simply relies on search engine which is increasingly unable to meet user needs, thus Web information extraction (WebIE) technology attracts widely attentions. In this paper, a framework of distributed multi-slot WebIE system based on agent is proposed. It includes user agent, mediator agent, wrapper agent, data store agent, page preprocessing agent and corresponding knowledge base. The agents communicate each other and cooperate together to carry out the general goal of the system. Moreover, aiming at multi-slot extraction, the approaches of extraction rule learning and repair are presented, which enable to enhance adaptability of the system.