VisHue: web page segmentation for an improved query interface for MedlinePlus medical encyclopedia

  • Authors:
  • Aastha Madaan;Wanming Chu;Subhash Bhalla

  • Affiliations:
  • University of Aizu, Aizu-Wakamatsu Shi, Fukushima-ken, Japan;University of Aizu, Aizu-Wakamatsu Shi, Fukushima-ken, Japan;University of Aizu, Aizu-Wakamatsu Shi, Fukushima-ken, Japan

  • Venue:
  • DNIS'11 Proceedings of the 7th international conference on Databases in Networked Information Systems
  • Year:
  • 2011

Abstract

The World Wide Web has become the largest source of information. Consequently, web-based information retrieval, information extraction, automatic page adaptation, and deep-web querying are gaining importance, and the need for information retrieval applications is increasing. Traditional information retrieval techniques have been applied to cope with the ever-expanding information on the internet. Such techniques are sometimes time-consuming and laborious, and the results obtained may be unsatisfactory. This study attempts to query web pages such as those of the MedlinePlus medical encyclopedia by segmenting them. It summarizes the existing approaches to web page segmentation from the perspective of "structure realization for improved querying" on the web. It proposes a new algorithm, VisHue, for web page segmentation based on visual cues and heuristics, and uses the hierarchical structure the algorithm generates to develop the Query by Segment or Tag (QBT) query interface. This interface is close to the end user and exploits the relationships among the various content groups within a web page. Such an improved query interface enables the user to perform in-depth querying, a step beyond page-level search.