Applying passage in Web text mining

  • Authors:
  • Thanaruk Theeramunkong

  • Affiliations:
  • Info. Tech. Prog., Sirindhorn Int. Inst. of Tech., Thammasat Univ., Pathumthani 12121, Thailand and Info. Res. and Dev. Div., Natl. Elec. and Comp. Tech. Ctr. (NECTEC), Rajthevi, Bangkok 10400 Tha ...

  • Venue:
  • International Journal of Intelligent Systems - Intelligent Technologies
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Textual information on the Web is very huge, varied, and useful. Although traditional text mining treats a text document as a single piece of information, this approach may not be suitable for Web documents that are long and heterogeneous in their contents. This article presents a new approach that applies the concept of a passage to Web text mining. In this approach, a single Web text document is considered as several passages instead of a single text. To investigate the effectiveness of the approach, Thai Web documents taken from the Internet are used. As our preliminary experiment, we explore the influence of using passages on the construction of association rules by comparing them with a version that does not use passages. © 2004 Wiley Periodicals, Inc.