Applying passage in Web text mining

Authors:
Thanaruk Theeramunkong
Affiliations:
Info. Tech. Prog., Sirindhorn Int. Inst. of Tech., Thammasat Univ., Pathumthani 12121, Thailand and Info. Res. and Dev. Div., Natl. Elec. and Comp. Tech. Ctr. (NECTEC), Rajthevi, Bangkok 10400 Tha ...
Venue:
International Journal of Intelligent Systems - Intelligent Technologies
Year:
2004

Citing 0
Cited 3

Answering form-based web queries using the data-mining approach

Journal of Intelligent Information Systems
Applying latent semantic indexing in frequent itemset mining for document relation discovery

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Tag co-occurrence analysis using the association data mining rule

Proceedings of the 2012 iConference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Textual information on the Web is very huge, varied, and useful. Although traditional text mining treats a text document as a single piece of information, this approach may not be suitable for Web documents that are long and heterogeneous in their contents. This article presents a new approach that applies the concept of a passage to Web text mining. In this approach, a single Web text document is considered as several passages instead of a single text. To investigate the effectiveness of the approach, Thai Web documents taken from the Internet are used. As our preliminary experiment, we explore the influence of using passages on the construction of association rules by comparing them with a version that does not use passages. © 2004 Wiley Periodicals, Inc.