Efficient Data Mining from Large Text Databases

  • Authors:
  • Hiroki Arimura;Hiroshi Sakamoto;Setsuo Arikawa

  • Venue:
  • Progress in Discovery Science, Final Report of the Japanese Discovery Science Project
  • Year:
  • 2002

Abstract

In this paper, we consider the problem of discovering a simple class of combinatorial patterns from a large collection of unstructured text data. As a framework for data mining, we adopt optimized pattern discovery, in which a mining algorithm discovers the best patterns that optimize a given statistical measure within a class of hypothesis patterns on a given data set. We present efficient algorithms for the classes of proximity word association patterns and report experiments on keyword discovery from Web data.
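
To make the optimized pattern discovery framework concrete, the following is a minimal brute-force sketch, not the efficient algorithms the paper presents. It assumes a simplified two-word pattern `(w1, w2, k)` that matches a document when `w1` is followed by `w2` within `k` tokens, and it assumes classification error over a positive and a negative document set as the statistical measure. The function names and the toy documents are hypothetical and chosen for illustration only.

```python
from itertools import product


def matches(pattern, doc_tokens):
    """Return True if w1 occurs and w2 follows it within k tokens."""
    w1, w2, k = pattern
    for i, tok in enumerate(doc_tokens):
        if tok == w1 and w2 in doc_tokens[i + 1 : i + 1 + k]:
            return True
    return False


def classification_error(pattern, pos_docs, neg_docs):
    """Fraction of documents misclassified by the rule 'match implies positive'.

    Assumption: classification error stands in for the statistical measure.
    """
    errors = sum(not matches(pattern, d) for d in pos_docs)
    errors += sum(matches(pattern, d) for d in neg_docs)
    return errors / (len(pos_docs) + len(neg_docs))


def best_pattern(pos_docs, neg_docs, k):
    """Exhaustively search ordered word pairs drawn from the positive documents."""
    vocab = sorted({tok for d in pos_docs for tok in d})
    candidates = ((w1, w2, k) for w1, w2 in product(vocab, repeat=2))
    return min(candidates, key=lambda p: classification_error(p, pos_docs, neg_docs))


# Toy data (hypothetical): two target documents and two background documents.
pos = [s.split() for s in ["data mining from text", "mining large text databases"]]
neg = [s.split() for s in ["coal mining history", "text editor manual"]]

best = best_pattern(pos, neg, k=3)
print(best, classification_error(best, pos, neg))
```

This exhaustive search tries every ordered word pair, so its cost grows quadratically with the vocabulary size and it serves only to illustrate the problem statement on small inputs.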