Mining Text Data: Special Features and Patterns

  • Authors:
  • Miguel Delgado;Maria J. Martín-Bautista;Daniel Sánchez;María Amparo Vila Miranda

  • Affiliations:
  • -;-;-;-

  • Venue:
  • Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Text mining is an increasingly important research field because of the necessity of obtaining knowledge from the enormous number of text documents available, especially on the Web. Text mining and data mining, both included in the field of information mining, are similar in some sense, and thus it may seem that data mining techniques may be adapted in a straightforward way to mine text. However, data mining deals with structured data, whereas text presents special characteristics and is basically unstructured. In this context, the aims of this paper are three: - To study particular features of text. - To identify the patterns we may look for in text. - To discuss the tools we may use for that purpose.In relation with the third point we overview existing proposals, as well as some new tools we are developing by adapting data mining tools previously developed by our research group.