PMML and UIMA based frameworks for deploying analytic applications and services

  • Authors: David Ferrucci; Robert L. Grossman; Anthony Levas

  • Affiliations: IBM T. J. Watson Research Center; University of Illinois at Chicago; IBM T. J. Watson Research Center

  • Venue: Proceedings of the 4th International Workshop on Data Mining Standards, Services and Platforms
  • Year: 2006

Abstract

It is convenient to divide data into structured, semi-structured, and unstructured data. By structured data, we mean data that is organized into fields or attributes; database records are an example. Semi-structured data has attributes but lacks the regularity of structured data; data marked up with HTML or XML tags is an example. Unstructured data lacks attributes or fields and includes text, signals, images, video, audio, and similar data. Of course, a single item may combine several of these types. For example, the content of a message can be unstructured text while its metadata is expressed as semi-structured XML tags.
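
The three categories can be made concrete with a small Python sketch. It is only an illustration under stated assumptions: the `CustomerRecord` class, the `describe_message` helper, and all sample values are hypothetical and do not come from the paper. The sketch models a database-style record as structured data, XML message metadata as semi-structured data, and a message body as unstructured text.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class CustomerRecord:
    """Structured data: a fixed set of named fields, like a database row.

    Hypothetical example type; not taken from the paper.
    """
    customer_id: int
    name: str
    balance: float


# Semi-structured data: attributes are tagged, but elements may repeat
# or be missing, so there is no fixed schema (hypothetical sample).
metadata_xml = (
    "<message>"
    "<from>alice@example.com</from>"
    "<to>bob@example.com</to>"
    "<to>carol@example.com</to>"
    "</message>"
)

# Unstructured data: free text with no fields or attributes.
body_text = "Hi Bob and Carol, the quarterly report is attached."


def describe_message(metadata: str, body: str) -> dict:
    """Combine semi-structured metadata with an unstructured body."""
    root = ET.fromstring(metadata)
    return {
        "sender": root.findtext("from"),
        "recipients": [el.text for el in root.findall("to")],
        # A crude whitespace word count stands in for real text analysis.
        "body_words": len(body.split()),
    }


if __name__ == "__main__":
    print(CustomerRecord(customer_id=1, name="Alice", balance=10.0))
    print(describe_message(metadata_xml, body_text))
```

The message case mirrors the example in the abstract: the metadata can be queried by tag name, while the body has to be analyzed as raw text.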