The PMML path towards true interoperability in data mining

  • Authors: Alex Guazzelli; Tridivesh Jena; Wen-Ching Lin; Michael Zeller
  • Affiliations: Zementis, Inc, San Diego, CA, USA (all authors)
  • Venue: Proceedings of the 2011 workshop on Predictive markup language modeling
  • Year: 2011

Abstract

As the de facto standard for data mining models, the Predictive Model Markup Language (PMML) provides tremendous benefits for business, IT, and the data mining industry in general, since it allows predictive models to be moved easily between applications. Because PMML is a cross-platform, vendor-independent open standard, auto-generated PMML code often appears in different versions of the language: one tool may export PMML 2.1 while another imports PMML 4.0. This mismatch raises the issue of conversion. For true interoperability, PMML needs to be easily converted from one version to another. In this paper, we describe the capabilities of the "PMML Converter", an application that represents a significant step on the PMML path towards true interoperability in data mining. Besides converting older versions of PMML to the latest version, the PMML Converter checks PMML files for syntax issues and, if any are encountered, automatically corrects them. This paper also describes the capabilities of an interactive PMML-based application, the "Transformations Generator". Auto-generated PMML code can omit important data pre-processing steps that are an integral part of a predictive solution. The Transformations Generator aims to bridge this gap by providing a graphical interface for the development and expression of data pre-processing steps in PMML.
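
To make the version mismatch concrete, the following is a minimal sketch of how a consumer might detect which PMML version a file declares before deciding whether conversion is needed. It is not the PMML Converter's implementation, which the abstract does not describe. It relies only on the `version` attribute of the root `PMML` element; the function names `pmml_version` and `needs_conversion`, the default target of "4.0", and the two sample documents are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def pmml_version(pmml_text: str) -> str:
    """Return the version declared on the root PMML element."""
    root = ET.fromstring(pmml_text)
    # Drop an XML namespace prefix if present,
    # e.g. "{http://www.dmg.org/PMML-4_0}PMML" becomes "PMML".
    local_name = root.tag.rsplit("}", 1)[-1]
    if local_name != "PMML":
        raise ValueError(f"not a PMML document: root element is {local_name!r}")
    version = root.get("version")
    if version is None:
        raise ValueError("PMML root element has no version attribute")
    return version


def needs_conversion(pmml_text: str, target: str = "4.0") -> bool:
    """True when the declared version is older than the (assumed) target."""
    def as_tuple(v: str) -> tuple:
        return tuple(int(part) for part in v.split("."))

    return as_tuple(pmml_version(pmml_text)) < as_tuple(target)


# Hypothetical, heavily trimmed documents used only to exercise the functions.
OLD_DOC = '<PMML version="2.1"><Header copyright="example"/></PMML>'
NEW_DOC = (
    '<PMML xmlns="http://www.dmg.org/PMML-4_0" version="4.0">'
    '<Header copyright="example"/></PMML>'
)

if __name__ == "__main__":
    for name, doc in (("old", OLD_DOC), ("new", NEW_DOC)):
        print(name, pmml_version(doc), needs_conversion(doc))
```

Detecting the version is only the first step. An actual conversion must also rewrite elements and attributes whose definitions changed between versions, which this sketch does not attempt.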