Review:

  • Authors:
  • Sumana Sharma;Kweku-muata Osei-bryson

  • Affiliations:
  • Department of information systems, the information systems research institute, virginia commonwealth university, richmond, va 23284, usa;Department of information systems, the information systems research institute, virginia commonwealth university, richmond, va 23284, usa

  • Venue:
  • The Knowledge Engineering Review
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The knowledge discovery and data mining (KDDM) process models describe the various phases (e.g. business understanding, data understanding, data preparation, modeling, evaluation and deployment) of the KDDM process. They act as a roadmap for implementation of the KDDM process by presenting a list of tasks for executing the various phases. The checklist approach of describing the tasks is not adequately supported by appropriate tools, which specify ‘how’ the particular task can be implemented. This may result in tasks not being implemented. Another disadvantage is that the long checklist does not capture or leverage the dependencies that exist among the various tasks of the same and different phases. This not only makes the process cumbersome to implement, but also hinders possibilities for semi-automation of certain tasks. Given that each task in the process model serves an important goal and even affects the execution of related tasks due to the dependencies, these limitations are likely to negatively affect the efficiency and effectiveness of KDDM projects. This paper proposes an improved KDDM process model that overcomes these shortcomings by prescribing tools for supporting each task as well as identifying and leveraging dependencies among tasks for semi-automation of tasks, wherever possible.