Automated Case Generation from Databases Using Similarity-Based Rough Approximation

  • Authors:
  • Liqiang Geng; Christine W. Chan

  • Venue:
  • MICAI '02 Proceedings of the Second Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence
  • Year:
  • 2002

Abstract

Acquiring knowledge from domain experts is a bottleneck in the development of a case-based reasoning system. With the large amounts of data now available, it would be useful to derive representative cases automatically from existing databases rather than acquire them from domain experts. This paper presents two algorithms that use similarity-based rough set theory to derive cases automatically from databases. The first algorithm, SRS1, requires the user to choose the similarity thresholds for the objects in a database, while the second, SRS2, selects suitable similarity thresholds automatically. These algorithms require fewer parameters from domain experts than other case generation algorithms. They can also handle noisy and inconsistent data in the database and select a reasonable number of representative cases. The experimental results were compared with those from well-known data mining systems, such as rule induction systems and neural network systems.
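
The abstract does not spell out SRS1 or SRS2, so the following is only a minimal sketch of the general idea behind similarity-based rough approximation, not the authors' algorithms. It assumes a hypothetical similarity measure (one minus the mean absolute difference of attributes scaled to [0, 1]), a single user-chosen threshold, and made-up toy records. Objects whose whole similarity class shares their decision label fall in the lower approximation and are clear-cut candidates for cases; objects in the upper but not the lower approximation are the inconsistent or noisy ones.

```python
from typing import List, Sequence, Set, Tuple

# A record is (attribute values scaled to [0, 1], decision label).
Record = Tuple[Sequence[float], str]


def similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Hypothetical similarity: 1 minus the mean absolute attribute difference."""
    return 1.0 - sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def similarity_classes(records: List[Record], threshold: float) -> List[Set[int]]:
    """For each object, the indices of all objects at least `threshold` similar to it."""
    return [
        {j for j, (other, _) in enumerate(records)
         if similarity(attrs, other) >= threshold}
        for attrs, _ in records
    ]


def rough_approximations(
    records: List[Record], threshold: float, label: str
) -> Tuple[Set[int], Set[int]]:
    """Lower and upper approximation of the set of objects carrying `label`."""
    target = {i for i, (_, d) in enumerate(records) if d == label}
    classes = similarity_classes(records, threshold)
    # Lower: the object's entire similarity class lies inside the target set.
    lower = {i for i in target if classes[i] <= target}
    # Upper: the object's similarity class overlaps the target set.
    upper = {i for i, cls in enumerate(classes) if cls & target}
    return lower, upper


# Toy data (an assumption, not taken from the paper).
records: List[Record] = [
    ((0.10, 0.20), "A"),
    ((0.15, 0.25), "A"),
    ((0.50, 0.50), "A"),
    ((0.55, 0.55), "B"),
    ((0.90, 0.80), "B"),
]

lower, upper = rough_approximations(records, threshold=0.9, label="A")
boundary = upper - lower
print("lower:", sorted(lower))
print("upper:", sorted(upper))
print("boundary:", sorted(boundary))
```

With this toy data, objects 2 and 3 are near-identical but carry different labels, so they land in the boundary region, while objects 0 and 1 are consistently labelled "A". Raising or lowering the threshold changes how large each similarity class is, which is the kind of parameter the abstract says SRS1 leaves to the user and SRS2 selects automatically.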