Subgroup Discovery for Test Selection: A Novel Approach and Its Application to Breast Cancer Diagnosis

  • Authors:
  • Marianne Mueller;Rómer Rosales;Harald Steck;Sriram Krishnan;Bharat Rao;Stefan Kramer

  • Affiliations:
  • Institut für Informatik, Technische Universität München, Garching, Germany 85748;IKM CAD and Knowledge Solutions, Siemens Healthcare, Malvern, USA 19335;IKM CAD and Knowledge Solutions, Siemens Healthcare, Malvern, USA 19335;IKM CAD and Knowledge Solutions, Siemens Healthcare, Malvern, USA 19335;IKM CAD and Knowledge Solutions, Siemens Healthcare, Malvern, USA 19335;Institut für Informatik, Technische Universität München, Garching, Germany 85748

  • Venue:
  • IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
  • Year:
  • 2009

Abstract

We propose a new approach to test selection based on the discovery of subgroups of patients who share the same optimal test, and present its application to breast cancer diagnosis. Subgroups are defined in terms of background information about the patient. For each patient, we automatically determine the best t subgroups to which the patient belongs and choose the test proposed by the majority of these subgroups. We introduce the concept of prediction quality to measure how accurately a test outcome reflects the disease status. The quality of a subgroup is then the best mean prediction quality of its members (choosing the same test for all). Incorporating the quality computation into the search heuristic enables a significant reduction of the search space. In experiments on breast cancer diagnosis data, we show that our approach is faster than the baseline algorithm APRIORI-SD while preserving its accuracy.
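
The selection rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`subgroup_quality`, `select_test`), the data layout, the test names, and all numeric values are hypothetical, and prediction quality is simply assumed to be a given score per patient and test. The sketch scores a fixed list of candidate subgroups and omits the subgroup search and its pruning heuristic entirely.

```python
from collections import Counter


def subgroup_quality(members, pq):
    """Return (best_test, quality) for a subgroup.

    quality is the best mean prediction quality over the members when the
    same test is chosen for all of them.
    members: list of patient ids; pq: patient id -> {test name -> score}.
    """
    tests = pq[members[0]]
    means = {
        test: sum(pq[p][test] for p in members) / len(members)
        for test in tests
    }
    best_test = max(means, key=means.get)
    return best_test, means[best_test]


def select_test(background, subgroups, train, pq, t=3):
    """Choose a test by majority vote over the t best matching subgroups.

    background: attributes of the new patient.
    subgroups: name -> predicate over background attributes (assumed given).
    train: patient id -> background attributes of known patients.
    """
    scored = []
    for name, predicate in subgroups.items():
        if not predicate(background):
            continue  # the new patient does not belong to this subgroup
        members = [p for p, b in train.items() if predicate(b)]
        if members:
            test, quality = subgroup_quality(members, pq)
            scored.append((quality, name, test))
    if not scored:
        return None  # no candidate subgroup covers this patient
    scored.sort(key=lambda s: s[0], reverse=True)
    votes = Counter(test for _, _, test in scored[:t])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: background information and prediction qualities.
train = {
    1: {"age": 45, "dense": True},
    2: {"age": 52, "dense": True},
    3: {"age": 67, "dense": False},
    4: {"age": 71, "dense": False},
}
pq = {
    1: {"mammography": 0.4, "ultrasound": 0.9},
    2: {"mammography": 0.5, "ultrasound": 0.8},
    3: {"mammography": 0.9, "ultrasound": 0.6},
    4: {"mammography": 0.8, "ultrasound": 0.5},
}
subgroups = {
    "dense": lambda b: b["dense"],
    "under_60": lambda b: b["age"] < 60,
    "over_60": lambda b: b["age"] >= 60,
    "all": lambda b: True,
}

print(select_test({"age": 48, "dense": True}, subgroups, train, pq, t=3))
```

In this sketch a tie in the vote is resolved in favor of the test proposed by the higher-quality subgroup, because votes are counted in descending order of quality; the abstract does not say how ties are handled, so this is an assumption as well.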