Data preparation for data mining in medical data sets

Authors:
Grzegorz Ilczuk;Alicja Wakulicz-Deja
Affiliations:
Siemens AG Medical Solutions, Erlangen, Germany;Institute of Informatics, University of Silesia, Sosnowiec, Poland
Venue:
Transactions on rough sets VI
Year:
2007

Abstract

Data preparation is an important but time-consuming part of the data mining process. In this paper we describe a hierarchical method of text classification based on regular expressions. We use this method in the pre-processing stage of our data mining system to transform Latin free-text medical reports into a decision table. Such decision tables serve as input to a rough-set-based rule induction subsystem. We also compare the accuracy and scalability of our method with a standard approach based on dictionary phrases.
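
The abstract does not give the patterns or the hierarchy used in the paper, so the following is only a minimal sketch of how hierarchical, regular-expression-based classification of free text into a decision-table row might look. The names `Node`, `classify`, `all_paths`, `to_decision_row` and `HIERARCHY`, as well as the two Latin phrases and their patterns, are hypothetical and chosen purely for illustration; they are not taken from the authors' system.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Node:
    """One level of the hierarchy: a label, a pattern, and more specific children."""
    label: str
    pattern: str
    children: list["Node"] = field(default_factory=list)

    def __post_init__(self):
        # Compile once; reports are assumed to vary in capitalisation.
        self.regex = re.compile(self.pattern, re.IGNORECASE)


def classify(text, nodes, prefix=()):
    """Return label paths of matching nodes, descending only below a match."""
    paths = []
    for node in nodes:
        if node.regex.search(text):
            path = prefix + (node.label,)
            paths.append("/".join(path))
            paths.extend(classify(text, node.children, path))
    return paths


def all_paths(nodes, prefix=()):
    """Enumerate every attribute (label path) defined by the hierarchy."""
    for node in nodes:
        path = prefix + (node.label,)
        yield "/".join(path)
        yield from all_paths(node.children, path)


def to_decision_row(text, nodes):
    """Map one free-text report to a row of binary attributes."""
    matched = set(classify(text, nodes))
    return {attr: int(attr in matched) for attr in all_paths(nodes)}


# Hypothetical two-level hierarchy (illustrative patterns only).
HIERARCHY = [
    Node("diabetes", r"\bdiabetes\s+mellitus\b", [
        Node("type_2", r"\btyp(?:us|i)?\s*(?:2|II)\b"),
    ]),
    Node("hypertension", r"\bhypertonia\s+arterialis\b"),
]

row = to_decision_row("Diabetes mellitus typus II. Hypertonia arterialis.", HIERARCHY)
```

In this sketch a child pattern is evaluated only when its parent matched, so a report that mentions "Typus II" without "Diabetes mellitus" leaves the `diabetes/type_2` attribute at 0. A flat dictionary of phrases would have no such context, which is one plausible reason to organise the patterns hierarchically.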