Predicting breast cancer survivability: a comparison of three data mining methods

Authors:
Dursun Delen;Glenn Walker;Amit Kadam
Affiliations:
Department of Management Science and Information Systems, Oklahoma State University, 700 North Greenwood Avenue, Tulsa, OK 74106, USA;Department of Management Science and Information Systems, Oklahoma State University, 700 North Greenwood Avenue, Tulsa, OK 74106, USA;Department of Management Science and Information Systems, Oklahoma State University, 700 North Greenwood Avenue, Tulsa, OK 74106, USA
Venue:
Artificial Intelligence in Medicine
Year:
2005




Abstract

Objective: Predicting breast cancer survivability has long been a challenging research problem. Since the early days of this research, related fields have advanced considerably. Innovative biomedical technologies allow better explanatory prognostic factors to be measured and recorded; low-cost computer hardware and software allow high-volume, better-quality data to be collected and stored automatically; and better analytical methods allow these voluminous data to be processed effectively and efficiently. The main objective of this manuscript is to report on a research project in which we took advantage of these technological advancements to develop prediction models for breast cancer survivability.

Methods and material: We used two popular data mining algorithms (artificial neural networks and decision trees) along with a commonly used statistical method (logistic regression) to develop the prediction models using a large dataset (more than 200,000 cases). We also used 10-fold cross-validation to obtain an unbiased estimate of the performance of the three prediction models for comparison purposes.

Results: The decision tree (C5) was the best predictor, with 93.6% accuracy on the holdout sample (better than any prediction accuracy reported in the literature); artificial neural networks came second with 91.2% accuracy; and logistic regression was the worst of the three with 89.2% accuracy.

Conclusion: The comparative study of multiple prediction models for breast cancer survivability, using a large dataset along with 10-fold cross-validation, provided insight into the relative prediction ability of different data mining methods. Sensitivity analysis on the neural network models provided the prioritized importance of the prognostic factors used in the study.
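
The 10-fold cross-validation protocol mentioned in the abstract can be illustrated with a minimal, standard-library-only Python sketch. This is not the authors' implementation: the data are synthetic, and the two models (a majority-class baseline and a one-feature threshold rule) are simple stand-ins for the C5 decision tree, neural network, and logistic regression models used in the study. All function names (`k_fold_indices`, `cross_val_accuracy`, `fit_majority`, `fit_stump`) are hypothetical and chosen only for illustration.

```python
import random
from statistics import mean


def k_fold_indices(n, k=10, seed=0):
    """Shuffle indices 0..n-1 and deal them into k near-equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_val_accuracy(fit, X, y, k=10, seed=0):
    """Mean test accuracy over k folds; each fold is held out exactly once."""
    scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        test = set(held_out)
        train = [i for i in range(len(X)) if i not in test]
        # Fit on the other k-1 folds only, then score on the held-out fold.
        predict = fit([X[i] for i in train], [y[i] for i in train])
        correct = sum(predict(X[i]) == y[i] for i in held_out)
        scores.append(correct / len(held_out))
    return mean(scores)


def fit_majority(X, y):
    """Baseline stand-in: always predict the most frequent training label."""
    label = max(set(y), key=y.count)
    return lambda x: label


def fit_stump(X, y):
    """Stand-in model: one threshold on feature 0 (a depth-1 decision tree)."""
    best_t, best_acc = None, -1.0
    for t in sorted({x[0] for x in X}):
        acc = sum((x[0] >= t) == bool(lab) for x, lab in zip(X, y)) / len(y)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return lambda x: int(x[0] >= best_t)


# Synthetic toy data (NOT the study's data): the label is 1 when the
# feature exceeds 0.5, with roughly 10% of labels flipped as noise.
rng = random.Random(42)
X = [[rng.random()] for _ in range(500)]
y = [int(x[0] > 0.5) ^ int(rng.random() < 0.1) for x in X]

# Evaluate every model on the same folds so the comparison is like-for-like.
results = {name: cross_val_accuracy(fit, X, y)
           for name, fit in [("majority", fit_majority), ("stump", fit_stump)]}
for name, acc in results.items():
    print(f"{name}: {acc:.3f}")
```

Using the same seed for every model keeps the fold assignment identical across models, so differences in mean accuracy reflect the models rather than the split.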