Analysis of Breast Cancer Using Data Mining and Statistical Techniques

  • Authors:
  • Xiangchun Xiong;Yangon Kim;Yuncheol Baek;Dae Wong Rhee;Soo-Hong Kim

  • Affiliations:
  • Towson University;Towson University;Sangmyung University;Sangmyung University;Sangmyung University

  • Venue:
  • SNPD-SAWN '05 Proceedings of the Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing and First ACIS International Workshop on Self-Assembling Wireless Networks
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

Data Mining & Statistics Analysis is the search for valuable information in large volumes of data. It is now widely used in health care industry. Especially breast cancer is the second most cause of cancer and the second most dangerous cancer. The best way to improve a breast cancer victimýs chance of long-term survival is to detect it as early as possible. Currently there are three methods to diagnose breast cancer: mammography, FNA (fine needle aspirate) and surgical biopsy. The diagnose accuracy of mammography is from 68% to 79%, the accuracy of FNA is inconsistent with varying from 65% to 98%, the accuracy of a surgical biopsy is nearly 100%. The procedure of a surgical biopsy, however, is both unpleasant and costly. In this paper, we use a FNA with a data mining & statistics method to get an easy way to achieve a best result We combine some statistical methods such as PCA, PLS linear regression analysis with data mining methods such as select attribute, decision trees and association rules to find the unsuspected relationships. In addition, the experimental results are shown and discussed.