Discovering interobserver variability in the cytodiagnosis of breast cancer using decision trees and Bayesian networks

  • Authors:
  • Nicandro Cruz-Ramírez;Héctor-Gabriel Acosta-Mesa;Humberto Carrillo-Calvet;Rocío-Erandi Barrientos-Martínez

  • Affiliations:
  • Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Sebastián Camacho # 5, Col. Centro, C.P. 91000, Xalapa, Veracruz, Mexico;Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Sebastián Camacho # 5, Col. Centro, C.P. 91000, Xalapa, Veracruz, Mexico;Facultad de Ciencias, Universidad Nacional Autónoma de México, Circuito Exterior Ciudad Universitaria, México, D.F., Mexico;Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Sebastián Camacho # 5, Col. Centro, C.P. 91000, Xalapa, Veracruz, Mexico

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We evaluate the performance of two decision tree procedures and four Bayesian network classifiers as potential decision support systems in the cytodiagnosis of breast cancer. In order to test their performance thoroughly, we use two real-world databases containing 692 cases and 322 cases collected by a single observer and 19 observers, respectively. The results show that, in general, there are considerable differences in all tests (accuracy, sensitivity, specificity, PV+, PV- and ROC) when a specific classifier uses the single-observer dataset compared to those when this same classifier uses the multiple-observer dataset. These results suggest that different observers see different things: a problem known as interobserver variability. We graphically unveil such a problem by presenting the structures of the decision trees and Bayesian networks resultant from running both databases.