Associative Naïve Bayes classifier: Automated linking of gene ontology to medline documents

  • Authors:
  • Hyunki Kim;Su-Shing Chen

  • Affiliations:
  • Electronics and Telecommunications Research Institute, Daejeon 305-700, Republic of Korea;CAS-MPG Partner Institute of Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yue Yan Road, Shanghai 200031, China

  • Venue:
  • Pattern Recognition
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

We demonstrate a text-mining method, called associative Naive Bayes (ANB) classifier, for automated linking of MEDLINE documents to gene ontology (GO). The approach of this paper is a nontrivial extension of document classification methodology from a fixed set of classes C={c"1,c"2,...,c"n} to a knowledge hierarchy like GO. Due to the complexity of GO, we use a knowledge representation structure. With that structure, we develop the text mining classifier, called ANB classifier, which automatically links Medline documents to GO. To check the performance, we compare our datasets under several well-known classifiers: NB classifier, large Bayes classifier, support vector machine and ANB classifier. Our results, described in the following, indicate its practical usefulness.