Mining cell cycle literature using support vector machines

  • Authors:
  • Theodoros G. Soldatos;Georgios A. Pavlopoulos

  • Affiliations:
  • Life Biosystems GmbH, Heidelberg, Germany;ESAT-SCD / IBBT-K.U.Leuven Future Health Department, Katholieke Universiteit Leuven, Leuven, Belgium

  • Venue:
  • SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

While biomedical literature is rapidly increasing, text classification remains a challenge for researchers, curators and librarians. In the context of this work, we use the Caipirini (http://caipirini.org) service to report on the exploration of a literature corpus related to the G1, S, G2 and M phases of the human cell cycle respectively. We use Support Vector Machines (SVMs) and a well-studied dataset to compare each of the cell cycle phases against all others in order to find abstracts that are related to one specific phase at a time. Finally we measure the performance of the results using the standard accuracy, precision and recall metrics. We find differences between the results of each of the four phases and we compare with previous findings of relevant work. We conclude that the results concur and help interpreting the observed classification performance.