Biomedical knowledge navigation by literature clustering

  • Authors:
  • Yasunori Yamamoto;Toshihisa Takagi

  • Affiliations:
  • Department of Computational Biology, University of Tokyo, Kibanto CB01, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan;Department of Computational Biology, University of Tokyo, Kibanto CB01, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

There is an urgent need for a system that facilitates surveys by biomedical researchers and the subsequent formulation of hypotheses based on the knowledge stored in literature. One approach is to cluster papers discussing a topic of interest and reveal its sub-topics that allow researchers to acquire an overview of the topic. We developed such a system called McSyBi. It accepts a set of citation data retrieved with PubMed and hierarchically and non-hierarchically clusters them based on the titles and the abstracts using statistical and natural language processing methods. A novel point is that McSyBi allows its users to change the clustering by entering a MeSH term or UMLS Semantic Type, and therefore they can see a set of citation data from multiple aspects. We evaluated McSyBi quantitatively and qualitatively: clustering of 27 sets of citation data (40643 different papers) and scrutiny of several resultant clusters. While non-hierarchical clustering provides us with an overview of the target topic, hierarchical clustering allows us to see more details and relationships among citation data. McSyBi is freely available at http://textlens.hgc.jp/McSyBi/.