Exploiting Ontology Structure and Patterns of Annotation to Mine Significant Associations between Pairs of Controlled Vocabulary Terms

  • Authors:
  • Woei-Jyh Lee;Louiqa Raschid;Hassan Sayyadi;Padmini Srinivasan

  • Affiliations:
  • University of Maryland, College Park, USA MD 20742;University of Maryland, College Park, USA MD 20742;University of Maryland, College Park, USA MD 20742;The University of Iowa, Iowa City, USA IA 52242

  • Venue:
  • DILS '08 Proceedings of the 5th international workshop on Data Integration in the Life Sciences
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

There is significant knowledge captured through annotations on the life sciences Web. In past research, we developed a methodology of supportand confidencemetrics from association rule mining, to mine the association bridge(of termlinks) between pairs of controlled vocabulary (CV) terms across two ontologies. Our (naive) approach did not exploit the following: implicit knowledge captured via the hierarchical is-astructure of ontologies, and patterns of annotation in datasets that may impact the distribution of parent/child or sibling CV terms. In this research, we consider this knowledge. We aggregate termlinksover the siblings of a parent CV term and use them as additional evidence to boost supportand confidencescores in the associations of the parent CV term. A weight factor (茂戮驴) reflects the contribution from the child CV terms; its value can be varied to reflect a variance of confidence values among the sibling CV terms of some parent CV term. We illustrate the benefits of exploiting this knowledge through experimental evaluation.