Discovering temporal bisociations for linking concepts over time
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
PIMiner: a web tool for extraction of protein interactions from biomedical literature
International Journal of Data Mining and Bioinformatics
Hi-index | 0.00 |
The novel connection between Raynaud disease and fish oils was uncovered from two disjointed biomedical literature sets by Swanson in 1986. Since then, there have been many approaches to uncover novel connections by mining the biomedical literature. One of the popular approaches is to adapt the association rule (AR) method to automatically identify implicit novel connections between concept A and concept C from two disjointed sets of documents through intermediate B concept. Since A and C concepts do not occur together in the same data set, the mining goal is to find novel connection among A and C concepts in the disjoint data sets. It first applies association rule to the two disjointed biomedical literature sets separately to generate two rule sets (A→B, B→C), and then applies transitive law to get the novel connections A→C. However, this approach generates a huge number of possible connections among the millions of biomedical concepts and a lot of these hypothetical connections are spurious, useless, and-or biologically meaningless. Thus it is essential to develop new approach to generate highly likely novel and biologically relevant connections among the biomedical concepts. This paper presents a biomedical semantic-based association rule system (Bio-SARS) that significantly reduce spurious-useless-biologically irrelevant connections through semantic filtering. Compared to other approaches such as latent semantic indexing and traditional association rule-based approach, our approach generates much fewer rules and a lot of these rules represent relevant connections among biological concepts. © 2009 Wiley Periodicals, Inc.