Extraction of protein interaction data: a comparative analysis of methods in use
EURASIP Journal on Bioinformatics and Systems Biology
Finding optimal parameters for edit distance based sequence classification is NP-hard
Proceedings of the KDD-09 Workshop on Statistical and Relational Learning in Bioinformatics
Evolutionary hypernetwork classifiers for protein-proteininteraction sentence filtering
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Identifying interaction sentences from biological literature using automatically extracted patterns
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Towards effective sentence simplification for automatic processing of biomedical text
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
BioProber2.0: a unified biomedical workbench with mining and probing literatures
Proceedings of the 2nd International Conference on Interaction Sciences: Information Technology, Culture and Human
ICIC'07 Proceedings of the intelligent computing 3rd international conference on Advanced intelligent computing theories and applications
Mining the relationship between gene and disease from literature
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 7
Hi-index | 3.84 |
Motivation: Protein-protein interactions play critical roles in biological processes, and many biologists try to find or to predict crucial information concerning these interactions. Before verifying interactions in biological laboratory work, validating them from previous research is necessary. Although many efforts have been made to create databases that store verified information in a structured form, much interaction information still remains as unstructured text. As the amount of new publications has increased rapidly, a large amount of research has sought to extract interactions from the text automatically. However, there remain various difficulties associated with the process of applying automatically generated results into manually annotated databases. For interactions that are not found in manually stored databases, researchers attempt to search for abstracts or full papers. Results: As a result of a search for two proteins, PubMed frequently returns hundreds of abstracts. In this paper, a method is introduced that validates protein-protein interactions from PubMed abstracts. A query is generated from two given proteins automatically and abstracts are then collected from PubMed. Following this, target proteins and their synonyms are recognized and their interaction information is extracted from the collection. It was found that 67.37% of the interactions from DIP-PPI corpus were found from the PubMed abstracts and 87.37% of interactions were found from the given full texts. Availability: Contact authors. Contact: janghc@etri.re.kr