Inductive Logic Programming: Techniques and Applications
Inductive Logic Programming: Techniques and Applications
Information Retrieval Meets Gene Analysis
IEEE Intelligent Systems
The role of domain information in Word Sense Disambiguation
Natural Language Engineering
Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data
Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data
A General Similarity Framework for Horn Clause Logic
Fundamenta Informaticae
IEEE Transactions on Information Theory
Hi-index | 0.00 |
We present a suite of Machine Learning and knowledge-based components for textual-profile based gene prioritization. Most genetic diseases are characterized by many potential candidate genes that can cause the disease. Gene expression analysis typically produces a large number of co-expressed genes that could be potentially responsible for a given disease. Extracting prior knowledge from text-based genomic information sources is essential in order to reduce the list of potential candidate genes to be then further analyzed in laboratory. In this paper we present a suite of Machine Learning algorithms and knowledge-based components for improving the computational gene prioritization process. The suite includes basic Natural Language Processing capabilities, advanced text classification and clustering algorithms, robust information extraction components based on qualitative and quantitative keyword extraction methods and exploitation of lexical knowledge bases for semantic text processing.