IEEE Transactions on Pattern Analysis and Machine Intelligence
Explorations in Automatic Thesaurus Discovery
Explorations in Automatic Thesaurus Discovery
Supervised Learning of Term Similarities
IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Term Clustering Using a Corpus-Based Similarity Measure
TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Identifying terms by their family and friends
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
A methodology for automatic term recognition
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Surface grammatical analysis for the extraction of terminological noun phrases
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Hierarchical clustering of words
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
A methodology for terminology-based knowledge acquisition and integration
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Terminology-driven mining of biomedical literature
Proceedings of the 2003 ACM symposium on Applied computing
Using automatically learnt verb selectional preferences for classification of biomedical terms
Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Using name-internal and contextual features to classify biological terms
Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
On the Identification of Goals in Stakeholders' Dialogs
Innovations for Requirement Analysis. From Stakeholders' Needs to Formal Designs
A symbolic approach to automatic multiword term structuring
Computer Speech and Language
Journal of Biomedical Informatics
Natural language processing: mature enough for requirements documents analysis?
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Hi-index | 0.00 |
Term recognition and clustering are key topics in automatic knowledge acquisition and text mining. In this paper we present a novel approach to the automatic discovery of term similarities, which serves as a basis for both classification and clustering of domain-specific concepts represented by terms. The method is based on automatic extraction of significant patterns in which terms tend to appear. The approach is domain independent: it needs no manual description of domain-specific features and it is based on knowledge-poor processing of specific term features. However, automatically collected patterns are domain specific and identify significant contexts in which terms are used. Beside features that represent contextual patterns, we use lexical and functional similarities between terms to define a combined similarity measure. The approach has been tested and evaluated in the domain of molecular biology, and preliminary results are presented.