Automatic discovery of term similarities using pattern mining

Authors:
Goran Nenadić;Irena Spasić;Sophia Ananiadou
Affiliations:
University of Salford, Salford, UK;University of Salford, Salford, UK;University of Salford, Salford, UK
Venue:
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Year:
2002

Citing 10
Cited 7

Similarity Measures

IEEE Transactions on Pattern Analysis and Machine Intelligence
Explorations in Automatic Thesaurus Discovery

Explorations in Automatic Thesaurus Discovery
Supervised Learning of Term Similarities

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Term Clustering Using a Corpus-Based Similarity Measure

TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Identifying terms by their family and friends

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
A methodology for automatic term recognition

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Surface grammatical analysis for the extraction of terminological noun phrases

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Hierarchical clustering of words

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
A methodology for terminology-based knowledge acquisition and integration

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1

Terminology-driven mining of biomedical literature

Proceedings of the 2003 ACM symposium on Applied computing
Using automatically learnt verb selectional preferences for classification of biomedical terms

Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Using name-internal and contextual features to classify biological terms

Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
On the Identification of Goals in Stakeholders' Dialogs

Innovations for Requirement Analysis. From Stakeholders' Needs to Formal Designs
A symbolic approach to automatic multiword term structuring

Computer Speech and Language
Methodological Review: Natural Language Processing methods and systems for biomedical ontology learning

Journal of Biomedical Informatics
Natural language processing: mature enough for requirements documents analysis?

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Term recognition and clustering are key topics in automatic knowledge acquisition and text mining. In this paper we present a novel approach to the automatic discovery of term similarities, which serves as a basis for both classification and clustering of domain-specific concepts represented by terms. The method is based on automatic extraction of significant patterns in which terms tend to appear. The approach is domain independent: it needs no manual description of domain-specific features and it is based on knowledge-poor processing of specific term features. However, automatically collected patterns are domain specific and identify significant contexts in which terms are used. Beside features that represent contextual patterns, we use lexical and functional similarities between terms to define a combined similarity measure. The approach has been tested and evaluated in the domain of molecular biology, and preliminary results are presented.