Explorations in Automatic Thesaurus Discovery
Explorations in Automatic Thesaurus Discovery
An endogeneous corpus-based method for structural noun phrase disambiguation
EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Expansion of multi-word terms for indexing and retrieval using morphology and syntax
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Statistical models for unsupervised prepositional phrase attachment
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Terminology finite-state preprocessing for computational LFG
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Syntagmatic and paradigmatic representations of term variation
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
EPIA '99 Proceedings of the 9th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Exogeneous and endogeneous approaches to semantic categorization of unknown technical terms
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
The head-modifier principle and multilingual term extraction
Natural Language Engineering
Detecting novel compounds: the role of distributional evidence
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Unsupervised, corpus-based method for extending a biomedical terminology
BioMed '02 Proceedings of the ACL-02 workshop on Natural language processing in the biomedical domain - Volume 3
Mining semantically related terms from biomedical literature
ACM Transactions on Asian Language Information Processing (TALIP)
Effect of word density on measuring words association
COMPUTE '08 Proceedings of the 1st Bangalore Annual Compute Conference
Design and development of a concept-based multi-document summarization system for research abstracts
Journal of Information Science
Aligning medical domain ontologies for clinical query extraction
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
AcroDef: a quality measure for discriminating expansions of ambiguous acronyms
CONTEXT'07 Proceedings of the 6th international and interdisciplinary conference on Modeling and using context
Deriving clinical query patterns from medical corpora using domain ontologies
WBIE '09 Proceedings of the Workshop on Biomedical Information Extraction
Ontology learning for cost-effective large-scale semantic annotation of web service interfaces
EKAW'10 Proceedings of the 17th international conference on Knowledge engineering and management by the masses
The REG summarization system with question reformulation at QA@INEX track 2010
INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
Pruning terminology extracted from a specialized corpus for CV ontology acquisition
OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part II
Comparison of feature-level learning methods for mining online consumer reviews
Expert Systems with Applications: An International Journal
Ontology acquisition from web service descriptions
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Hi-index | 0.00 |
A novel technique for automatic thesaurus construction is proposed. It is based on the complementary use of two tools: (1) a Term Extraction tool that acquires term candidates from tagged corpora through a shallow grammar of noun phrases, and (2) a Term Clustering tool that groups syntactic variants (insertions). Experiments performed on corpora in three technical domains yield clusters of term candidates with precision rates between 93% and 98%.