Word association norms, mutual information, and lexicography
Computational Linguistics
Visualizing science by citation mapping
Journal of the American Society for Information Science
In vitro evaluation of a program for machine-aided indexing
Information Processing and Management: an International Journal
PKDD '98 Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Hierarchical clustering of words
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Learning Domain Ontologies from Document Warehouses and Dedicated Web Sites
Computational Linguistics
Automatic discovery of term similarities using pattern mining
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Conceptual structuring through term variations
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Text mining without document context
Information Processing and Management: an International Journal - Special issue: Informetrics
Phrase clustering without document context
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Hi-index | 0.00 |
This paper presents a three-level structuring of multiword terms basing on lexical inclusion, WordNet similarity and a clustering approach. Term clustering by automatic data analysis methods offers an interesting way of organizing a domain's knowledge structure, useful for several information-oriented tasks like science and technology watch, textmining, computer-assisted ontology population, Question Answering (Q-A). This paper explores how this three-level term structuring brings to light the knowledge structures from a corpus of genomics and compares the mapping of the domain topics against a hand-built ontology (the GENIA ontology). Ways of integrating the results into a Q-A system are discussed.