Elements of information theory
Elements of information theory
Linux network administrator's guide
Linux network administrator's guide
Special Edition Using Windows NT 4.O
Special Edition Using Windows NT 4.O
Accurate methods for the statistics of surprise and coincidence
Computational Linguistics - Special issue on using large corpora: I
Retrieving collocations from text: Xtract
Computational Linguistics - Special issue on using large corpora: I
Mostly-unsupervised statistical segmentation of Japanese: applications to kanji
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Termight: identifying and translating technical terminology
ANLC '94 Proceedings of the fourth conference on Applied natural language processing
Chinese word segmentation without using lexicon and hand-crafted training data
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Identifying terms by their family and friends
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
Towards automatic extraction of monolingual and bilingual terminology
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Surface grammatical analysis for the extraction of terminological noun phrases
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
IRank: A Term-Based Innovation Ranking System for Conferences and Scholars
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Towards the web of concepts: extracting concepts from large datasets
Proceedings of the VLDB Endowment
A comprehensive dictionary of multiword expressions
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
A prediction model for web search hit counts using word frequencies
Journal of Information Science
Dual filtering strategy for chinese term extraction
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Data & Knowledge Engineering
Research on automatic acquisition method of Chinese domain ontology backbone based on Hownet
International Journal of Wireless and Mobile Computing
Hi-index | 0.00 |
Term extraction is an important problem in natural language processing. In this paper, we propose a language independent statistical corpus-based term extraction algorithm. In previous approaches, evaluation has been subjective, at best relying on a lexicographer's judgement. We evaluate the quality of our term extractor by assessing its predictiveness on an unseen corpus using perplexity. Second, we evaluate the precision and recall of our extractor by comparing the Chinese words in a segmented corpus with the words extracted by our system.