Using Compression to Identify Acronyms in Text
DCC '00 Proceedings of the Conference on Data Compression
Using SVM to Extract Acronyms from Text
Soft Computing - A Fusion of Foundations, Methodologies and Applications
A term recognition approach to acronym recognition
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
EACL '06 Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters & Demonstrations
Acronym extraction and disambiguation in large-scale organizational web pages
Proceedings of the 18th ACM conference on Information and knowledge management
Seeking Acronym Definitions: a Web-based Approach
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Mining, ranking, and using acronym patterns
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Hi-index | 0.00 |
This paper addresses the problem of extracting acronyms and their definitions from large documents in a setting, when high recall is required and user feedback is available. We propose a three step approach to deal with the problem. First, acronym candidates are extracted using a weak regular expression. This step results in a list of acronyms with high recall but low precision rates. Second, definitions are constructed for every acronym candidate from its surrounding text. And last, a classifier is used to select genuine acronym-definition pairs. At the last step we use relevance feedback mechanism to tune the classifier model for every particular document. This allows achieving reasonable precision without losing recall. As opposed to existing approaches, either created to be generic and domain independent or tuned to one particular domain, our method is adaptive to an input document. We evaluate the proposed approach using three datasets from different domains. The experiments prove the validity of the presented ideas.