Automatic expansion of abbreviations by using context and character information
Information Processing and Management: an International Journal
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
HLT '91 Proceedings of the workshop on Speech and Natural Language
Resolving abbreviations to their senses in Medline
Bioinformatics
Journal of Computer Science and Technology
Automatic Expansion of Chinese Abbreviations by Web Mining
AICI '09 Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence
Hi-index | 0.00 |
This paper presents a hybrid approach to Chinese abbreviation expansion. In this study, each short-form in Chinese text is assumed to be created by the method of reduction and the method of elimination or generalization, respectively. A mapping table between short words and long words and a dictionary of non-reduced short-form/full-form pairs are thus applied to generate the respective expansion candidates. Then, a hidden Markov model (HMM) based disambiguation is employed to rank these candidates and select a proper expansion for each ambiguous abbreviation. In order to improve expansion accuracy, some linguistic knowledge like discourse information and abbreviation patterns are further employed to double-check the expanded results and revise some error expansions if any. The proposed approach was evaluated on an abbreviation-expanded corpus built from the Peking University Corpus. The results showed that a recall of 83.8% and a precision of 86.3% can be achieved on average for different types of Chinese abbreviations.