Using SVM to Extract Acronyms from Text

  • Authors:
  • Jun Xu;Yalou Huang

  • Affiliations:
  • College of Software, Nankai University, No. 94 Weijin Road, 300071, Tianjin, China;College of Software, Nankai University, No. 94 Weijin Road, 300071, Tianjin, China

  • Venue:
  • Soft Computing - A Fusion of Foundations, Methodologies and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper addresses the problem of extracting acronyms and their expansions from text. We propose a support vector machines (SVM) based approach to deal with the problem. First, all likely acronyms are identified using heuristic rules. Second, expansion candidates are generated from surrounding text of acronyms. Last, SVM model is employed to select the genuine expansions. Analysis shows that the proposed approach has the advantages of saving over the conventional rule based approaches. Experimental results show that our approach outperforms the baseline method of using rules. We also show that the trained SVM model is generic and can adapt to other domains easily.