Restoring Punctuation and Casing in English Text

  • Authors:
  • Timothy Baldwin;Manuel Paul Joseph

  • Affiliations:
  • Department of Computer Science and Software Engineering, University of Melbourne, Australia 3010 and NICTA Victoria Laboratories, University of Melbourne, Australia 3010;Department of Computer Science and Software Engineering, University of Melbourne, Australia 3010

  • Venue:
  • AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper explores the use of machine learning techniques to restore punctuation and case in English text, as part of which it investigates the co-dependence of case information and punctuation. We achieve an overall F-score of .619 for the task using a variety of lexical and contextual features, and iterative retagging.