NLTK: The Natural Language Toolkit

  • Authors: Steven Bird
  • Affiliations: University of Melbourne, Victoria, Australia, and University of Pennsylvania, Philadelphia, PA
  • Venue: COLING-ACL '06: Proceedings of the COLING/ACL on Interactive presentation sessions
  • Year: 2006

Abstract

The Natural Language Toolkit is a suite of program modules, data sets and tutorials supporting research and teaching in computational linguistics and natural language processing. NLTK is written in Python and distributed under the GPL open source license. Over the past year the toolkit has been rewritten, simplifying many linguistic data structures and taking advantage of recent enhancements in the Python language. This paper reports on the simplified toolkit and explains how it is used in teaching NLP.