I will shoot your shopping down and you can shoot all my tins: automatic lexical acquisition from the CHILDES database

  • Authors:
  • Paula Buttery;Anna Korhonen

  • Affiliations:
  • University of Cambridge, Cambridge, UK;University of Cambridge, Cambridge, UK

  • Venue:
  • CACLA '07 Proceedings of the Workshop on Cognitive Aspects of Computational Language Acquisition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Empirical data regarding the syntactic complexity of children's speech is important for theories of language acquisition. Currently much of this data is absent in the annotated versions of the childes database. In this perliminary study, we show that a state-of-the-art subcategorization acquisition system of Preiss et al. (2007) can be used to extract large-scale subcategorization (frequency) information from the (i) child and (ii) child-directed speech within the childes database without any domain-specific tuning. We demonstrate that the acquired information is sufficiently accurate to confirm and extend previously reported research findings. We also report qualitative results which can be used to further improve parsing and lexical acquisition technology for child language data in the future.