Learning meaningful units from multimodal input: the effect of interaction strategies
Proceedings of the 2nd Workshop on Child, Computer and Interaction
This paper reports on-going research from a thesis project investigating a computational model of early language acquisition. The model discovers word-like units from cross-modal input data and builds continuously evolving internal representations within a cognitive model of memory. Current cognitive theories suggest that young infants employ general-purpose statistical mechanisms, exploiting regularities in their environment to acquire language skills. The discovery of lexical units is modelled on this behaviour: the system detects repeating patterns in the speech signal and associates them with discrete abstract semantic tags. In its current state, the algorithm is a novel approach to segmenting speech directly from the acoustic signal in an unsupervised manner, freeing it from any pre-defined lexicon. By the end of the project, the architecture is planned to acquire language and communicative skills in an online manner and to carry out robust speech recognition. Preliminary results already show that the method segments and builds accurate internal representations of important lexical units as 'emergent' properties of the cross-modal data.
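The abstract does not specify the pattern-matching algorithm, but the core idea of pairing recurring stretches of the input signal with co-occurring semantic tags can be illustrated with a toy sketch. The example below is an assumption for illustration only, not the paper's method: utterances are strings of pseudo-acoustic symbols (a stand-in for the real acoustic signal), each paired with one semantic tag, and for every tag we keep the substring that co-occurs with it most reliably as that tag's 'word-like unit'. The function name `discover_patterns` and the toy data are invented for this sketch.

```python
from collections import defaultdict

def discover_patterns(utterances, min_len=2, max_len=4):
    """Toy cross-modal pattern discovery (illustrative only): score
    candidate substrings by how consistently they co-occur with each
    semantic tag, preferring longer, tag-specific units."""
    tag_counts = defaultdict(lambda: defaultdict(int))  # tag -> substring -> utterance count
    total_counts = defaultdict(int)                     # substring -> utterance count overall
    tag_utts = defaultdict(int)                         # tag -> number of utterances
    for symbols, tag in utterances:
        tag_utts[tag] += 1
        # Collect each substring at most once per utterance.
        seen = set()
        for n in range(min_len, max_len + 1):
            for i in range(len(symbols) - n + 1):
                seen.add(symbols[i:i + n])
        for sub in seen:
            tag_counts[tag][sub] += 1
            total_counts[sub] += 1
    lexicon = {}
    for tag, counts in tag_counts.items():
        # Prefer substrings present in every utterance of the tag,
        # rare outside it, and as long as possible.
        lexicon[tag] = max(counts, key=lambda s: (counts[s] / tag_utts[tag],
                                                  counts[s] / total_counts[s],
                                                  len(s)))
    return lexicon

# Toy data: the pattern 'badu' recurs with the tag DOG and 'kipo' with
# BALL, embedded in varying filler symbols.
data = [
    ("xxbaduzz", "DOG"),
    ("qbaduwy", "DOG"),
    ("mkipon", "BALL"),
    ("kipoqr", "BALL"),
]
print(discover_patterns(data))  # recovers 'badu' for DOG, 'kipo' for BALL
```

In the real system the matching would have to operate on continuous acoustic features (e.g. with dynamic time warping rather than exact substring equality), but the co-occurrence logic, repeated patterns becoming lexical units through their association with tags, is the same.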