GraSp: grammar learning from unlabelled speech corpora

Authors:
Peter Juel Henrichsen
Affiliations:
Center for Computational Modelling of Language, Frederiksberg, Denmark
Venue:
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Year:
2002

Citing 3
Cited 1

The calculus of constructions

Information and Computation - Semantics of Data Types
Type-theoretical grammar

Type-theoretical grammar
Structural Equations in Language Learning

LACL '01 Proceedings of the 4th International Conference on Logical Aspects of Computational Linguistics

Limitations of current grammar induction algorithms

ACL '07 Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the ongoing project Computational Models of First Language Acquisition, together with its current product, the learning algorithm GraSp. GraSp is designed specifically for inducing grammars from large, unlabelled corpora of spontaneous (i.e. unscripted) speech. The learning algorithm does not assume a predefined grammatical taxonomy; rather the determination of categories and their relations is considered as part of the learning task. While GraSp learning can be used for a range of practical tasks, the long-term goal of the project is to contribute to the debate of innate linguistic knowledge - under the hypothesis that there is no such.