Building Minority Language Corpora by Learning to Generate Web Search Queries
Knowledge and Information Systems
An ontology for accessing transcription systems (OATS)
AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
An ontology for accessing transcription systems
Language Resources and Evaluation
Hi-index | 0.00 |
Some languages' orthographic properties allow written data to be used for phonological research. This paper reports on an on-going project that uses a web-derived text corpus to study the phonology of Tagalog, a language for which large corpora are not otherwise available. Novel findings concerning the phenomenon of intervocalic tapping are discussed in detail, and an overview of other phonological phenomena in the language that can be investigated through written data is given.