Zipf's Law is an empirical law that has been applied to many kinds of observations, and many studies have examined how well it fits different languages. This study examines how well Turkish conforms to Zipf's Law and estimates the Mandelbrot constants (c and B) using a large-scale Turkish corpus (TurCo). To determine these constants, different c and B values were examined and compared using the coefficient of determination. The most suitable B value for Turkish was found to be smaller than 1, as for Korean, since both languages show agglutinative characteristics; the c value was found to be 0.27.
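The search over c and B values could be sketched roughly as follows. This is only an illustration, not the study's actual procedure. It assumes the simplified form p(r) = c / r^B for the relative frequency of the word at rank r, which the abstract does not spell out, and it scores each candidate pair by the coefficient of determination computed on log frequencies. The function names, the grids, and the synthetic data (including the toy value B = 0.9) are hypothetical.

```python
import math


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def fit_zipf_constants(rel_freqs, c_grid, b_grid):
    """Grid-search c and B for the assumed model p(r) = c / r**B.

    rel_freqs must be sorted by rank (most frequent word first).
    Returns (best_r2, best_c, best_b), scored on log frequencies.
    """
    ranks = range(1, len(rel_freqs) + 1)
    log_obs = [math.log(p) for p in rel_freqs]
    best = None
    for c in c_grid:
        for b in b_grid:
            log_pred = [math.log(c) - b * math.log(r) for r in ranks]
            score = r_squared(log_obs, log_pred)
            if best is None or score > best[0]:
                best = (score, c, b)
    return best


# Synthetic, unnormalized toy data (NOT taken from TurCo): generated with
# c = 0.27 as reported in the abstract and a hypothetical B = 0.9.
toy_freqs = [0.27 / r ** 0.9 for r in range(1, 51)]
c_grid = [i / 100 for i in range(5, 51)]    # candidate c: 0.05 .. 0.50
b_grid = [i / 100 for i in range(50, 151)]  # candidate B: 0.50 .. 1.50

best_r2, best_c, best_b = fit_zipf_constants(toy_freqs, c_grid, b_grid)
print(f"R^2={best_r2:.4f}, c={best_c}, B={best_b}")
```

On a real corpus, `rel_freqs` would be each word's count divided by the total token count, sorted in descending order. Scoring in log space keeps the few highest-frequency words from dominating the fit; scoring on raw frequencies is an equally plausible choice, and the abstract does not say which was used.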