A language and character set determination method based on N-gram statistics
ACM Transactions on Asian Language Information Processing (TALIP)
UbiCrawler: a scalable fully distributed web crawler
Software—Practice & Experience
Proceedings of the 15th international conference on World Wide Web
Geographic locations of web servers
Proceedings of the 15th international conference on World Wide Web
Multilingual ICT education: language observatory as a monitoring instrument
SEARCC '05 Proceedings of the 2005 South East Asia Regional Computer Science Confederation (SEARCC) Conference - Volume 46
IEICE - Transactions on Information and Systems
Country domain governance: an analysis by data-mining of country domains
Artificial Life and Robotics
Hi-index | 0.00 |
The first part of the paper provides a brief description of the Language Observatory Project (LOP) and highlights the major technical difficulties to be challenged. The latter part gives how we responded to these difficulties by adopting UbiCrawler as a data collecting engine for the project. An interactive collaboration between the two groups is producing quite satisfactory results.