On the Web, the use of languages other than English, such as Amharic, has been growing exponentially. The number of Amharic web documents, as well as the number of Internet users in Ethiopia, is growing dramatically. However, the major search engines have lagged behind in providing indexing, stemming, and search features that handle this language. A web search engine designed and implemented around the typical characteristics of Amharic is therefore needed. In this paper, we design an Amharic search engine for Amharic web documents and briefly discuss the algorithms used to implement it. The Crawler, Indexer, and Query Engine are the basic components of this search engine. The engine's handling of the typical characteristics of Amharic was tested through its support for morphological variants and Amharic aliases. For experimentation, two runs of the crawler were conducted using 10 threads that crawl in parallel.
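The three components named above can be illustrated with a minimal Python sketch. It is not the paper's implementation. The in-memory `PAGES` "web", the `ALIAS_ROWS` table, and all function names are hypothetical. The sketch also assumes that "aliases" means spellings that differ only in homophone Ethiopic characters (for example ሰ and ሠ), and it omits stemming of morphological variants entirely.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
PAGES = {
    "a": ("ሰላም ኢትዮጵያ", ["b", "c"]),
    "b": ("ሠላም አዲስ አበባ", ["a"]),
    "c": ("ፀሐይ", []),
}

# Assumed alias table: first code point of a homophone row in the Unicode
# Ethiopic block -> first code point of the row chosen as canonical.
ALIAS_ROWS = {
    0x1210: 0x1200,  # ሐ -> ሀ
    0x1280: 0x1200,  # ኀ -> ሀ
    0x1220: 0x1230,  # ሠ -> ሰ
    0x12D0: 0x12A0,  # ዐ -> አ
    0x1340: 0x1338,  # ፀ -> ጸ
}


def normalize(token):
    """Map each homophone character to its canonical row, keeping the vowel order."""
    out = []
    for ch in token:
        row, order = ord(ch) & ~7, ord(ch) & 7
        out.append(chr(ALIAS_ROWS.get(row, row) + order))
    return "".join(out)


def fetch(url):
    """Stand-in for an HTTP fetch; unknown URLs yield an empty page."""
    return PAGES.get(url, ("", []))


def crawl(seeds, workers=10):
    """Crawler: breadth-first crawl, fetching each frontier level in parallel."""
    seen, frontier, docs = set(seeds), list(seeds), {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while frontier:
            results = pool.map(fetch, frontier)
            next_frontier = []
            for url, (text, links) in zip(frontier, results):
                docs[url] = text
                for link in links:
                    if link not in seen:
                        seen.add(link)
                        next_frontier.append(link)
            frontier = next_frontier
    return docs


def build_index(docs):
    """Indexer: inverted index from normalized token to the set of URLs containing it."""
    index = defaultdict(set)
    for url, text in docs.items():
        for token in text.split():
            index[normalize(token)].add(url)
    return index


def search(index, query):
    """Query Engine: URLs that contain every normalized query token."""
    postings = [index.get(normalize(t), set()) for t in query.split()]
    return sorted(set.intersection(*postings)) if postings else []
```

With this sketch, `search(build_index(crawl(["a"])), "ሠላም")` matches both page `a` and page `b`, because the two spellings normalize to the same token. A real engine would replace `fetch` with network requests and add an Amharic stemmer before indexing.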