Design and implementation-algorithms of Amharic search engine system for Amharic web contents

Authors:
Hassen Redwan;Solomon Atnafu
Affiliations:
Department of Computer Science, Addis Ababa University, Ethiopia;Department of Computer Science, Addis Ababa University, Ethiopia
Venue:
NTMS'09 Proceedings of the 3rd international conference on New technologies, mobility and security
Year:
2009

Citing 4
Cited 0

Maintaining distributed hypertext infostructures: welcome to MOMspider's Web

Selected papers of the first conference on World-Wide Web
WebCutter: a system for dynamic and tailorable site mapping

Selected papers from the sixth international conference on World Wide Web
A language and character set determination method based on N-gram statistics

ACM Transactions on Asian Language Information Processing (TALIP)
WebKhoj: Indian language IR from multiple character encodings

Proceedings of the 15th international conference on World Wide Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

On the Web, the use of languages other than English (e.g. Amharic language) has been growing exponentially. The number of web documents in Amharic language as well as Internet users in Ethiopia is growing dramatically. However, the major search engines have been lagging behind in providing indexes, stemming and search features to handle this language. Therefore, the design and implementation of web search engine that considers the typical characteristics of the Amharic language is needed. In this paper, we design Amharic Search Engine system for Amharic language web documents and briefly discuss the algorithms for implementing the engine. The Crawler, Indexer and Query Engine are the basic components of this search engine. Typical characteristics of the Amharic language were considered by testing the engine for morphological variants as well as Amharic aliases support. For experimentation, two runs of the crawler were conducted by using 10 threads that crawl in parallel.