Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Web page classification: Features and algorithms
ACM Computing Surveys (CSUR)
Emerging Technologies of Text Mining: Techniques and Applications
Emerging Technologies of Text Mining: Techniques and Applications
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Data mining with an ant colony optimization algorithm
IEEE Transactions on Evolutionary Computation
Hi-index | 0.00 |
Our research deals with classification of Arabic web pages. This field is challenging because limited research has been done in this field so far, and currently available tools do not support Arabic language. The fact remains that Arabic has various complex and discrete characteristics as compared to those of other languages: highly inflectional and derivational, the writing direction, the change of characters shapes based on their location, the absence of capitalization, etc. We have developed an environment that consist of two parts: The learning phase which facilitates the essential preprocessing tasks for Arabic web pages using several methods and tools. The second part classifies a web page by applying the best parameters setups.