In this study we used domain engineering as a method for gaining a deeper formal understanding of a class of algorithms. Specifically, we analyzed six stemming algorithms from four different sub-domains of the conflation algorithms domain and developed formal domain models and generators based on these models. The application generator produces source code not only for affix removal stemmers but also for successor variety, table lookup, and n-gram stemmers. The generated stemmers were compared with manually developed stemmers in terms of stem similarity, source and executable sizes, and development and execution times. Five of the stemmers generated by the application generator produced stems that were more than 99.9% identical to those of the manually developed stemmers. Some of the generated stemmers were as efficient as their manual equivalents and some were not.
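The stem-similarity comparison can be illustrated with a minimal sketch. It assumes that similarity means the percentage of input words for which two stemmers return the same stem; the abstract does not define the measure, so this is an assumption. The two toy stemmers and the names `strip_suffix_stemmer`, `table_lookup_stemmer`, and `stem_agreement` are hypothetical and are not the stemmers or the generator from the study.

```python
from typing import Callable, Iterable

Stemmer = Callable[[str], str]


def strip_suffix_stemmer(word: str) -> str:
    """Toy affix-removal stemmer (hypothetical): strips one common suffix."""
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def table_lookup_stemmer(word: str) -> str:
    """Toy table-lookup stemmer (hypothetical): falls back to the word itself."""
    table = {"running": "run", "jumped": "jump", "cats": "cat"}
    return table.get(word, word)


def stem_agreement(a: Stemmer, b: Stemmer, words: Iterable[str]) -> float:
    """Return the percentage of words for which both stemmers give the same stem."""
    word_list = list(words)
    if not word_list:
        raise ValueError("need at least one word to compare stemmers")
    same = sum(1 for w in word_list if a(w) == b(w))
    return 100.0 * same / len(word_list)


if __name__ == "__main__":
    sample = ["running", "jumped", "cats", "boxes"]
    print(stem_agreement(strip_suffix_stemmer, table_lookup_stemmer, sample))
```

On this four-word sample the two toy stemmers agree on "jumped" and "cats" and disagree on "running" and "boxes", so the sketch reports 50.0. A real comparison would run a generated stemmer and its manually written counterpart over a large word list.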