Object-oriented application frameworks
Communications of the ACM
Frameworks = (components + patterns)
Communications of the ACM
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
The Art of Software Architecture: Design Methods and Techniques
The Art of Software Architecture: Design Methods and Techniques
Developing Reusable and Robust Language Processing Components for Information Systems using GATE
DEXA '02 Proceedings of the 13th International Workshop on Database and Expert Systems Applications
Natural Language Engineering
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Integrated information management: an interactive, extensible architecture for information retrieval
HLT '01 Proceedings of the first international conference on Human language technology research
A Decomposition Scheme Based on Error-Correcting Output Codes for Ensembles of Text Categorisers
ICITA '05 Proceedings of the Third International Conference on Information Technology and Applications (ICITA'05) Volume 2 - Volume 02
Intrusion detection in web applications using text mining
Engineering Applications of Artificial Intelligence
Automatic text classification to support systematic reviews in medicine
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Information systems are using an increasing amount of unstructured information in the form of text. This situation has spawned a need to improve the text-mining technologies needed for information retrieval, filtering, and classification. This article compares some of the options available and how they can provide textual data-mining functionalities to software applications. In particular, the authors focus on Pimiento, a new object-oriented application framework for text mining. This framework allows developers to easily create distributed applications that use machine learning and statistical techniques to automatically process documents.