Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Information retrieval
The process of knowledge discovery in databases
Advances in knowledge discovery and data mining
Data mining: practical machine learning tools and techniques with Java implementations
Data mining: practical machine learning tools and techniques with Java implementations
Data Mining Techniques: For Marketing, Sales, and Customer Support
Data Mining Techniques: For Marketing, Sales, and Customer Support
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
On Clustering Validation Techniques
Journal of Intelligent Information Systems
Machine Learning
Model-Based Hierarchical Clustering
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Taxonomies by the numbers: building high-performance taxonomies
Proceedings of the 14th ACM international conference on Information and knowledge management
Multi-taxonomy: Determining Perceived Brand Characteristics from Web Data
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Enabling analysts in managed services for CRM analytics
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
COBRA --- A Visualization Solution to Monitor and Analyze Consumer Generated Medias
Proceedings of the Symposium on Human Interface 2009 on Human Interface and the Management of Information. Information and Interaction. Part II: Held as part of HCI International 2009
Business insights workbench: an interactive insights discovery solution
Proceedings of the 2007 conference on Human interface: Part II
Expert Systems with Applications: An International Journal
A smarter process for sensing the information space
IBM Journal of Research and Development
Understanding honest feedbacks and opinions in academic environments
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
BISON: providing business information analysis as a service
EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Hi-index | 0.00 |
Taxonomies are meaningful hierarchical categorizations of documents into topics reflecting the natural relationships between the documents and their business objectives. Improving the quality of these taxonomies and reducing the overall cost required to create them is an important area of research. Supervised and unsupervised text clustering are important technologies that comprise only a part of a complete solution. However, there exists a great need for the ability for a human to efficiently interact with a taxonomy during the editing and validation phase. We have developed a comprehensive approach to solving this problem, and implemented this approach in a software tool called eClassifier. eClassifier provides features to help the taxonomy editor understand and evaluate each category of a taxonomy and visualize the relationships between the categories. Multiple techniques allow the user to make changes at both the category and document level. Metrics then establish how well the resultant taxonomy can be modeled for future document classification. In this paper, we present a comprehensive set of viewing, editing and validation techniques we have implemented in the Lotus Discovery Server resulting in a significant reduction in the time required to create a quality taxonomy.