Novel filing systems applicable to an automated office: a state-of-the-art study
Information Processing and Management: an International Journal
Implementing agglomerative hierarchic clustering algorithms for use in document retrieval
Information Processing and Management: an International Journal
Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Diversity in the use of electronic mail: a preliminary inquiry
ACM Transactions on Information Systems (TOIS)
Self-organization and associative memory: 3rd edition
Self-organization and associative memory: 3rd edition
Scatter/Gather: a cluster-based approach to browsing large document collections
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance feedback and inference networks
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
An example-based mapping method for text categorization and retrieval
ACM Transactions on Information Systems (TOIS)
Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Context as a factor in personal information management systems
Journal of the American Society for Information Science
A comparison of classifiers and document representations for the routing problem
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Siteseer: personalized navigation for the Web
Communications of the ACM
The effect of accessing nonmatching documents on relevance feedback
ACM Transactions on Information Systems (TOIS)
Hierarchic document classification using Ward's clustering method
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
User-oriented document clustering: a framework for learning in information retrieval
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Self-organizing maps
Fast and effective text mining using linear-time document clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Document clustering for electronic meetings: an experimental comparison of two techniques
Decision Support Systems - From information retrieval to knowledge management: enabling technologies and best practices
Partitioning-based clustering for Web document categorization
Decision Support Systems - Special issue on WITS '97
A semi-supervised document clustering technique for information organization
Proceedings of the ninth international conference on Information and knowledge management
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
An effective document clustering method using user-adaptable distance metrics
Proceedings of the 2002 ACM symposium on Applied computing
Document clustering with committees
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws
Journal of the American Society for Information Science and Technology
HICSS '04 Proceedings of the Proceedings of the 37th Annual Hawaii International Conference on System Sciences (HICSS'04) - Track 5 - Volume 5
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Combining preference- and content-based approaches for improving document clustering effectiveness
Information Processing and Management: an International Journal
Verifying the proximity and size hypothesis for self-organizing maps
Journal of Management Information Systems - Special section: Exploring the outlands of the MIS discipline
Generating and Browsing Multiple Taxonomies Over a Document Collection
Journal of Management Information Systems
A collaborative filtering-based approach to personalized document clustering
Decision Support Systems
Journal of Management Information Systems
Managing Word Mismatch Problems in Information Retrieval: A Topic-Based Query Expansion Approach
Journal of Management Information Systems
Preserving User Preferences in Automated Document-Category Management: An Evolution-Based Approach
Journal of Management Information Systems
Discovering event episodes from news corpora: a temporal-based approach
Proceedings of the 11th International Conference on Electronic Commerce
A knowledge-based model using ontologies for personalized web information gathering
Web Intelligence and Agent Systems
Assessing the severity of phishing attacks: A hybrid data mining approach
Decision Support Systems
Expert Systems with Applications: An International Journal
A Data-Driven Approach to Measure Web Site Navigability
Journal of Management Information Systems
Hi-index | 0.00 |
As electronic commerce and knowledge economy environments proliferate, both individuals and organizations increasingly generate and consume large amounts of online information, typically available as textual documents. To manage this ever-increasing volume of documents, individuals and organizations frequently organize their documents into categories that facilitate document management and subsequent access and browsing. Document clustering is an intentional act that should reflect individual preferences with regard to the semantic coherency and relevant categorization of documents. Hence, effective document clustering must consider individual preferences and needs to support personalization in document categorization. In this paper, we present an automatic document-clustering approach that incorporates an individual's partial clustering as preferential information. Combining two document representation methods, feature refinement and feature weighting, with two clustering methods, precluster-based hierarchical agglomerative clustering (HAC) and atomic-based HAC, we establish four personalized document-clustering techniques. Using a traditional content-based document-clustering technique as a performance benchmark, we find that the proposed personalized document-clustering techniques improve clustering effectiveness, as measured by cluster precision and cluster recall.