Incremental clustering for dynamic information processing

Authors:
Fazli Can
Affiliations:
Miami Univ., Oxford, OH
Venue:
ACM Transactions on Information Systems (TOIS)
Year:
1993

Citing 21
Cited 38

Access methods for text

ACM Computing Surveys (CSUR) - Annals of discrete mathematics, 24
The effectiveness and efficiency of agglomerative hierarchic clustering in document retrieval

The effectiveness and efficiency of agglomerative hierarchic clustering in document retrieval
A dynamic cluster maintenance system for information retrieval

SIGIR '87 Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval techniques

Annual review of information science and technology, vol. 22
Algorithms for clustering data

Algorithms for clustering data
Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Recent trends in hierarchic document clustering: a critical review

Information Processing and Management: an International Journal
Comparison of hierarchic agglomerative clustering methods for document retrieval

The Computer Journal
Dynamic cluster maintenance

Information Processing and Management: an International Journal
Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
Hypertext and hypermedia

Hypertext and hypermedia
Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases

ACM Transactions on Database Systems (TODS)
The efficiency of inverted index and cluster searches

Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Optimization of inverted vector searches

SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Adaptive document clustering

SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Generation and search of clustered files

ACM Transactions on Database Systems (TODS)
Approximating block accesses in database organizations

Communications of the ACM
The best-match problem in document retrieval

Communications of the ACM
Information Retrieval

Information Retrieval
Information Retrieval: Computational and Theoretical Aspects

Information Retrieval: Computational and Theoretical Aspects
Dynamic information and library processing

Dynamic information and library processing

Development of a modern OPAC: from REVTOLC to MARIAN

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
HypIR: a hypertext-based approach to information retrieval

SAC '93 Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: states of the art and practice
Text to hypertext: can clustering solve the problem in digital libraries?

Proceedings of the first ACM international conference on Digital libraries
Incremental clustering and dynamic information retrieval

STOC '97 Proceedings of the twenty-ninth annual ACM symposium on Theory of computing
Discovering similar resources by content part-linking

CIKM '97 Proceedings of the sixth international conference on Information and knowledge management
Static and dynamic information organization with star clusters

Proceedings of the seventh international conference on Information and knowledge management
Efficient algorithms for geometric optimization

ACM Computing Surveys (CSUR)
A practical clustering algorithm for static and dynamic information organization

Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Data clustering: a review

ACM Computing Surveys (CSUR)
Using star clusters for filtering

Proceedings of the ninth international conference on Information and knowledge management
An On-Line Document Clustering Method Based on Forgetting Factors

ECDL '01 Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries
On the quality of ART1 text clustering

Neural Networks - 2003 Special issue: Advances in neural networks research — IJCNN'03
Improvement of Precision and Recall for Information Retrieval in a Narrow Domain: Reuse of Concepts by Formal Concept Analysis

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Efficiency and effectiveness of query processing in cluster-based retrieval

Information Systems
Clustering in Dynamic Spatial Databases

Journal of Intelligent Information Systems
Unsupervised clustering on dynamic databases

Pattern Recognition Letters
Autonomous authoring tools for hypertext

ACM Computing Surveys (CSUR)
Towards a machine learning approach based on incremental concept formation

Intelligent Data Analysis
An investigation into the stability of contextual document clustering

Journal of the American Society for Information Science and Technology
Incremental cluster-based retrieval using compressed cluster-skipping inverted files

ACM Transactions on Information Systems (TOIS)
Ontology construction and concept reuse with formal concept analysis for improved web document retrieval

Web Intelligence and Agent Systems
Incremental clustering of mixed data based on distance hierarchy

Expert Systems with Applications: An International Journal
Bilkent news portal: a personalizable system with new event detection and tracking capabilities

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Online Outlier Detection Based on Relative Neighbourhood Dissimilarity

WISE '08 Proceedings of the 9th international conference on Web Information Systems Engineering
Aggregated cross-media news visualization and personalization

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Cover Coefficient-Based Multi-document Summarization

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Efficient parallel Text Retrieval techniques on Bulk Synchronous Parallel (BSP)/Coarse Grained Multicomputers (CGM)

The Journal of Supercomputing
Enhancing an Incremental Clustering Algorithm for Web Page Collections

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
A text mining approach for automatic construction of hypertexts

Expert Systems with Applications: An International Journal
New event detection and topic tracking in Turkish

Journal of the American Society for Information Science and Technology
RSMAT: Robust simultaneous modeling and tracking

Pattern Recognition Letters
XML data clustering: An overview

ACM Computing Surveys (CSUR)
A flexible architecture integrating monitoring and analytics for managing large-scale data centers

Proceedings of the 8th ACM international conference on Autonomic computing
Activity knowledge transfer in smart environments

Pervasive and Mobile Computing
Approximate kernel k-means: solution to large scale kernel clustering

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Modified adaptive resonance theory network for mixed data based on distance hierarchy

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part IV
BayesTH-MCRDR algorithm for automatic classification of web document

AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
When Is the Right Time to Refresh Knowledge Discovered from Data?

Operations Research

Quantified Score

Hi-index	0.03

Visualization

Abstract

Clustering of very large document databases is useful for both searching and browsing. The periodic updating of clusters is required due to the dynamic nature of databases. An algorithm for incremental clustering is introduced. The complexity and cost analysis of the algorithm together with an investigation of its expected behavior are presented. Through empirical testing it is shown that the algorithm achieves cost effectiveness and generates statistically valid clusters that are compatible with those of reclustering. The experimental evidence shows that the algorithm creates an effective and efficient retrieval environment.