Applying biclustering to text mining: an immune-inspired approach

Authors:
Pablo A. D. de Castro;Fabrício O. de França;Hamilton M. Ferreira;Fernando J. Von Zuben
Affiliations:
Laboratory of Bioinformatics and Bio-Inspired Computing, School of Electrical and Computer Engineer, University of Campinas, Campinas, SP, Brazil;Laboratory of Bioinformatics and Bio-Inspired Computing, School of Electrical and Computer Engineer, University of Campinas, Campinas, SP, Brazil;Laboratory of Bioinformatics and Bio-Inspired Computing, School of Electrical and Computer Engineer, University of Campinas, Campinas, SP, Brazil;Laboratory of Bioinformatics and Bio-Inspired Computing, School of Electrical and Computer Engineer, University of Campinas, Campinas, SP, Brazil
Venue:
ICARIS'07 Proceedings of the 6th international conference on Artificial immune systems
Year:
2007

Citing 7
Cited 7

Using collaborative filtering to weave an information tapestry

Communications of the ACM - Special issue on information filtering
Automatic subspace clustering of high dimensional data for data mining applications

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Co-clustering documents and words using bipartite spectral graph partitioning

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Clustering by pattern similarity in large data sets

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Biclustering of Expression Data

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Interrelated Two-way Clustering: An Unsupervised Approach for Gene Expression Data Analysis

BIBE '01 Proceedings of the 2nd IEEE International Symposium on Bioinformatics and Bioengineering
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

A Multi-Objective Multipopulation Approach for Biclustering

ICARIS '08 Proceedings of the 7th international conference on Artificial Immune Systems
Improving a multi-objective multipopulation artificial immune network for biclustering

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Microarray data biclustering with multi-objective immune algorithm

ICNC'09 Proceedings of the 5th international conference on Natural computation
Query expansion using an immune-inspired biclustering algorithm

Natural Computing: an international journal
Mining coherent biclusters with fish school search

ICSI'11 Proceedings of the Second international conference on Advances in swarm intelligence - Volume Part II
A novel clustering and verification based microarray data bi-clustering method

ICSI'10 Proceedings of the First international conference on Advances in Swarm Intelligence - Volume Part II
Biclustering and subspace learning with regularization for financial risk analysis

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part III

Quantified Score

Hi-index	0.00

Visualization

Abstract

With the rapid development of information technology, computers are proving to be a fundamental tool for the organization and classification of electronic texts, given the huge amount of available information. The existent methodologies for text mining apply standard clustering algorithms to group similar texts. However, these algorithms generally take into account only the global similarities between the texts and assign each one to only one cluster, limiting the amount of information that can be extracted from the texts. An alternative proposal capable of solving these drawbacks is the biclustering technique. The biclustering is able to perform clustering of rows and columns simultaneously, allowing a more comprehensive analysis of the texts. The main contribution of this paper is the development of an immune-inspired biclustering algorithm to carry out text mining, denoted BIC-aiNet. BIC-aiNet interprets the biclustering problem as several two-way bipartition problems, instead of considering a single two-way permutation framework. The experimental results indicate that our proposal is able to group similar texts efficiently and extract implicit useful information from groups of texts.