Data Mining by Decomposition: Adaptive Search for Hypothesis Generation

Authors:
Hemant K. Bhargava
Affiliations:
-
Venue:
INFORMS Journal on Computing
Year:
1999

Citing 0
Cited 1

The use of various data mining and feature selection methods in the analysis of a population survey dataset

AIDM '07 Proceedings of the 2nd international workshop on Integrating artificial intelligence and data mining - Volume 84

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data mining methods search large databases for interesting patterns that may lead to useful decisions in organizations. When the database is defined over scores of attributes, the complexity of the search increases due to the combinatorial explosion at the attribute-space level, because billions of attribute subsets are candidates for forming interesting patterns in the database. A useful way to address this complexity is to partition the search problem and apply separate, but intertwined, algorithms for attribute search and pattern search. A genetic algorithm is applied on the attribute search problem to identify subsets that lead to more interesting patterns. This method is applied on a real world database arising from the investigations into the "Persian Gulf Illness." Computational experiments resulted in significant success compared to random or manual attribute selection.