A parallel method for computing rough set approximations

Authors:
Junbo Zhang;Tianrui Li;Da Ruan;Zizhe Gao;Chengbing Zhao
Affiliations:
School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China;School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China;Belgian Nuclear Research Centre (SCKCEN), Boeretang 200, 2400 Mol, Belgium and Department of Applied Mathematics and Computer Science, Ghent University, 9000 Gent, Belgium;School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China;School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China
Venue:
Information Sciences: an International Journal
Year:
2012

Citing 34
Cited 6

Rough Sets: Theoretical Aspects of Reasoning about Data

Rough Sets: Theoretical Aspects of Reasoning about Data
A Fast Parallel Clustering Algorithm for Large Spatial Databases

Data Mining and Knowledge Discovery
Deriving two-stage learning sequences from knowledge in fuzzy sequential pattern mining

Information Sciences—Informatics and Computer Science: An International Journal
Mining massive document collections by the WEBSOM method

Information Sciences: an International Journal - Special issue: Soft computing data mining
Algorithms for mining association rules in bag databases

Information Sciences—Informatics and Computer Science: An International Journal
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
A rough sets based characteristic relation approach for dynamic attribute generalization in data mining

Knowledge-Based Systems
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

Pattern Recognition
Evaluating MapReduce for Multi-core and Multiprocessor Systems

HPCA '07 Proceedings of the 2007 IEEE 13th International Symposium on High Performance Computer Architecture
Neighborhood classifiers

Expert Systems with Applications: An International Journal
Google's MapReduce programming model – Revisited

Science of Computer Programming
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
Mixed feature selection based on granulation and approximation

Knowledge-Based Systems
Neighborhood rough set based heterogeneous feature subset selection

Information Sciences: an International Journal
MapReduce for Data Intensive Scientific Analyses

ESCIENCE '08 Proceedings of the 2008 Fourth IEEE International Conference on eScience
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Parallel K-Means Clustering Based on MapReduce

CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
MGRS: A multi-granulation rough set

Information Sciences: an International Journal
Looking into the seeds of time: Discovering temporal patterns in large transaction sets

Information Sciences: an International Journal
Hadoop: The Definitive Guide

Hadoop: The Definitive Guide
A hybrid model based on rough sets theory and genetic algorithms for stock price forecasting

Information Sciences: an International Journal
A Dominance-based Rough Set Approach to customer behavior in the airline market

Information Sciences: an International Journal
Positive approximation: An accelerator for attribute reduction in rough set theory

Artificial Intelligence
Parallelizing XML data-streaming workflows via MapReduce

Journal of Computer and System Sciences
VDB-MR: MapReduce-based distributed data integration using virtual database

Future Generation Computer Systems
A rough set based dynamic maintenance approach for approximations in coarsening and refining attribute values

International Journal of Intelligent Systems
Sequential covering rule induction algorithm for variable consistency rough set approaches

Information Sciences: an International Journal
Parallel K-means clustering of remote sensing images based on mapreduce

WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
Attribute reduction for massive data based on rough set theory and MapReduce

RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
Scheduling divisible MapReduce computations

Journal of Parallel and Distributed Computing
Positive approximation and converse approximation in interval-valued fuzzy rough sets

Information Sciences: an International Journal
Incremental learning optimization on knowledge discovery in dynamic business intelligent systems

Journal of Global Optimization
Incomplete Multigranulation Rough Set

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans

Information-theoretic measures associated with rough set approximations

Information Sciences: an International Journal
Parallel rough set based knowledge acquisition using MapReduce from big data

Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Composite rough sets

AICI'12 Proceedings of the 4th international conference on Artificial Intelligence and Computational Intelligence
Dynamic maintenance of approximations in set-valued ordered decision systems under the attribute generalization

Information Sciences: an International Journal
Speeding-up codon analysis on the cloud with local MapReduce aggregation

Information Sciences: an International Journal
A comparison of parallel large-scale knowledge acquisition using rough set theory on different MapReduce runtime systems

International Journal of Approximate Reasoning

Quantified Score

Hi-index	0.07

Visualization

Abstract

Massive data mining and knowledge discovery present a tremendous challenge with the data volume growing at an unprecedented rate. Rough set theory has been successfully applied in data mining. The lower and upper approximations are basic concepts in rough set theory. The effective computation of approximations is vital for improving the performance of data mining or other related tasks. The recently introduced MapReduce technique has gained a lot of attention from the scientific community for its applicability in massive data analysis. This paper proposes a parallel method for computing rough set approximations. Consequently, algorithms corresponding to the parallel method based on the MapReduce technique are put forward to deal with the massive data. An extensive experimental evaluation on different large data sets shows that the proposed parallel method is effective for data mining.