Distributed learning with data reduction

Authors:
Ireneusz Czarnowski
Affiliations:
Department of Information Systems, Gdynia Maritime University, Gdynia, Poland
Venue:
Transactions on computational collective intelligence IV
Year:
2011

Citing 137
Cited 2

Simplifying decision trees

International Journal of Man-Machine Studies - Special Issue: Knowledge Acquisition for Knowledge-based Systems. Part 5
Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems

Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Instance-Based Learning Algorithms

Machine Learning
A practical approach to feature selection

ML92 Proceedings of the ninth international workshop on Machine learning
C4.5: programs for machine learning

C4.5: programs for machine learning
A Weighted Nearest Neighbor Algorithm for Learning with Symbolic Features

Machine Learning
Experiments on multistrategy learning by meta-learning

CIKM '93 Proceedings of the second international conference on Information and knowledge management
Blackboard systems

Blackboard systems
Democracy in neural nets: voting schemes for classification

Neural Networks
Evaluation and Selection of Biases in Machine Learning

Machine Learning - Special issue on bias evaluation and selection
LVQ combined with simulated annealing for optimal design of large-set reference models

Neural Networks
Learning in the presence of concept drift and hidden contexts

Machine Learning
Genetic algorithms + data structures = evolution programs (3rd ed.)

Genetic algorithms + data structures = evolution programs (3rd ed.)
Tolerance approximation spaces

Fundamenta Informaticae - Special issue: rough sets
Selection of relevant features and examples in machine learning

Artificial Intelligence - Special issue on relevance
Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
The Perceptron algorithm versus Winnow: linear versus logarithmic mistake bounds when few input variables are relevant

Artificial Intelligence - Special issue on relevance
Lazy learning

Lazy learning
On Combining Classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Data preparation for data mining

Data preparation for data mining
Collaboration rules for autonomous software agents

Decision Support Systems - Special issue on restructuring the electric power business—a new paradigm for reducing regulation
Papyrus: a system for data mining over local and wide area clusters and super-clusters

SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Nearest neighbor classifier: simultaneous editing and feature selection

Pattern Recognition Letters - Special issue on pattern recognition in practice VI
Using analytic QP and sparseness to speed training of support vector machines

Proceedings of the 1998 conference on Advances in neural information processing systems II
Reduction Techniques for Instance-BasedLearning Algorithms

Machine Learning
Data mining: concepts and techniques

Data mining: concepts and techniques
Rough set algorithms in classification problem

Rough set methods and applications
A component-based architecture for problem solving environments

Mathematics and Computers in Simulation - IMACS sponsored special issue: 1999 international symposium on computational sciences, to honor John R. Rice
Radial basis function networks 1: recent developments in theory and applications

Radial basis function networks 1: recent developments in theory and applications
The distributed boosting algorithm

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Bayesian Networks and Decision Graphs

Bayesian Networks and Decision Graphs
Learning with Nested Generalized Exemplars

Learning with Nested Generalized Exemplars
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Machine Learning

Machine Learning
Instance Selection and Construction for Data Mining

Instance Selection and Construction for Data Mining
Boosting Algorithms for Parallel and Distributed Learning

Distributed and Parallel Databases - Special issue: Parallel and distributed data mining
Imputation of Missing Data in Industrial Databases

Applied Intelligence
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Asynchronous Teams: Cooperation Schemes for Autonomous Agents

Journal of Heuristics
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Strategies for Parallel Data Mining

IEEE Concurrency
The CN2 Induction Algorithm

Machine Learning
Distributed learning with bagging-like performance

Pattern Recognition Letters
Synthesizing High-Frequency Rules from Different Data Sources

IEEE Transactions on Knowledge and Data Engineering
Extending Learning to Multiple Agents: Issues and a Model for Multi-Agent Machine Learning (MA-ML)

EWSL '91 Proceedings of the European Working Session on Machine Learning
Model Combination in the Multiple-Data-Batches Scenario

ECML '97 Proceedings of the 9th European Conference on Machine Learning
Techniques for Estimating the Computation and Communication Costs of Distributed Data Mining

ICCS '02 Proceedings of the International Conference on Computational Science-Part I
Refining Initial Points for K-Means Clustering

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Creating Ensembles of Classifiers

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Footprint-Based Retrieval

ICCBR '99 Proceedings of the Third International Conference on Case-Based Reasoning and Development
Improvements in K-Nearest Neighbor Classification

ICAPR '01 Proceedings of the Second International Conference on Advances in Pattern Recognition
Identifying Relevant Databases for Multidatabase Mining

PAKDD '98 Proceedings of the Second Pacific-Asia Conference on Research and Development in Knowledge Discovery and Data Mining
Learning Network Designs for Asynchronous Teams

Proceedings of the 8th European Workshop on Modelling Autonomous Agents in a Multi-Agent World: Multi-Agent Rationality
Data Complexity Analysis for Classifier Combination

MCS '01 Proceedings of the Second International Workshop on Multiple Classifier Systems
Parallel and Distributed Data Mining: An Introduction

Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
A-Teams: An Agent Architecture for Optimization and Decision Support

ATAL '98 Proceedings of the 5th International Workshop on Intelligent Agents V, Agent Theories, Architectures, and Languages
The Right Agent (Architecture) to do the Right Thing

ATAL '98 Proceedings of the 5th International Workshop on Intelligent Agents V, Agent Theories, Architectures, and Languages
SBL-PM: A Simple Algorithm for Selection of Reference Instances for Similarity Based Methods

Proceedings of the IIS'2000 Symposium on Intelligent Information Systems
Thesis: clustering and instance based learning in first order logic

AI Communications
Artificial Intelligence: A Modern Approach

Artificial Intelligence: A Modern Approach
Feature ranking in rough sets

AI Communications - Special issue on Artificial intelligence advances in China
Assessing semantic similarity among spatial entity classes

Assessing semantic similarity among spatial entity classes
Feature Weighting and Instance Selection for Collaborative Filtering: An Information-Theoretic Approach

Knowledge and Information Systems
Clustering classifiers for knowledge discovery from physically distributed databases

Data & Knowledge Engineering
Multiagent Collaborative Learning for Distributed Business Systems

AAMAS '04 Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 3
Communication Efficient Construction of Decision Trees Over Heterogeneously Distributed Data

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Model Averaging for Prediction with Discrete Bayesian Networks

The Journal of Machine Learning Research
Efficient Feature Selection via Analysis of Relevance and Redundancy

The Journal of Machine Learning Research
Learning classifiers from distributed, semantically heterogeneous, autonomous data sources

Learning classifiers from distributed, semantically heterogeneous, autonomous data sources
Modifications of the fuzzy-artmap algorithm for distributed learning in large data sets

Modifications of the fuzzy-artmap algorithm for distributed learning in large data sets
Handbook Of Bioinspired Algorithms And Applications (Chapman & Hall/Crc Computer & Information Science)

Handbook Of Bioinspired Algorithms And Applications (Chapman & Hall/Crc Computer & Information Science)
Scalable Representative Instance Selection and Ranking

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 03
JADE-Based A-Team as a Tool for Implementing Population-Based Algorithms

ISDA '06 Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications - Volume 03
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Algorithms for Feature Selection: An Evaluation

ICPR '96 Proceedings of the 13th International Conference on Pattern Recognition - Volume 2
Feature selection based on rough sets and particle swarm optimization

Pattern Recognition Letters
Innovations in multi-agent systems

Journal of Network and Computer Applications
A Framework for Learning from Distributed Data Using Sufficient Statistics and Its Application to Learning Decision Trees

International Journal of Hybrid Intelligent Systems
Application of elitist multi-objective genetic algorithm for classification rule generation

Applied Soft Computing
Learning drifting concepts: Example selection vs. example weighting

Intelligent Data Analysis
Selecting representative examples and attributes by a genetic algorithm

Intelligent Data Analysis
Finding Prototypes For Nearest Neighbor Classifiers

IEEE Transactions on Computers
Data mining for agent reasoning: A synergy for training intelligent agents

Engineering Applications of Artificial Intelligence
Machine learning: a review of classification and combining techniques

Artificial Intelligence Review
Efficient Dimensionality Reduction Approaches for Feature Selection

ICCIMA '07 Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007) - Volume 02
Introduction to Information Retrieval

Introduction to Information Retrieval
An Agent-Based Approach to the Multiple-Objective Selection of Reference Vectors

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Data Reduction Algorithm for Machine Learning and Data Mining

IEA/AIE '08 Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence
A Framework for Adaptive and Integrated Classification

ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
Active Learning from Data Streams

ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Adaptive Mechanisms for Classification Problems with Drifting Data

KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
Particle swarm optimization for prototype reduction

Neurocomputing
Cluster-based under-sampling approaches for imbalanced data distributions

Expert Systems with Applications: An International Journal
A search space reduction methodology for data mining in large databases

Engineering Applications of Artificial Intelligence
A-Team Middleware on a Cluster

KES-AMSTA '09 Proceedings of the Third KES International Symposium on Agent and Multi-Agent Systems: Technologies and Applications
Multiagent Framework for Bio-data Mining

RSKT '09 Proceedings of the 4th International Conference on Rough Sets and Knowledge Technology
Learning from Imbalanced Data

IEEE Transactions on Knowledge and Data Engineering
Relevance and Redundancy Analysis for Ensemble Classifiers

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Data Mining and Multi-agent Integration

Data Mining and Multi-agent Integration
EMADS: An extendible multi-agent data miner

Knowledge-Based Systems
Improved heterogeneous distance functions

Journal of Artificial Intelligence Research
Solving multiclass learning problems via error-correcting output codes

Journal of Artificial Intelligence Research
The COMPSET algorithm for subset selection

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A study of cross-validation and bootstrap for accuracy estimation and model selection

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining

Applied Soft Computing
A collaborative training algorithm for distributed learning

IEEE Transactions on Information Theory
A-Teams and Their Applications

ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
An Efficient Feature Selection Using Ant Colony Optimization Algorithm

ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part II
Distributed data mining and agents

Engineering Applications of Artificial Intelligence
Combining Distributed Classifies by Stacking

WGEC '09 Proceedings of the 2009 Third International Conference on Genetic and Evolutionary Computing
Implementation and performance evaluation of the agent-based algorithm for ANN training

International Journal of Knowledge-based and Intelligent Engineering Systems
Prototype selection algorithms for distributed learning

Pattern Recognition
Agent-based distributed data mining: the KDEC scheme

Intelligent information agents
Evaluating learning algorithms and classifiers

International Journal of Intelligent Information and Database Systems
MALEF: Framework for distributed machine learning and data mining

International Journal of Intelligent Information and Database Systems
An A-Team approach to learning classifiers from distributed data sources

International Journal of Intelligent Information and Database Systems
Learning with many irrelevant features

AAAI'91 Proceedings of the ninth National conference on Artificial intelligence - Volume 2
Boosting support vector machines for imbalanced data sets

Knowledge and Information Systems
Multi-sorting algorithm for finding pairs of similar short substrings from large-scale string data

Knowledge and Information Systems - Special Issue:Best Papers from the 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2008);Guest Editors: Takashi Washio, Einoshin Suzuki and Kai Ming Ting
Distributed data mining system based on multi-agent communication mechanism

KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Feature set reduction by evolutionary selection and construction

KES-AMSTA'10 Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part II
Scaling up: distributed machine learning with cooperation

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Bagging, boosting, and C4.S

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
An agent-based framework for distributed learning

Engineering Applications of Artificial Intelligence
Cellular GEP-induced classifiers

ICCCI'10 Proceedings of the Second international conference on Computational collective intelligence: technologies and applications - Volume PartI
JABAT middleware as a tool for solving optimization problems

Transactions on computational collective intelligence II
An agent-based PLA for the cascade correlation learning architecture

ICANN'05 Proceedings of the 15th international conference on Artificial neural networks: formal models and their applications - Volume Part II
Approximation spaces in machine learning and pattern recognition

PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
JADE-Based a-team environment

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Cluster-based instance selection for machine classification

Knowledge and Information Systems
The curse of dimensionality in data mining and time series prediction

IWANN'05 Proceedings of the 8th international conference on Artificial Neural Networks: computational Intelligence and Bioinspired Systems
Nearest prototype classification: clustering, genetic algorithms, or random search?

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Adaptive integrated image segmentation and object recognition

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study

IEEE Transactions on Evolutionary Computation
Ant system: optimization by a colony of cooperating agents

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Fast accurate fuzzy clustering through data reduction

IEEE Transactions on Fuzzy Systems

Machine learning and agents

KES-AMSTA'11 Proceedings of the 5th KES international conference on Agent and multi-agent systems: technologies and applications
Experimental evaluation of the agent-based population learning algorithm for the cluster-based instance selection

ICCCI'11 Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

The work deals with the distributed machine learning. Distributed learning from data is considered to be an important challenge faced by researchers and practice in the domain of the distributed data mining and distributed knowledge discovery from databases. Currently, learning from data is recognized as one of the most widely investigated paradigms of machine learning. At the same time it is perceived as a difficult and demanding computational problem. Even more complex and still to a large extent open is learning from the distributed data. One of the approaches suitable for learning from the geographically distributed data is to select from the local databases relevant local patterns, called also prototypes. Such prototypes are selected using some specialized data reduction methods. The dissertation contains an overview of the problem of learning classifiers from data, followed by a discussion of the distributed learning. The above includes the problem formulation and the state-of-the-art review. Next, data reduction, approaches, techniques and algorithms are discussed. The central part of the dissertation proposes an agent-based distributed learning framework. The idea is to carry-out data reduction in parallel in separate locations, employing specialized software agents. The process ends when locally selected prototypes are moved to a central site and merged into the global knowledge model. The following part of the work contains the results of an extensive computational experiment aiming at validation of the proposed approach. Finally, conclusions and suggestions for further research are formulated.