A single-pass online data mining algorithm combined with control theory with limited memory in dynamic data streams

  • Authors:
  • Yanxiang He;Naixue Xiong;Xavier Défago;Yan Yang;Jing He

  • Affiliations:
  • The State Key Lab of Software Engineering, Computer School, Wuhan University, P.R. China;The State Key Lab of Software Engineering, Computer School, Wuhan University, P.R. China;School of Information Science, Japan Advanced Institute of Science and Technology (JAIST), Japan;Computer School, Wuhan university of science and technology, P.R. China;Department of Computer Science, Utah State University, Logan, UT

  • Venue:
  • GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses a fundamental problem that arises in data streaming scenarios, namely, today’s data mining is ill-equipped to handle data streams effectively, and pays little attention to the network stability and the fast response [36]. To the question, we present a control-theoretic explicit rate (ER) online data mining control algorithm (ODMCA) to regulate the sending rate of mined data, which accounts for the main memory occupancies of terminal nodes. The proposed method uses a distributed proportional integrative plus derivative controller combined with data-mining, where the control parameters can be designed to ensure the stability of the control loop in terms of sending rate of mined data. We further analyze the theoretical aspects of the proposed algorithm, and simulation results show the efficiency of our scheme in terms of high main memory occupancy, fast response of the main memory occupancy as well as of the controlled sending rates.