Parallel scalable hardware implementation of asynchronous discrete particle swarm optimization

  • Authors:
  • Amin Farmahini-Farahani;Shervin Vakili;Sied Mehdi Fakhraie;Saeed Safari;Caro Lucas

  • Affiliations:
  • School of Electrical and Computer Engineering, University of Tehran, North Kargar Ave., Tehran 14395-515, Iran;School of Electrical and Computer Engineering, University of Tehran, North Kargar Ave., Tehran 14395-515, Iran;School of Electrical and Computer Engineering, University of Tehran, North Kargar Ave., Tehran 14395-515, Iran;School of Electrical and Computer Engineering, University of Tehran, North Kargar Ave., Tehran 14395-515, Iran;School of Electrical and Computer Engineering, University of Tehran, North Kargar Ave., Tehran 14395-515, Iran

  • Venue:
  • Engineering Applications of Artificial Intelligence
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a novel hardware framework of particle swarm optimization (PSO) for various kinds of discrete optimization problems based on the system-on-a-programmable-chip (SOPC) concept. PSO is a new optimization algorithm with a growing field of applications. Nevertheless, similar to the other evolutionary algorithms, PSO is generally a computationally intensive method which suffers from long execution time. Hence, it is difficult to use PSO in real-time applications in which reaching a proper solution in a limited time is essential. SOPC offers a platform to effectively design flexible systems with a high degree of complexity. A hardware pipelined PSO (PPSO) Core is applied with which the required computational operations of the algorithm are performed. Embedded processors have also been employed to evaluate the fitness values by running programmed software codes. Applying the subparticle method brings the benefit of full scalability to the framework and makes it independent of the particle length. Therefore, more complex and larger problems can be addressed without modifying the architecture of the framework. To speed up the computations, the optimization architecture is implemented on a single chip master-slave multiprocessor structure. Moreover, the asynchronous model of PSO gains parallel efficacy and provides an approach to update particles continuously. Five benchmarks are exploited to evaluate the effectiveness and robustness of the system. The results indicate a speed-up of up to 98 times over the software implementation in the elapsed computation time. Besides, the PPSO Core has been employed for neural network training in an SOPC-based embedded system which approves the system applicability for real-world applications.