Data mining and statistical techniques for characterizing the performance of thin-film photovoltaic modules

  • Authors:
  • Rafael Moreno Sáez;Mariano Sidrach-De-Cardona;Llanos Mora-López

  • Affiliations:
  • -;-;-

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2013

Quantified Score

Hi-index 12.05

Visualization

Abstract

A method for characterizing the performance ratio of thin-film photovoltaic modules based on the use of data mining and statistical techniques is developed. In general, this parameter changes when modules are working in outdoor conditions depending on irradiance, temperature, air mass and solar spectral irradiance distribution. The problem is that it is usually difficult to know how to include solar spectral irradiance information when estimating the performance of photovoltaic modules. We propose five different solar spectral irradiance distributions that summarize all the different distributions observed in Malaga. Using the probability distribution functions of these curves and a statistical test, we first checked when two spectral distributions measured can be considered to have the same contribution of energy per wavelength. Hence, using this test and the k-means data mining technique, all the measured spectra, more than two hundred and fifty thousand, are clustered in only five different groups. All the spectra in each cluster can be considered as equal and the k-means technique estimates one centroid for each cluster that corresponds to the cumulative probability distribution function that is the most similar to the rest of the samples in the cluster. The results obtained proves that 99.98% of the functions can be considered equal to the centroid of its cluster. With these five types of functions, we have explained the changes in the performance ratio measured for thin-film photovoltaic modules of different technologies.