ACM Computing Surveys (CSUR)
Data mining: concepts and techniques
Data mining: concepts and techniques
A data labeling method for clustering categorical data
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
Smart meter monitoring and data mining techniques for predicting refrigeration system performance
Expert Systems with Applications: An International Journal
Hi-index | 12.05 |
A method for characterizing the performance ratio of thin-film photovoltaic modules based on the use of data mining and statistical techniques is developed. In general, this parameter changes when modules are working in outdoor conditions depending on irradiance, temperature, air mass and solar spectral irradiance distribution. The problem is that it is usually difficult to know how to include solar spectral irradiance information when estimating the performance of photovoltaic modules. We propose five different solar spectral irradiance distributions that summarize all the different distributions observed in Malaga. Using the probability distribution functions of these curves and a statistical test, we first checked when two spectral distributions measured can be considered to have the same contribution of energy per wavelength. Hence, using this test and the k-means data mining technique, all the measured spectra, more than two hundred and fifty thousand, are clustered in only five different groups. All the spectra in each cluster can be considered as equal and the k-means technique estimates one centroid for each cluster that corresponds to the cumulative probability distribution function that is the most similar to the rest of the samples in the cluster. The results obtained proves that 99.98% of the functions can be considered equal to the centroid of its cluster. With these five types of functions, we have explained the changes in the performance ratio measured for thin-film photovoltaic modules of different technologies.