Pattern Learning and Decision Making in a Photovoltaic System

  • Authors:
  • Rongxin Li;Peter Wang

  • Affiliations:
  • Autonomous Systems Laboratory, CSIRO ICT Centre, Australia;Autonomous Systems Laboratory, CSIRO ICT Centre, Australia

  • Venue:
  • SEAL '08 Proceedings of the 7th International Conference on Simulated Evolution and Learning
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study the effects of different decision making schemes on theaccumulative rewards when photovoltaic (PV) facilities are intendedas a potential replacement for conventional peaking power plants.As the amount of solar irradiance usable by a PV module follows astochastic process, we compare the outcomes using the following twostrategies in a stochastic environment: (1) employing an optimaldecision making approach without any specific knowledge of theenvironment; and (2) optimal decision making based upon learningpatterns of the environment process. We examine the possibility ofintegrating a pattern learning approach --- called anε -Machine --- with a Partially Observable MarkovDecision Process (POMDP). This approach has been motivated in partby the fact that efforts in extending traditional learningapproaches to POMDPs have so far achieved only limited success. ThePV facility in our model consists of a PV panel and a battery, withan associated local, non-critical load. Under the assumption thatany PV generated power exceeding the maximum local consumptioncapacity must be dumped when the battery is full, the goal of theautonomous control agent is to maintain the maximum outputpotential to most effectively offset unexpected demand peaks, whileminimizing energy wastage in the presence of strong solarirradiance. The environment is assumed to follow a Markov processof a different order than the part of the system under theinfluence of the agent.