Predictive Modeling of Photovoltaic Solar Power Ge
Predictive Modeling of Photovoltaic Solar Power Ge
GIL-VERA V. D.
SISCO Research Group,
Luis Amigó Catholic University,
Trans. 51A N° 67 B-90, Medellín,
COLOMBIA
QUINTERO-LÓPEZ C.
NBA Research Group,
Luis Amigó Catholic University,
Trans. 51A N° 67 B-90, Medellín,
COLOMBIA
Abstract: - Photovoltaic solar power referred to as solar power using photovoltaic cells, is a renewable
energy source. The solar cells' electricity may be utilized to power buildings, neighborhoods, and even
entire cities. A stable and low-maintenance technology, photovoltaic solar power is an appealing
alternative for generating energy since it emits no greenhouse gases and has no moving components.
This paper aimed to provide a photovoltaic solar power generation forecasting model developed with
machine learning approaches and historical data. In conclusion, this type of predictive model enables
the evaluation of additional non-traditional sources of renewable energy, in this case, photovoltaic
solar power, which facilitates the planning process for the diversification of the energy matrix.
Random Forests obtain the highest performance, with this knowledge power systems operators may
forecast outcomes more precisely, this is the main contribution of this work.
Received: July 16, 2022. Revised: March 11, 2023. Accepted: April 9, 2023. Published: May 3, 2023.
The database's variables included the date and time The requirement to satisfy a particular energy
of each observation, the amount of DC power the demand unites the many uses for stand-alone
inverter produced in 15 minutes (in kW), the systems. Because of this, almost all standalone
amount of AC power the inverter (source key) systems have energy storage technology, [20].
produced in 15 minutes (kW), the total amount of According to their related applications, these
energy produced throughout the day, and the total systems may be divided into three groups:
return on investment. professional, rural electrification, and modest
The other sections of this work are: section 2 consumption. Little photovoltaic modules,
provides a broad background on photovoltaic solar frequently built of amorphous silicon, are used in
power, section 3 offers the problem formulation, modest consumption applications to power
section 4 generalization of ML, section 5 problem electronic devices like calculators or watches,
solution, section 6 discussion, and finally mobile phone chargers, tiny power tools, household
conclusions. beacons, etc., [21], [22].
In conclusion, predictive models provide There are many professional applications,
improved network management, detect the need for including radio links, cathodic protection of gas
panel cleaning/maintenance, and locate faulty or pipelines, hotels, traffic signals, air navigation,
inefficient equipment. Predictive modeling may be refrigeration of vaccines, equipment for remote data
used to forecast the long-term power reserve for PV acquisition and transmission, and even power
system design and size as well as minimize supply for satellites and other space equipment,
generation uncertainty. [23].
Due to the extremely high costs associated with
power failures in all of these applications, it is
2 Photovoltaic Solar Power typically chosen to add solar generators and
A set of electrical and electronic components called electrochemical accumulators that are bigger than
a photovoltaic array uses sun radiation to generate technically necessary, [24], [25]. This reduces the
electricity. The photovoltaic module, which is made likelihood of failure.
up of cells capable of converting incident light Often included in development cooperation
energy into direct electrical energy, is the main projects and funded by nonprofit organizations or
component of this system, [13], [14]. institutions like the World Bank or the European
The rest of the equipment in a photovoltaic Union, rural electrification systems provide energy
system is primarily determined by the system's to rural communities that are located distant from
intended use. Grid-connected, off-grid, and pumped traditional power lines, [26], [27]. Solar home
storage systems are the three basic categories under systems (SHS), hybrid power plants, and pumped-
which photovoltaic systems may be categorized. storage systems are the most common types of rural
Grid-connected systems generate electricity that electrification systems. Lighting devices, radio,
is supplied to the traditional grid; they are exempt television, and small power tools may all be
from the requirement for energy storage as they are powered by home systems and hybrid plants,
not directly responsible for meeting customer respectively, [28], [29].
demand or ensuring consumption, [15], [16]. These Domestic systems with 100 W or 200 W power
systems, which may be separated into ground- ratings are often found in a family home, however,
mounted systems and building-mounted systems, occasionally they can also be found in community
comprise inverter equipment that adjusts the centers or medical facilities. A rural village's
electricity supplied by the solar generator to the electrical grid is provided by hybrid power plants
circumstances of the traditional grid to permit the with a solar generator, an electrochemical
proper linkage with the electrical grid, [17]. accumulator, and a generator set or wind turbine.
Above-ground systems often have more power The size of these plants depends on the population
than 100 kW and are entirely intended for energy they serve, with capacity ranging from 10 kW to
generation and related economic efficiency. In- 100 kW, [30], [31]. Pumping systems employ a
building systems perform tasks in addition to motor pump to raise and move water from an
generating energy, such as replacing architectural aquifer to a reservoir or distribution system by using
elements, creating aesthetically pleasing effects, the electrical energy generated by the solar
shading glass, etc. They generally have power generator, [32]. These systems often store energy in
ratings of less than 100 kW and are smaller than the form of potential energy from the water stored in
ground-mounted systems, [18], [19]. the raised reservoir to minimize costs and boost
dependability. Pumping systems can be used to
2 pm. DC electricity is only at its peak on 2020-05- second stage is the data input, and the third stage is
25. Data provides us with a logistics-like function, the handling of the discrepancy between the
but after 18:00 the energy progressively declines estimated and measured data, [49]. By taking into
until breaking down completely at 00:00. As you account the fact that the deviation and forecasting
can see, certain daily yield dates (2020-02-06, 2020- abilities of the models depend not only on the
05-19, etc.) have a logistic shape with missing climatic conditions but also on the prediction
values, but not others (2020-02-06, 2020-05-19, horizon, criteria like the treatment of non-linearity,
etc.). Data are logged every 15 minutes, and then we the behavior when using multiple inputs, the
receive a fresh yield. Figure 2, presents the DC prediction horizon, the treatment of the deviation
Power Plot. associated with the prediction, and flexibility,
provide guidelines in model development, [50].
300000 Next, we provide a summary of the ML models
250000 examined in this work:
200000
4.1 Naïve-Bayes
150000 This probabilistic ML method is frequently
employed for classification problems. It is founded
100000
on the Bayes theorem, which calculates an event
50000 probability using information about prior
confounding variables. This model assumes that the
0
00:00 05:33 11:06 16:40 22:13 features used to categorize instances are
Fig. 2: DC Power Vs Time independent of one another in the context of
classification, i.e., one feature’s existence or
Figure 3 presents the Daily DC Power on each absence does not affect the presence or absence of
day of the considered period (34 days). any other feature, [51]. This model is based on the
Bayes Theorem.
𝑃(𝐵|𝐴). 𝑃(𝐴)
P(A|B) = (1)
𝑃(𝐵)
4.3 Support Vector Machine trained using many techniques, including ID3, C4.5,
This model is employed for classification and CART, and Random Forests, [55]. The objective is
regression problems. When dealing with high- to reduce the impurity or entropy of the subsets,
dimensional datasets and non-linear decision which quantifies the level of homogeneity of the
boundaries, SVMs are especially successful. class labels or values, [55]. In this model, entropy is
Finding the hyperplane that optimally separates the a measure of the randomness in the information
various classes in the input data is fundamental in being processed and information gain (IG) is a
this model. When there are more than two classes, decrease in entropy (equation 5).
the hyperplane may be a plane or a higher-
k
dimensional manifold, [53]. In a two-class
classification issue, the hyperplane is a line that IG = Entropy(𝑏𝑒𝑓𝑜𝑟𝑒) − ∑ Entropy(𝑗, 𝑎𝑓𝑡𝑒𝑟)
(5)
j=1
divides the two classes. Equation (3) presents a
feature vector.
4.6 Random Forest
This model is used for supervised learning tasks
𝑔(𝑥) = 𝑤 𝑡 𝑥 + 𝑏 (3) including classification, regression, and others. An
extensive number of decision trees are constructed
For a binary classification problem 𝐶1 , 𝐶2 , if during training using this ensemble learning
𝑔(𝑥) > 0 then x ∈ 𝐶1 , else x ∈ 𝐶2 . approach, which results in a class that reflects the
average of the predictions (regression) or
classifications produced by the individual trees.
4.4 Logistic Regression
This technique works by building a collection of
This model is a typical statistical learning approach
decision trees, each of which is trained using a
for binary classification problems, where the
portion of the input features and training data that is
objective is to predict the likelihood that an event
randomly chosen. The random forest aggregates all
will belong to one of two classes. A probability
of the individual trees’ predictions throughout the
score may be the threshold for a binary logistic
prediction phase to provide a final prediction, [56].
regression prediction result. The logistic function,
In this model, the Gini Index (equation 6) is used to
on which this model is built, converts an output
identify how much impurity has a particular node.
resulting from a linear combination of input data
into a probability score between 0 and 1. 𝑐
Weights are used to model the linear
combination of input characteristics, and they are 𝐺𝑖𝑛𝑖 𝐼𝑛𝑑𝑒𝑥 = 1 − ∑(𝑝𝑗 )2 (6)
learned from training data via maximum likelihood 𝑗=1
estimation, [54]. Equation (4) presents the general
equation of this model, where µ is the midpoint of Where 𝑝𝑗 is the proportion of samples belonging to
the curve and s is a scale parameter. class c for a given node.
𝑛
radiation that will arrive at a certain location at a
specific time using meteorological and satellite data.
𝑑 = √∑(𝑥𝑖 − 𝑦𝑖 )2 (7) Based on the amount of sunshine and the present
𝑖=1 weather, real-time PV monitors may also be used to
measure current generations and forecast future
𝑛 generations. Another approach is to estimate using
previous PV generation data from the same site and
𝑑 = ∑ | 𝑥𝑖 − 𝑦𝑖 | (8)
season. In this work, the latter is utilized. After
𝑖=1 cross-validation. Table 2 lists the optimal
parameters for each model examined.
𝑛 1/𝑞
Precision*
F1-Score*
Recall*
The models with the lowest ability to recognize F1-Score=.854); with this knowledge, power
negative instances (Specificity) were Naive Bayes systems operators may forecast outcomes with more
and Logistic Regression; for both of these models, precision.
this measure was below .50, making them This ML model is a robust algorithm that can
ineffective prediction models. The Logistic handle a large number of features, missing values,
Regression model has the lowest efficiency, with an and noisy data. It is less prone to overfitting than
Accuracy of .543. other algorithms, such as decision trees, thanks to
the use of bagging and feature randomness.
Random Forests is known for its high accuracy in
6 Discussion predicting outcomes, making it a popular choice for
Solar power projections will have a huge impact on many applications.
the future of large-scale renewable energy This ML model is adaptable and may be used for
installations. Predicting solar electricity generation both classification and regression problems. It can
is strongly reliant on changing weather patterns. offer crucial insights into the link between the
Beyond climatic and altitude factors, the total characteristics and the outcome, increasing the
amount of energy produced by a solar station model's interpretability. Overall, ML integration can
depends on its capacity; thus, the total amount of result in improved efficiency, reliability, and cost
energy produced by each solar power plant depends reductions in the photovoltaic power generation
on its capacity. sector.
A solar plant with a higher kilowatt peak (kWp)
capacity will produce more energy than a plant with
a lower kWp capacity. Because the variables that References:
affect PV power generation are unpredictable, it can [1] El-Hadary, M. I., Senthilraja, S., & Zayed, M.
be difficult to predict it with any degree of accuracy. E., “A hybrid system coupling spiral type
With the use of machine learning (ML), it is now solar photovoltaic thermal collector and
feasible to forecast power generation for the electrocatalytic hydrogen production cell:
upcoming few days to manage the grid better, Experimental investigation and numerical
recognize the need for panel cleaning and modeling,” Process Safety and Environmental
maintenance, and spot broken or underperforming Protection, vol. 170, p. 1101–1120, 2023.
machinery. DOI: 10.1016/j.psep.2022.12.079.
Future studies might concentrate on estimating [2] Mamatha, G., & Kulkarni, P. S., “Assessment
the yearly solar energy that a solar power plant is of floating solar photovoltaic potential in
anticipated to yield based on site attributes and local India’s existing hydropower reservoirs,”
meteorological data. Other environmental variables Energy for Sustainable Development, vol. 69,
that may impact daily solar power output, in p. 64–76, 2022. DOI:
addition to the height of a solar power station's 10.1016/j.esd.2022.05.011.
location, include temperature, wind speed, vapor [3] Shi, M., Lu, X., Jiang, H., Mu, Q., Chen, S.,
pressure, solar radiation, day length, precipitation, Fleming, R.M., Zhang, N., Wu, Y., & Foley,
and snowfall. A.M., “Opportunity of rooftop solar
photovoltaic as a cost-effective and
environment-friendly power source in
megacities,” iScience, vol. 25, No. 9, p.
7 Conclusion 104890, 2022. DOI:
Forecasting photovoltaic power output is crucial for
10.1016/j.isci.2022.104890.
grid planning and management because it enables
[4] Yang, H., & Wang, H., “Numerical simulation
operators to better plan and manage energy supply
of the dust particles deposition on solar
and demand, which leads to lower costs and higher
photovoltaic panels and its effect on power
system efficiency. Due to weather fluctuation, the
generation efficiency,” Renew Energy, vol.
difficulty of detecting solar radiation, system
201, p. 1111–1126, 2022. DOI:
capacity uncertainty, and a lack of historical data,
10.1016/j.renene.2022.11.043.
this task may be challenging. It helps to increase
[5] Veeramanikandan, M., Arjunan, T. V., &
prediction accuracy to utilize sensors and modeling.
Jidhesh, P., “Experimental investigation of
The Random Forests method demonstrated the
sandwich glazed solar photovoltaic module,”
highest performance (Accuracy=.843,
Mater Today Proc, vol. 27, p. 136–139, 2020.
Precision=.848, Recall=.875, Specificity=.793, and
DOI: 10.1016/j.matpr.2019.09.069.
[6] Lonergan, K.E., & Sansavini, G., “Business 52, p. 102168, 2022. DOI:
structure of electricity distribution system 10.1016/j.seta.2022.102168.
operator and effect on solar photovoltaic [14] Al-Ghussain, L., Taylan, O., Abujubbeh, M.,
uptake: An empirical case study for & Hassan, M.A., “Optimizing the orientation
Switzerland,” Energy Policy, vol. 160, p. of solar photovoltaic systems considering the
112683, 2022. DOI: effects of irradiation and cell temperature
10.1016/j.enpol.2021.112683. models with dust accumulation,” Solar
[7] Kumar, N., & Pal, N., “Location and Energy, vol. 249, p. 67–80, 2023. DOI:
orientation-based performance analysis of 10.1016/j.solener.2022.11.029.
4.98 kW solar photovoltaic system for [15] Nej, S.K., Sreejith, S., & Chakraborty, I.,
isolated Indian islands,” Sustainable Energy “Dual-Output Multistage Switched-Capacitor
Technologies and Assessments, vol. 52, p. Quadratic Boost (MSC-QBC) DC-DC
102138, 2022. DOI: Converter for Solar Photovoltaic
10.1016/j.seta.2022.102138. Application,” IFAC-PapersOnLine, vol. 55,
[8] Wang, J., Chen, H., Deng, H., & Dong, F., “A No. 1, p. 965–970, 2022. DOI:
combined power and steam system integrated 10.1016/j.ifacol.2022.04.159.
with solar photovoltaic/thermal collector: [16] He, B., Lu, H., Zheng, C., & Wang, Y.,
Thermodynamic characteristics and cost- “Characteristics and cleaning methods of dust
benefit analyses,” Case Studies in Thermal deposition on solar photovoltaic modules-A
Engineering, vol. 39, p. 102477, 2022. DOI: review,” Energy, vol. 263, p. 126083, 2023.
10.1016/j.csite.2022.102477. DOI: 10.1016/j.energy.2022.126083.
[9] Chirwa, D., Goyal, R., & Mulenga, E., [17] Xu, Z., Kong, Q., Qu, H., & Wang, C.,
“Floating solar photovoltaic (FSPV) potential “Cooling characteristics of solar photovoltaic
in Zambia: Case studies on six hydropower panels based on phase change materials,”
power plant reservoirs,” Renewable Energy Case Studies in Thermal Engineering, vol. 41,
Focus, vol. 44, p. 344–356, 2023. DOI: p. 102667, 2023. DOI:
10.1016/j.ref.2023.01.007. 10.1016/j.csite.2022.102667.
[10] Özçelep, Y., Bekdaş, G., & Apak, S., [18] Macías, R.J., Ceballos, C., Ordoñez-Loza, J.,
“Meeting the electricity demand for the & Ortiz, M., “Evaluation of the performance
heating of greenhouses with hydrogen: Solar of a solar photovoltaic - Biomass gasifier
photovoltaic-hydrogen-heat pump system system as an electricity supplier,” Energy, vol.
application in Turkey,” Int J Hydrogen 260, p. 125046, 2022. DOI:
Energy, vol. 48, No. 7, p. 2510–2517, 2023. 10.1016/j.energy.2022.125046.
DOI: 10.1016/j.ijhydene.2022.10.125. [19] Dhass, A.D., Patel, D., & Patel, B.,
[11] Hasheem, M.J., Wang, S., Ye, N., Farooq, “Estimation of power losses in single-junction
M.Z., & Shahid, H. M., “Factors influencing gallium-arsenide solar photovoltaic cells,”
purchase intention of solar photovoltaic International Journal of Thermofluids, vol. 17,
technology: An extended perspective of p. 100303, 2023. DOI:
technology readiness index and theory of 10.1016/j.ijft.2023.100303.
planned behavior,” Cleaner and Responsible [20] Kulikov, A.L., Shepovalova, O.V., Ilyushin,
Consumption, vol. 7, p. 100079, 2022. DOI: P.V., Filippov, S.P., & Chirkov, S.V.,
10.1016/j.clrc.2022.100079. “Control of electric power quality indicators
[12] Pimpalkar, R., Sahu, A., Patil, R. B., & Roy, in distribution networks comprising a high
A., “A comprehensive review on failure share of solar photovoltaic and wind power
modes and effect analysis of the solar stations,” Energy Reports, vol. 8, p. 1501–
photovoltaic system,” Mater Today Proc, 1514, 2022. DOI: 10.1016/j.egyr.2022.08.217.
2022. DOI: 10.1016/j.matpr.2022.11.353. [21] Gómez-Calvet, R., Martínez-Duart, J.M., &
[13] Manoj Kumar, N., Chakraborty, S., Kumar Gómez-Calvet, A.R., “The 2030 power sector
Yadav, S., Singh, J., & Chopra, S.S., transition in Spain: Too little storage for so
“Advancing simulation tools specific to many planned solar photovoltaics?”
floating solar photovoltaic systems – Renewable and Sustainable Energy Reviews,
Comparative analysis of field-measured and vol. 174, p. 113094, 2023. DOI:
simulated energy performance,” Sustainable 10.1016/j.rser.2022.113094.
Energy Technologies and Assessments, vol. [22] Mohammed, C., Mohamed, M., Larbi, E.M.,
Manale, B., Hassan, Z., Jalal, B., & Smail, Z.,
“Extended method for the sizing, energy scarce,” The Electricity Journal, vol. 35, No.
management, and techno-economic 10, p. 107222, 2022. DOI:
optimization of autonomous solar 10.1016/j.tej.2022.107222
Photovoltaic/Battery systems: Experimental [30] Luan, R., & Lin, B., “Positive or negative?
validation and analysis,” Energy Convers Study on the impact of government subsidy
Manag, vol. 270, p. 116267, 2022. DOI: on the business performance of China’s solar
10.1016/j.enconman.2022.116267. photovoltaic industry,” Renew Energy, vol.
[23] Nain, P., & Kumar, A., “A state-of-art review 189, p. 1145–1153, 2022. DOI:
on end-of-life solar photovoltaics,” J Clean 10.1016/j.renene.2022.03.082.
Prod, vol. 343, p. 130978, 2022. DOI: [31] Le, M., Luong, V.S., Nguyen, D.K., Dao,
10.1016/j.jclepro.2022.130978. V.D., Vu, N.H., & Vu, H.H.T., “Remote
[24] Kinsey, G.S., Riedel-Lyngskær, N.C., Miguel, anomaly detection and classification of solar
A.A., Boyd, M., Braga, M., Shou, Ch., photovoltaic modules based on deep neural
Cordero, R.R., Duck, B.C., Fell, Ch. J., Feron, network,” Sustainable Energy Technologies
S., Georghiou, G.E., Habryl, N., John, J.J., and Assessments, vol. 48, p. 101545, 2021.
Ketjoy, N., López, G., Louwen, A., Maweza, DOI: 10.1016/j.seta.2021.101545.
E.L., Minemoto, T., Mittal, A., Molto, C., [32] Oteng, D., Zuo, J., & Sharifi, E., “An expert-
Neves, G., Garrido, G.N., Norton, M., based evaluation on end-of-life solar
Paudyal, B.R., Pereira, E.B., Poissant, Y., photovoltaic management: An application of
Pratt, L., Shen, Q., Reindl, T., Rennhofer, M., Fuzzy Delphi Technique,” Sustainable
Rodríguez-Gallegos, C.D., Rüther, R., van Horizons, vol. 4, p. 100036, 2022. DOI:
Sark, W., Sevillano-Bendezú, M.A., 10.1016/j.horiz.2022.100036.
Seigneur, H., Tejero, J.A., Theristis, M., [33] Singh, S.K., & Chander, N., “Mid-life
Töfflinger, J.A., Ulbrich, C., Vilela, W.A., degradation evaluation of polycrystalline Si
Xia, X., & Yamasoe, M.A., “Impact of solar photovoltaic modules in a 100 kW grid-
measured spectrum variation on solar tied system in east-central India,” Renew
photovoltaic efficiencies worldwide,” Renew Energy, vol. 199, p. 351–367, 2022. DOI:
Energy, vol. 196, p. 995–1016, 2022. DOI: 10.1016/j.renene.2022.09.013.
10.1016/j.renene.2022.07.011. [34] Ilyushin, P.V., Shepovalova, O.V., Filippov,
[25] Liu, D., Qi, S., & Xu, T., “Visual observation S.P., & Nekrasov, A.A., “The effect of
or oral communication? The effect of social complex load on the reliable operation of
learning on solar photovoltaic adoption solar photovoltaic and wind power stations
intention in rural China,” Energy Res Soc Sci, integrated into energy systems and into off-
vol. 97, p. 102950, 2023. DOI: grid energy areas,” Energy Reports, vol. 8, p.
10.1016/j.erss.2023.102950. 1515–1529, 2022. DOI:
[26] Chandel, R., Chandel, S.S., & Malik, P., 10.1016/j.egyr.2022.08.218.
“Perspective of new distributed grid [35] Choi, J., Lee, I.W., & Cha, S.W., “Analysis of
connected rooftop solar photovoltaic power data errors in the solar photovoltaic
generation policy interventions in India,” monitoring system database: An overview of
Energy Policy, vol. 168, p. 113122, 2022. nationwide power plants in Korea,”
DOI: 10.1016/j.enpol.2022.113122. Renewable and Sustainable Energy Reviews,
[27] Prasad, M., & Prasad, R., “Bifacial vs vol. 156, p. 112007, 2022. DOI:
monofacial grid-connected solar photovoltaic 10.1016/j.rser.2021.112007.
for small islands: A case study of Fiji,” [36] Majji, R.K., Mishra, J.P., & Dongre, A.A.,
Renew Energy, vol. 203, p. 686–702, 2023. “Model predictive control based autonomous
DOI: 10.1016/j.renene.2022.12.068. DC microgrid integrated with solar
[28] Ndzibah, E., Pinilla-De La Cruz, G.A., & photovoltaic system and composite energy
Shamsuzzoha, A., “Collaboration towards storage,” Sustainable Energy Technologies
value creation for end-of-life solar and Assessments, vol. 54, p. 102862, 2022.
photovoltaic panel in Ghana,” J Clean Prod, DOI: 10.1016/j.seta.2022.102862.
vol. 333, p. 129969, 2022. DOI: [37] Sadanand, S., & Dwivedi, D.K., “Numerical
10.1016/j.jclepro.2021.129969. modeling for earth-abundant highly efficient
[29] Boretti, A., & Castelletto, S., “Lacking energy solar photovoltaic cell of non-toxic buffer
storage, and nuclear contribution, wind, and layer,” Opt Mater (Amst), vol. 109, p.
solar photovoltaic electricity is expensive and