The aim of this study was to build a regression model of solar irradiation in the Kulluk region of Turkey by using the multivariate adaptive regression splines (MARS) technique. Using the well-known data mining algorithm, MARS, this study has explored a convenient prediction model for continuous response variables, i.e., average daily energy production from the given system (Ed), average monthly energy production from given system (Em), average daily sum of global irradiation per square meter (Hd) and average annual sum of global irradiation per square meter (Hm). Four continuous estimators are included to estimate Ed, Em, Hd and Hm: Estimated losses due to temperature and low irradiance (ESLOTEM), estimated loss due to angular reflection effect (ESLOANGREF), combined photovoltaic system loss (COMPVLOSS) and rated power of the photovoltaic system (PPVS). Four prediction models as constructed by implementing the MARS algorithm, have been obtained by applying the smallest generalized cross-validation (GCV) criterion where the means of penalty are defined as 1 and the backward pruning method for the package "earth" of R software is used. As a result, it can be suggested that the procedure of the MARS algorithm, which achieves the greatest predictive accuracy of 100% or nearly 100%, permits researchers to obtain some remarkable hints for ascertaining predictors that affect solar irradiation parameters. The coefficient of determination denoted as R2 was estimated at the highest predictive accuracy to be nearly 1 for Ed, Em, Hd and Hm while the GCV values were found to be 0.000009, 0.018908, 0.000013 and 0.019021, respectively. The estimated results indicate that four MARS models with the first degree interaction effect have the best predictive performances for verification with the lowest GCV value.
Citation: Gokhan Sahin, W.G.J.H.M. Van Wilfried Sark. Estimation of most optimal azimuthal angles for maximum PV solar efficiency using multivariate adaptive regression splines (MARS)[J]. AIMS Energy, 2023, 11(6): 1328-1353. doi: 10.3934/energy.2023060
The aim of this study was to build a regression model of solar irradiation in the Kulluk region of Turkey by using the multivariate adaptive regression splines (MARS) technique. Using the well-known data mining algorithm, MARS, this study has explored a convenient prediction model for continuous response variables, i.e., average daily energy production from the given system (Ed), average monthly energy production from given system (Em), average daily sum of global irradiation per square meter (Hd) and average annual sum of global irradiation per square meter (Hm). Four continuous estimators are included to estimate Ed, Em, Hd and Hm: Estimated losses due to temperature and low irradiance (ESLOTEM), estimated loss due to angular reflection effect (ESLOANGREF), combined photovoltaic system loss (COMPVLOSS) and rated power of the photovoltaic system (PPVS). Four prediction models as constructed by implementing the MARS algorithm, have been obtained by applying the smallest generalized cross-validation (GCV) criterion where the means of penalty are defined as 1 and the backward pruning method for the package "earth" of R software is used. As a result, it can be suggested that the procedure of the MARS algorithm, which achieves the greatest predictive accuracy of 100% or nearly 100%, permits researchers to obtain some remarkable hints for ascertaining predictors that affect solar irradiation parameters. The coefficient of determination denoted as R2 was estimated at the highest predictive accuracy to be nearly 1 for Ed, Em, Hd and Hm while the GCV values were found to be 0.000009, 0.018908, 0.000013 and 0.019021, respectively. The estimated results indicate that four MARS models with the first degree interaction effect have the best predictive performances for verification with the lowest GCV value.
[1] | REN 21 (2019) Renewables 2019 global status report. Available from: https://www.ren21.net/gsr. |
[2] | Despotovic M, Nedic V, Despotovic D, et al. (2016) Evaluation of empirical models for predicting monthly mean horizontal diffuse solar radiation. Renewable Sustainable Energy Rev 56: 246–260. https://doi.org/10.1016/j.rser.2015.11.058 doi: 10.1016/j.rser.2015.11.058 |
[3] | Jamil B, Akhtar N (2017) Comparison of empirical models to estimate monthly mean diffuse solar radiation from measured data: Case study for humid-subtropical climatic region of India. Renewable Sustainable Energy Rev 77: 1326–1342. https://doi.org/10.1016/j.rser.2017.02.057 doi: 10.1016/j.rser.2017.02.057 |
[4] | Filho EPM, Oliveira AP, Vita WA, et al. (2016) Global, diffuse and direct solar radiation at the surface in the city of Rio de Janeiro: Observational characterization and empirical modeling. Renewable Energy 91: 64–74. https://doi.org/10.1016/j.renene.2016.01.040 doi: 10.1016/j.renene.2016.01.040 |
[5] | Liao W, Wang X, Fan Q, et al. (2015) Long-term atmospheric visibility, sunshine duration and precipitation trends in South China. Atm Environ 107: 204–216. https://doi.org/10.1016/j.atmosenv.2015.02.015 doi: 10.1016/j.atmosenv.2015.02.015 |
[6] | Chelbi M, Gagnon Y, Waewsak J (2015) Solar radiation mapping using sunshine duration-based models and interpolation techniques: Application to Tunisia. Energy Conv Manage 101: 203–215. https://doi.org/10.1016/j.enconman.2015.04.052 doi: 10.1016/j.enconman.2015.04.052 |
[7] | Zhu W, Lu A, Jia S, et al. (2017) Retrievals of all-weather daytime air temperature from MODIS products. Remote Sens Environ 189: 152–163. https://doi.org/10.1016/j.rse.2016.11.011 doi: 10.1016/j.rse.2016.11.011 |
[8] | Ho HC, Knudby A, Xu Y, et al. (2016) A comparison of urban heat islands mapped using skin temperature, air temperature, and apparent temperature (Humidex), for the greater Vancouver area. Sci Total Environ 544: 929–938. https://doi.org/10.1016/j.scitotenv.2015.12.021 doi: 10.1016/j.scitotenv.2015.12.021 |
[9] | Wenbin Z, Aifeng L, Shaofeng J (2013) Estimation of daily maximum and minimum air temperature using MODIS land surface temperature products. Remote Sens Environ 130: 62–73. https://doi.org/10.1016/j.rse.2012.10.034 doi: 10.1016/j.rse.2012.10.034 |
[10] | Chang K, Zhang Q (2019) Improvement of the hourly global solar model and solar radiation for air-conditioning design in China. Renewable Energy 138: 1232–1238. https://doi.org/10.1016/j.renene.2019.02.069 doi: 10.1016/j.renene.2019.02.069 |
[11] | Jiang H, Lu N, Qin J, et al. (2019) A deep learning algorithm to estimate hourly global solar radiation from geostationary satellite data. Renewable Sustainable Energy Rev 114: 1–13. https://doi.org/10.1016/j.rser.2019.109327 doi: 10.1016/j.rser.2019.109327 |
[12] | Li DHW, Chen W, Li S, et al. (2019) Estimation of hourly global solar radiation using Multivariate Adaptive Regression Spline (MARS)—A case study of Hong Kong. Energy 186: 1–14. https://doi.org/10.1016/j.energy.2019.115857 doi: 10.1016/j.energy.2019.115857 |
[13] | Cornejo-Bueno L, Casanova-Mateo C, Sanz-Justo J, et al. (2019) Machine learning regressors for solar radiation estimation from satellite data. Sol Energy 183: 768–775. https://doi.org/10.1016/j.solener.2019.03.079 doi: 10.1016/j.solener.2019.03.079 |
[14] | Manju S, Sandeep M (2019) Prediction and performance assessment of global solar radiation in Indian cities: A comparison of satellite and surface measured data. J Clean Prod 230: 116–128. https://doi.org/10.1016/j.jclepro.2019.05.108 doi: 10.1016/j.jclepro.2019.05.108 |
[15] | Gouda SG, Hussein Z, Luo S, et al. (2019) Model selection for accurate daily global solar radiation prediction in China. J Clean Prod 221: 132–144. https://doi.org/10.1016/j.jclepro.2019.02.211 doi: 10.1016/j.jclepro.2019.02.211 |
[16] | Guermoui M, Melgani F, Danilo C (2018) Multi-step ahead forecasting of daily global and direct solar radiation: A review and case study of Ghardaia region. J Clean Prod 201: 716–734. https://doi.org/10.1016/j.jclepro.2018.08.006 doi: 10.1016/j.jclepro.2018.08.006 |
[17] | Feng Y, Gong D, Zhang Q, et al. (2019) Evaluation of temperature-based machine learning and empirical models for predicting daily global solar radiation. Energy Conver Manag 198: 111780. https://doi.org/10.1016/j.enconman.2019.111780 doi: 10.1016/j.enconman.2019.111780 |
[18] | Makade RG, Chakrabarti S, Jamil B, et al. (2020) Estimation of global solar radiation for the tropical wet climatic region of India: A theory of experimentation approach. Renewable Energy 146: 2044–2059. https://doi.org/10.1016/j.renene.2019.08.054 doi: 10.1016/j.renene.2019.08.054 |
[19] | Kisi O, Heddam S, Yaseen ZM (2019) The implementation of univariable scheme-based air temperature for solar radiation prediction: New development of dynamic evolving neural-fuzzy inference system model. Appl Energy 241: 184–195. https://doi.org/10.1016/j.apenergy.2019.03.089 doi: 10.1016/j.apenergy.2019.03.089 |
[20] | Kaplan AG, Kaplan YA (2020) Developing of the new models in solar radiation estimation with curve fitting based on moving least-squares approximation. Renewable Energy 146: 2462–2471. https://doi.org/10.1016/j.renene.2019.08.095 doi: 10.1016/j.renene.2019.08.095 |
[21] | Anis MS, Jamil B, Ansari MA, et al. (2019) Generalized models for estimation of global solar radiation based on sunshine duration and detailed comparison with the existing: A case study for India. Sustainable Energy Technol Assess 31: 179–198. https://doi.org/10.1016/j.seta.2018.12.009 doi: 10.1016/j.seta.2018.12.009 |
[22] | Gürel AE, Ağbulut U, Biçen Y (2020) Assessment of machine learning, time series, response surface methodology and empirical models in prediction of global solar radiation. J Clean Prod 277: 122353. https://doi.org/10.1016/j.jclepro.2020.122353 doi: 10.1016/j.jclepro.2020.122353 |
[23] | El-Baz W, Tzscheutschler P, Wagner U (2018) Day-ahead probabilistic PV generation forecast for buildings energy management systems. Sol Energy 171: 478–490. https://doi.org/10.1016/j.solener.2018.06.100 doi: 10.1016/j.solener.2018.06.100 |
[24] | Hu K, Cao S, Wang L, et al. (2018) A new ultra-short-term photovoltaic power prediction model based on ground-based cloud images. J Clean Prod 200: 731–745. https://doi.org/10.1016/j.jclepro.2018.07.311 doi: 10.1016/j.jclepro.2018.07.311 |
[25] | Dong J, Olama MM, Kuruganti T, et al. (2020) Novel stochastic methods to predict short-term solar radiation and photovoltaic power. Renewable Energy 145: 333–346. https://doi.org/10.1016/j.renene.2019.05.073 doi: 10.1016/j.renene.2019.05.073 |
[26] | Gao M, Li J, Hong F, et al. (2019) Day-ahead power forecasting in a large-scale photovoltaic plant based on weather classification using LSTM. Energy 187: 115838. https://doi.org10.1016/j.energy.2019.07.168 doi: 10.1016/j.energy.2019.07.168 |
[27] | Gulin M, Pavlovic T, Vašak M (2017) A one-day-ahead photovoltaic array power production prediction with combined static and dynamic on-line correction. Sol Energy 142: 49–60. https://doi.org/10.1016/j.solener.2016.12.008 doi: 10.1016/j.solener.2016.12.008 |
[28] | Wang K, Qi X, Liu H (2019) A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl Energy 251: 1–14. https://doi.org/10.1016/j.apenergy.2019.113315 doi: 10.1016/j.apenergy.2019.113315 |
[29] | Wang G, Su Y, Shu L (2016) One-day-ahead daily power forecasting of photovoltaic systems based on partial functional linear regression models. Renewable Energy 96: 469–478. https://doi.org/10.1016/j.renene.2016.04.089 doi: 10.1016/j.renene.2016.04.089 |
[30] | Han S, Qiaoa YH, Yan J, et al. (2019) Mid-to-long term wind and photovoltaic power generation prediction based on copula function and long short term memory networke. Appl Energy 239: 181–191. https://doi.org/10.1016/j.apenergy.2019.01.193 doi: 10.1016/j.apenergy.2019.01.193 |
[31] | Yang XS, Deb S (2009) Cuckoo search via Lévy flights. 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC), Coimbatore, India, 210–214. https://doi.org/10.1109/NABIC.2009.5393690 |
[32] | Wang H, Yi H, Peng J, et al. (2017) Deterministic and probabilistic forecasting of photovoltaic power based on deep convolutional neural network. Energy Conver Manag 153: 409–422. https://doi.org/10.1016/j.enconman.2017.10.008 doi: 10.1016/j.enconman.2017.10.008 |
[33] | Sharifzadeh M, Sikinioti-Locka A, Shah N (2019) Machine-learning methods for integrated renewable power generation: A comparative study of artificial neural networks, support vector regression, and gaussian process regression. Renewable Sustainable Energy Rev 108: 513–538. https://doi.org/10.1016/j.rser.2019.03.040 doi: 10.1016/j.rser.2019.03.040 |
[34] | Wood DA (2019) German solar power generation data mining and prediction with transparent open box learning network integrating weather, environmental and market variables. Energy Conver Manage 196: 354–369. https://doi.org/10.1016/j.enconman.2019.05.114 doi: 10.1016/j.enconman.2019.05.114 |
[35] | Heydari A, Garcia DA, Keynia F, et al. (2019) A novel composite neural network based method for wind and solar power forecasting in microgrids. Appl Energy 251: 1–17. https://doi.org/10.1016/j.apenergy.2019.113353 doi: 10.1016/j.apenergy.2019.113353 |
[36] | Zhang WG, Goh ATC (2012) Multivariate adaptive regression splines for analysis of geotechnical engineering systems. Comput Geotech 48: 82–95. https://doi.org/10.1016/j.compgeo.2012.09.016 doi: 10.1016/j.compgeo.2012.09.016 |
[37] | Zhang G, Bateni SM, Jun C, et al. (2022) Feasibility of random forest and multivariate adaptive regression splines for predicting long-term mean monthly dew point temperature. Front Environ Sci 10: 826165. https://doi.org/10.3389/fenvs.2022.826165 doi: 10.3389/fenvs.2022.826165 |
[38] | Raj N, Gharineiat Z (2021) Evaluation of multivariate adaptive regression splines and artificial neural network for prediction of mean sea level trend around northern Australian coastlines. Mathematics 9: 2696. https://doi.org/10.3390/math9212696 doi: 10.3390/math9212696 |
[39] | Zhang WG, Goh ATC (2013) Multivariate adaptive regression splines for analysis of geotechnical engineering systems. Comput Geotech 48: 82–95. https://doi.org/10.1016/j.compgeo.2012.09.016 doi: 10.1016/j.compgeo.2012.09.016 |
[40] | Dhimish M, Silvestre S (2019) Estimating the impact of azimuth-angle variations on photovoltaic annual energy production. Clean Energy 3: 47–58. https://doi.org/10.1093/ce/zky022 doi: 10.1093/ce/zky022 |
[41] | Li DHW, Chen W, Li S, et al. (2019) Estimation of hourly global solar radiation using Multivariate Adaptive Regression Spline (MARS)—A case study of Hong Kong. Energy 186: 115857. https://doi.org/10.1016/j.energy.2019.115857 doi: 10.1016/j.energy.2019.115857 |
[42] | Srivastava R, Tiwari AN, Giri VK (2019) Solar radiation forecasting using MARS, CART, M5, and random forest model: A case study for India. Heliyon 5: e02692. https://doi.org/10.1016/j.heliyon.2019.e02692 doi: 10.1016/j.heliyon.2019.e02692 |
[43] | Turk S, Sahin G (2020) Multi-criteria decision-making in the location selection for a solar PV power plant using AHP. Measurement 153: 107384. https://doi.org/10.1016/j.measurement.2019.107384 doi: 10.1016/j.measurement.2019.107384 |
[44] | Sahin G, Isik G, van Sark W (2023) Predictive modeling of PV solar power plant efficiency considering weather conditions: A comparative analysis of artificial neural networks and multiple linear regression. Energy Rep 10: 2837–2849. https://doi.org/10.1016/j.egyr.2023.09.097 doi: 10.1016/j.egyr.2023.09.097 |
[45] | Friedman JH (2011) Multivariate adaptive regression splines. The Ann Stat 19: 14122. |
[46] | Duzen H, Aydin H (2012) Sunshine-based estimation of global solar radiation on horizontal surface at Lake Van region (Turkey). Energy Convers Manage 58: 35e46. https://doi.org/10.1016/j.enconman.2011.11.028 doi: 10.1016/j.enconman.2011.11.028 |
[47] | Bou-Rabeea M, Sulaimanb SA, Salehc MS, et al. (2017) Using artificial neural networks to estimate solar radiation in Kuwait. Renewable Sustainable Energy Rev 72: 434–438. https://doi.org/10.1016/j.rser.2017.01.013 doi: 10.1016/j.rser.2017.01.013 |
[48] | Li DHW, Chen W, Li S, et al. (2019) Estimation of hourly global solar radiation using Multivariate Adaptive Regression Spline (MARS)—A case study of Hong Kong. Energy 186: 115857. https://doi.org/10.1016/j.energy.2019.115857 doi: 10.1016/j.energy.2019.115857 |
[49] | Mohammad Rezaie-Balf M, Maleki N, Kim S, et al. (2019) Forecasting daily solar radiation using CEEMDAN decomposition-based mars model trained by crow search algorithm. Energies 12: 1416. https://doi.org/10.3390/en12081416 doi: 10.3390/en12081416 |
[50] | Rehman S, Mohandes M (2008) Artificial neural network estimation of global solar radiation using air temperature and relative humidity. Energy Policy 36: 571–576. https://doi.org/10.1016/j.enpol.2007.09.033 doi: 10.1016/j.enpol.2007.09.033 |
[51] | Olatomiwa L, Mekhilef S, Shamshirband S, et al. (2015) Adaptive neuro-fuzzy approach for solar radiation prediction in Nigeria. Renewable Sustainable Energy Rev 51: 1784–1791. https://doi.org/10.1016/j.rser.2015.05.068 doi: 10.1016/j.rser.2015.05.068 |
[52] | Zeng J, Qiao W (2013) Short-term solar power prediction using a support vector machine. Renewable Energy 52: 118–127. https://doi.org/10.1016/j.renene.2012.10.009 doi: 10.1016/j.renene.2012.10.009 |
[53] | Wang L, Kisi O, Zounemat-Kermani M, et al. (2017) Prediction of solar radiation in China using different adaptive neuro-fuzzy methods and M5 model tree. Int J Climatol 37: 1141–1155. https://doi.org/10.1002/joc.4762 doi: 10.1002/joc.4762 |
[54] | Kim S, Seo Y, Rezaie-Balf M, et al. (2018) Evaluation of daily solar radiation flux using soft computing approaches based on different meteorological information: peninsula vs continent. Theor Appl Climatol 137: 693–712. https://doi.org/10.1007/s00704-018-2627-x doi: 10.1007/s00704-018-2627-x |
[55] | Sahin G, Eyduran E, Turkoglu M, et al. (2018) Estimation of global irradiation parameters at location of migratory birds in Igdir, Turkey by means of MARS algorithm. Pakistan J Zool 50: 2317–2324. http://dx.doi.org/10.17582/journal.pjz/2018.50.6.2317.2324 doi: 10.17582/journal.pjz/2018.50.6.2317.2324 |
[56] | Sahin F, Isik G, Sahin G, et al. (2020) Estimation of PM10 levels using feed forward neural networks in Igdir, Turkey. Urban Clim 34: 100721. https://doi.org/10.1016/j.uclim.2020.100721 doi: 10.1016/j.uclim.2020.100721 |
[57] | Kaya F, Sahin G, Alma MH (2021) Investigation effects of environmental and operating factors on PV panel efficiency using by multivariate linear regression. Int J Energy Res 45: 554–567. https://doi.org/10.1002/er.5717 doi: 10.1002/er.5717 |
[58] | Friedman JH (2011) Multivariate daptive regression spline. Ann Stat 19: 1–141. https://doi.org/10.1214/aos/1176347963 doi: 10.1214/aos/1176347963 |
[59] | Sahin F, Kara MK, Koc A, et al. (2020) Multi-criteria decision-making using GIS-AHP for air pollution problem in Igdir Province/Turkey. Environ Sci Poll Rsch 27: 36215–36230. https://doi.org/10.1007/s11356-020-09710-3 doi: 10.1007/s11356-020-09710-3 |
[60] | Sahraei MA, Duman H, Çodur MY, et al. (2021) Prediction of transportation energy demand: Multivariate adaptive regression splines. Energy 224: 120090. https://doi.org/10.1016/j.energy.2021.120090 doi: 10.1016/j.energy.2021.120090 |
[61] | Yadav P, Chandel SS (2014) Comparative analysis of diffused solar radiation models for optimum tilt angle determination for Indian locations. Appl Sol Energy 50: 53–59. https://doi.org/10.3103/S0003701X14010137 doi: 10.3103/S0003701X14010137 |
[62] | Samui P (2013) Multivariate adaptive regression spline (Mars) for prediction of elastic modulus of jointed rock mass. Geotech Geol Eng 31: 249–253. https://doi.org/10.1007/s10706-012-9584-4 doi: 10.1007/s10706-012-9584-4 |
[63] | Rustemli S, İlcihan Z, Sahin G, et al. (2023) A novel design and simulation of a mechanical coordinate based photovoltaic solar tracking system. AIMS Energy 11: 753–773. https://doi:10.3934/energy.2023037 doi: 10.3934/energy.2023037 |
[64] | Hastie T, Tibshirani R, Friedman JH, et al. (2004) The elements of statistical learning: Data mining, inference, and prediction. Math Intel 27: 83–85. Available from: https://hastie.su.domains/Papers/ESLII.pdf. |
[65] | Craven P, Wahba G (1979) Smoothing noisy data with spline functions: Estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31: 377–403. https://doi.org/10.1007/BF01404567 doi: 10.1007/BF01404567 |
[66] | Kornacki J, Ćwik J (2005) Statistical learning systems (in Polish). WNT Warsaw, 16. |
[67] | Hastie T, Tibshirani R, Friedman J (2011) The elements of statistical learning: Data mining, inference and prediction. Stanford, California: Springer-Verlag, 337–343. |
[68] | Put R, Xu Q, Massart D, et al. (2004) Multivariate adaptive regression splines (MARS) in chromatographic quantitative structure-retention relationship studies. J Chromatog 1055: 11–19. https://doi.org/10.1016/j.chroma.2004.07.112 doi: 10.1016/j.chroma.2004.07.112 |
[69] | Khan MA, Tariq MM, Eyduran E, et al. (2014) Estimating body weight from several body measurements in Harnai sheep without multicollinearity problem. J Anim Pl Sc 24: 120–126. |
[70] | Mohammad MT, Rafeeq M, Bajwa MA, et al. (2012) Prediction of body weight from body measurements using regression tree (RT) method for indigenous sheep breeds in Balochistan. Pakistan J Anim Pl Sci 22: 20–24. |
[71] | Roy SS, Roy R, Balas VE (2018) Estimating heating load in buildings using multivariate adaptive regression splines, extreme learning machine, a hybrid model of MARS and ELM. Renewable Sustainable Energy Rev 82: 4256e68. https://doi.org/10.1016/j.rser.2017.05.249 doi: 10.1016/j.rser.2017.05.249 |
[72] | Huang H, Ji X, Xia F, et al. (2020) Multivariate adaptive regression splines for estimating riverine constituent concentrations. Process 34: 1213e27. https://doi.org/10.1002/hyp.13669 doi: 10.1002/hyp.13669 |
[73] | Gross F, Persaud B, Lyon C, et al. (2010) A guide to developing quality crash modification factors. Publication, FHWA-SA-10-032, FHWA. U.S. Department of Transportation. Available from: https://www.cmfclearinghouse.org/collateral/CMF_Guide.pdf. |
[74] | Tunay KB (2001) Estimated income velocity of money method MARS in Turkey. METU Develop J 28: 431–454. |
[75] | Li Y, He Y, Su Y, et al. (2016), Forecasting the daily power output of a grid-connected photovoltaic system based on multivariate adaptive regression splines. Appl Energy 180: 392e401. https://doi.org/10.1016/j.apenergy.2016.07.052 doi: 10.1016/j.apenergy.2016.07.052 |