The rise in global ozone levels over the last few decades has harmed human health. This problem exists in several cities throughout South America due to dangerous levels of particulate matter in the air, particularly during the winter season, making it a public health issue. Lima, Peru, is one of the ten cities in South America with the worst levels of air pollution. Thus, efficient and precise modeling and forecasting are critical for ozone concentrations in Lima. The focus is on developing precise forecasting models to anticipate ozone concentrations, providing timely information for adequate public health protection and environmental management. This work used hourly O$ _{3} $ data in metropolitan areas for multi-step-ahead (one-, two-, three-, and seven-day-ahead) O$ _{3} $ forecasts. A multiple linear regression model was used to represent the deterministic portion, and four-time series models, autoregressive, nonparametric autoregressive, autoregressive moving average, and nonlinear neural network autoregressive, were used to describe the stochastic component. The various horizon out-of-sample forecast results for the considered data suggest that the proposed component-based forecasting technique gives a highly consistent, accurate, and efficient gain. This may be expanded to other districts of Lima, different regions of Peru, and even the global level to assess the efficacy of the proposed component-based modeling and forecasting approach. Finally, no analysis has been undertaken using a component-based estimation to forecast ozone concentrations in Lima in a multi-step-ahead manner.
Citation: Flor Quispe, Eddy Salcedo, Hasnain Iftikhar, Aimel Zafar, Murad Khan, Josué E. Turpo-Chaparro, Paulo Canas Rodrigues, Javier Linkolk López-Gonzales. Multi-step ahead ozone level forecasting using a component-based technique: A case study in Lima, Peru[J]. AIMS Environmental Science, 2024, 11(3): 401-425. doi: 10.3934/environsci.2024020
The rise in global ozone levels over the last few decades has harmed human health. This problem exists in several cities throughout South America due to dangerous levels of particulate matter in the air, particularly during the winter season, making it a public health issue. Lima, Peru, is one of the ten cities in South America with the worst levels of air pollution. Thus, efficient and precise modeling and forecasting are critical for ozone concentrations in Lima. The focus is on developing precise forecasting models to anticipate ozone concentrations, providing timely information for adequate public health protection and environmental management. This work used hourly O$ _{3} $ data in metropolitan areas for multi-step-ahead (one-, two-, three-, and seven-day-ahead) O$ _{3} $ forecasts. A multiple linear regression model was used to represent the deterministic portion, and four-time series models, autoregressive, nonparametric autoregressive, autoregressive moving average, and nonlinear neural network autoregressive, were used to describe the stochastic component. The various horizon out-of-sample forecast results for the considered data suggest that the proposed component-based forecasting technique gives a highly consistent, accurate, and efficient gain. This may be expanded to other districts of Lima, different regions of Peru, and even the global level to assess the efficacy of the proposed component-based modeling and forecasting approach. Finally, no analysis has been undertaken using a component-based estimation to forecast ozone concentrations in Lima in a multi-step-ahead manner.
[1] | Ghorani-Azam, A, Riahi-Zanjani B, Balali-Mood M (2016) Effects of air pollution on human health and practical measures for prevention in Iran. J Res Med Sci 21. https://doi.org/10.4103/1735-1995.189646 |
[2] | Hailstone J. Hailstone J (2023) https://www.forbes.com/sites/jamiehailstone/2023/03/07/nearly-nowhere-on-earth-safe-from-air-pollution-study-finds/?sh=1e0d9fd9da1d/ (accessed july 25, 2023). |
[3] | Ordóñez C, Garrido-Perez J M, García-Herrera R (2020) Early spring near-surface ozone in Europe during the COVID-19 shutdown: Meteorological effects outweigh emission changes. Sci Total Environ 747: 141322. https://doi.org/10.1016/j.scitotenv.2020.141322 doi: 10.1016/j.scitotenv.2020.141322 |
[4] | Mostafa M K, Gamal G, Wafiq A (2021) The impact of COVID 19 on air pollution levels and other environmental indicators-A case study of Egypt. J Environ Manage 277: 111496. https://doi.org/10.1016/j.jenvman.2020.111496 doi: 10.1016/j.jenvman.2020.111496 |
[5] | Gagliardi R V, Andenna C (2020) A machine learning approach to investigate the surface ozone behavior. Atmosphere 11: 1173. https://doi.org/10.3390/atmos11111173 doi: 10.3390/atmos11111173 |
[6] | Jaffe DA, Cooper OR, Fiore AM, et al. (2018) Scientific assessment of background ozone over the US: Implications for air quality management. Elem Sci Anth 6: 56. https://doi.org/10.1525/elementa.309 doi: 10.1525/elementa.309 |
[7] | Lu H, Lyu X, Cheng H, et al.(2019) Overview on the spatial–temporal characteristics of the ozone formation regime in China. Environmental Science: Processes & Impacts 21: 916-929. https://doi.org/10.1039/C9EM00098D doi: 10.1039/C9EM00098D |
[8] | Käffer M I, Domingos M, Lieske I, et al. (2019) Predicting ozone levels from climatic parameters and leaf traits of Bel-W3 tobacco variety. Environ Pollut 248: 471-477. https://doi.org/10.1016/j.envpol.2019.01.130 doi: 10.1016/j.envpol.2019.01.130 |
[9] | Li Y, Xue Y, Guang J, et al. (2018) Ground-level PM2. 5 concentration estimation from satellite data in the Beijing area using a specific particle swarm extinction mass conversion algorithm. Remote Sens 10: 1906. https://doi.org/10.3390/rs10121906 doi: 10.3390/rs10121906 |
[10] | Velasco E, Retama A (2017) Ozone's threat hits back Mexico City. Sustain Cities Soc 31: 260-263. https://doi.org/10.1016/j.scs.2016.12.015 doi: 10.1016/j.scs.2016.12.015 |
[11] | Carbo-Bustinza N, Belmonte M, Jimenez V, et al. (2022) A machine learning approach to analyse ozone concentration in metropolitan area of Lima, Peru. Sci Rep 12: 22084. https://doi.org/10.1038/s41598-022-26575-3 doi: 10.1038/s41598-022-26575-3 |
[12] | Cohen AJ, Brauer M, Burnett R, et al. (2017) Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015. The Lancet 389: 1907-1918. https://doi.org/10.1016/S0140-6736(17)30505-6 doi: 10.1016/S0140-6736(17)30505-6 |
[13] | Iftikhar H, Khan M, Khan Z, et al. (2023). A Comparative Analysis of Machine Learning Models: A Case Study in Predicting Chronic Kidney Disease. Sustainability 15: 2754. https://doi.org/10.3390/su15032754 doi: 10.3390/su15032754 |
[14] | Jakovlev A R, Smyshlyaev S P et al. (2019) Numerical simulation of world ocean effects on temperature and ozone in the lower and middle atmosphere. Russ Meteorol Hydrol 44: 594-602. https://doi.org/10.3103/S1068373919090036 doi: 10.3103/S1068373919090036 |
[15] | Gaudel A, Cooper OR, Ancellet G, et al. (2018) Assessment Report: Present-day distribution and trends of tropospheric ozone relevant to climate and global atmospheric chemistry model evaluation. Elem Sci Anth 6: 39. https://doi.org/10.1525/elementa.291 doi: 10.1525/elementa.291 |
[16] | Rodríguez-Urrego D, Rodríguez-Urrego L (2020) Air quality during the COVID-19: PM2. 5 analysis in the 50 most polluted capital cities in the world. Environ Pollut 266: 115042. https://doi.org/10.1016/j.envpol.2020.115042 doi: 10.1016/j.envpol.2020.115042 |
[17] | Rybarczyk Y, Zalakeviciute R (2018) Machine learning approaches for outdoor air quality modelling: A systematic review. Appl Sci 8: 2570. https://doi.org/10.3390/app8122570 doi: 10.3390/app8122570 |
[18] | Iftikhar H, Khan N, Raza MA, et al. (2024). Electricity theft detection in smart grid using machine learning. Front Energy Res 12: 1383090. https://doi.org/10.3389/fenrg.2024.1383090 doi: 10.3389/fenrg.2024.1383090 |
[19] | Comrie A C (1997) Comparing neural networks and regression models for ozone forecasting. J Air Waste Manage 47: 653-663. https://doi.org/10.1080/10473289.1997.10463925 doi: 10.1080/10473289.1997.10463925 |
[20] | Carbo-Bustinza N, Iftikhar H, Belmonte M, et al. (2023). Short-term forecasting of Ozone concentration in metropolitan Lima using hybrid combinations of time series models. Appl Sci 13: 10514. https://doi.org/10.3390/app131810514 doi: 10.3390/app131810514 |
[21] | Harrou F, Fillatre L, Bobbia M, et al. (2013) Statistical detection of abnormal ozone measurements based on constrained generalized likelihood ratio test. In 52nd IEEE Conference on Decision and Control, Firenze, Italy, 10-13 December 2013. https://doi.org/10.1109/CDC.2013.6760673 |
[22] | Duenas C, Fernandez M C, Canete S, et al. (2005) Stochastic model to forecast ground-level ozone concentration at urban and rural areas. Chemosphere 61: 1379-1389. https://doi.org/10.1016/j.chemosphere.2005.04.079 doi: 10.1016/j.chemosphere.2005.04.079 |
[23] | Iftikhar H, Khan M, Turpo-Chaparro J E, et al. (2024). Forecasting stock prices using a novel filtering-combination technique: Application to the Pakistan stock exchange. AIMS Math 9: 3264-3288. https://doi.org/10.3934/math.2024159 doi: 10.3934/math.2024159 |
[24] | Petetin H, Bowdalo D, Soret A, et al. (2020) Meteorology-normalized impact of the COVID-19 lockdown upon NO 2 pollution in Spain. Atmos Chem Phys 20: 19-11141. https://doi.org/10.5194/acp-20-11119-2020 doi: 10.5194/acp-20-11119-2020 |
[25] | Aljanabi M, Shkoukani M, Hijjawi M (2020) Ground-level ozone prediction using machine learning techniques: A case study in Amman, Jordan. Int J Autom Comput 17: 667-677. https://doi.org/10.1007/s11633-020-1233-4 doi: 10.1007/s11633-020-1233-4 |
[26] | Sousa S I V, Martins F G, Alvim-Ferraz M C, et al. (2007) Multiple linear regression and artificial neural networks based on principal components to predict ozone concentrations. Environ Modell Softw 22: 97-103. https://doi.org/10.1016/j.envsoft.2005.12.002 doi: 10.1016/j.envsoft.2005.12.002 |
[27] | Chelani A B (2010) Prediction of daily maximum ground ozone concentration using support vector machine. Environmental monitoring and assessment 162: 169-176. https://doi.org/10.1007/s10661-009-0785-0 doi: 10.1007/s10661-009-0785-0 |
[28] | Ren X, Mi Z, Georgopoulos P G (2020) Comparison of Machine Learning and Land Use Regression for fine scale spatiotemporal estimation of ambient air pollution: Modeling ozone concentrations across the contiguous United States. Environ Int 142: 105827. https://doi.org/10.1016/j.envint.2020.105827 doi: 10.1016/j.envint.2020.105827 |
[29] | Yafouz A, AlDahoul N, Birima AH, et al. (2022) Comprehensive comparison of various machine learning algorithms for short-term ozone concentration prediction. Alex Eng J 61: 4607-4622. https://doi.org/10.1016/j.aej.2021.10.021 doi: 10.1016/j.aej.2021.10.021 |
[30] | Pan Q, Harrou F, Sun Y A (2023) comparison of machine learning methods for ozone pollution prediction. J Big Data 10: 63. https://doi.org/10.1186/s40537-023-00748-x doi: 10.1186/s40537-023-00748-x |
[31] | Iftikhar H, Zafar A, Turpo-Chaparro J E, et al. (2023) Forecasting Day-Ahead Brent Crude Oil Prices Using Hybrid Combinations of Time Series Models. Mathematics 16: 3548. https://doi.org/10.3390/math11163548 doi: 10.3390/math11163548 |
[32] | Iftikhar H, Bibi N, Canas Rodrigues P, et al. (2023) Multiple Novel Decomposition Techniques for Time Series Forecasting: Application to Monthly Forecasting of Electricity Consumption in Pakistan. Energies 16: 2579. https://doi.org/10.3390/en16062579 doi: 10.3390/en16062579 |
[33] | Alshanbari H M, Iftikhar H, Khan F, et al. (2023). On the Implementation of the Artificial Neural Network Approach for Forecasting Different Healthcare Events. Diagnostics 13: 1310. https://doi.org/10.3390/diagnostics13071310 doi: 10.3390/diagnostics13071310 |
[34] | Iftikhar H (2018) Modeling and Forecasting Complex Time Series: A Case of Electricity Demand. Master's Thesis, Quaidi-Azam University, Islamabad, Pakistan, 1-94. |
[35] | Shah I, Iftikhar H, Ali S (2020) Modeling and forecasting medium-term electricity consumption using component estimation technique. Forecasting 2: 163–179. https://doi.org/10.3390/forecast2020009 doi: 10.3390/forecast2020009 |
[36] | Iftikhar H, Turpo-Chaparro J E, Canas Rodrigues P, et al. (2023). Day-Ahead Electricity Demand Forecasting Using a Novel Decomposition Combination Method. Energies 16: 6675. https://doi.org/10.3390/en16186675 doi: 10.3390/en16186675 |
[37] | Shah I, Iftikhar H, Ali S, et al. (2019) Short-term electricity demand forecasting using components estimation technique. Energies 12: 2532. https://doi.org/10.3390/en12132532 doi: 10.3390/en12132532 |
[38] | Van Buuren S, Oudshoorn C G (2000) Multivariate imputation by chained equations. |
[39] | Iftikhar H, Turpo-Chaparro J E, Canas Rodrigues P, et al. (2023). Forecasting Day-Ahead Electricity Prices for the Italian Electricity Market Using a New Decomposition—Combination Technique. Energies 16: 6669. https://doi.org/10.3390/en16186669 doi: 10.3390/en16186669 |
[40] | Diebold F X, Mariano R S (2022) Comparing predictive accuracy. J Bus Econ Stat 20: 134–144. https://doi.org/10.1198/073500102753410444 doi: 10.1198/073500102753410444 |
[41] | Iftikhar H, Khan M, Khan M S, et al. (2023). Short-Term Forecasting of Monkeypox Cases Using a Novel Filtering and Combining Technique. Diagnostics 13: 1923. https://doi.org/10.3390/diagnostics13111923 doi: 10.3390/diagnostics13111923 |
[42] | Shah I, Iftikhar H, Ali S (2022) Modeling and forecasting electricity demand and prices: A comparison of alternative approaches. J Math 2022: 3581037. https://doi.org/10.1155/2022/3581037 doi: 10.1155/2022/3581037 |
[43] | Iftikhar H, Daniyal M, Qureshi M, et al. (2023). A hybrid forecasting technique for infection and death from the mpox virus. Digit Health 9: 20552076231204748. https://doi.org/10.1177/20552076231204748 doi: 10.1177/20552076231204748 |
[44] | Dickey D A, Fuller W A (1979) Distribution of the estimators for autoregressive time series with a unit root. J Am Stat Assoc 74: 427–431. https://doi.org/10.1080/01621459.1979.10482531 doi: 10.1080/01621459.1979.10482531 |
[45] | Romero Y, Diaz C, Meldrum I, et al. (2020) Temporal and spatial analysis of traffic–Related pollutant under the influence of the seasonality and meteorological variables over an urban city in Peru. Heliyon https://doi.org/10.1016/j.heliyon.2020.e04029 |
[46] | Leon C A M, Felix M F M, Olivera C A C, et al (2022) Influence of Social Confinement by COVID-19 on Air Quality in the District of San 503 Juan de Lurigancho in Lima, Perù. Chem Eng Trans 91: 475–480. |
[47] | Aaker D A, Jacobson R (1987). The sophistication of 'naive'modeling. Int J Forecast 3: 449-451. https://doi.org/10.1016/0169-2070(87)90039-2 doi: 10.1016/0169-2070(87)90039-2 |
[48] | Gonzales Javier L L, Calili Rodrigo F, Souza Reinaldo C, et al. (2016) Simulation of the energy efficiency auction prices in Brazil. Renew Energ Power Qual J 1: 574-579. https://doi.org/10.24084/repqj14.396 doi: 10.24084/repqj14.396 |
[49] | López-Gonzales J L, Souza RC, Da Silva FLC, et al. (2020) Simulation of the energy efficiency auction prices via the markov chain monte carlo method. Energies 13: 4544. https://doi.org/10.3390/en13174544 doi: 10.3390/en13174544 |
[50] | da Silva KLS, López-Gonzales J L, Turpo-Chaparro JE, et al. Spatio-temporal visualization and forecasting of PM10 in the Brazilian state of Minas Gerais. Sci Rep 13: 3269. https://doi.org/10.1038/s41598-023-30365-w |
[51] | Jeldes N, Ibacache-Pulgar G, Marchant C, et al. (2022) Modeling Air Pollution Using Partially Varying Coefficient Models with Heavy Tails. Mathematics 10: 3677. https://doi.org/10.3390/math10193677 doi: 10.3390/math10193677 |
[52] | Cabello-Torres RJ, Estela MAP, Sánchez-Ccoyllo O, et al. (2022) Statistical modeling approach for PM10 prediction before and during confinement by COVID-19 in South Lima, Perú. Sci Rep 12: 1. https://doi.org/10.1038/s41598-022-20904-2 doi: 10.1038/s41598-022-20904-2 |
[53] | Orrego Granados D, Ugalde J, Salas R, et al. (2022) Visual-Predictive Data Analysis Approach for the Academic Performance of Students from a Peruvian University. Appl Sci 12: 11251. https://doi.org/10.3390/app122111251 doi: 10.3390/app122111251 |