Abstract
Despite the existing public and government measures for monitoring and control of air quality in Bulgaria, in many regions, including typical and most numerous small towns, air quality is not satisfactory. In this paper, factor analysis and Box–Jenkins methodology are applied to examine concentrations of primary air pollutants such as NO, NO2, NOx, PM10, SO2 and ground level O3 in the town of Blagoevgrad, Bulgaria within a 1 year period from 1st September 2011 to 31st August 2012, based on hourly measurements. By using factor analysis with PCA and Promax rotation, a high multicollinearity between the six pollutants has been detected. The pollutants were grouped in three factors and the degree of contribution of the factors to the overall pollution was determined. This was interpreted as the presence of common sources of pollution. The main part of the study involves the performance of time series analysis and the development of univariate stochastic seasonal autoregressive integrated moving average (ARIMA) models with recording on a hourly basis as seasonality. The study also incorporates the Yeo–Johnson power transformation for variance stabilizing of the data and model selection by using Bayersian information criterion. The obtained SARIMA models demonstrated very good fitting performance with regard to the observed air pollutants and short-term predictions for 72 h ahead, in particular in the case of ozone and particulate matter PM10. The presented statistical approaches allow the building of non-complex models, effective for short-term air pollution forecasting and useful for advance warning purposes in urban areas.
Similar content being viewed by others
References
Air Quality Standards (2013) European Commission. Environment. http://ec.europa.eu/environment/air/quality/standards.htm. Accessed 26 April 2013
Blagoevgrad (2013) Wikipedia. http://en.wikipedia.org/wiki/Blagoevgrad. Accessed 26 April 2013
Blifford IH Jr, Meeker GO (1967) A factor analysis model of large scale pollution. Atmos Environ 1:147–157. doi:10.1016/0004-6981(67)90042-X
Box GEP, Cox DR (1964) An analysis of transformations. J R Stat Soc Ser B 26:211–252
Box GEP, Jenkins GM (1976) Time series analysis, forecasting and control, revized edn. Holden Day, San Francisco
Brunelli U, Piazza V, Pignato L, Sorbello F, Vitabile S (2007) Two-days ahead prediction of daily maximum concentrations of SO2, O3, PM10, NO2, CO in the urban area of Palermo, Italy. Atmos Environ 41(11):2967–2995. doi:10.1016/j.atmosenv.2006.12.013
Burnham KP, Anderson DR (2002) Model selection and inference: a practical information-theoretic approach, 2nd edn. Springer, New York
Chatfield C (1996) The analysis of time series: an introduction. Chapman & Hall/CRC, Boca Raton
Chatfield C (2000) Time-series forecasting. Chapman & Hall CRC, Boca Raton
Comparison of Statistical Packages. http://en.wikipedia.org/wiki/Comparison_of_statistical_packages. Accessed 15 July 2013
Díaz-Robles LA, Ortega JC, Fu JS, Reed GD, Chow JC, Watson JG, Moncada-Herrera JA (2008) A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: the case of Temuco, Chile. Atmos Environ 42(35):8331–8340. doi:10.1016/j.atmosenv.2008.07.020
Dimov I, Georgieva R, Ivanovska S, Ostromsky T, Zlatev Z (2010) Studying the sensitivity of pollutants’ concentrations caused by variations of chemical rates. J Comput Appl Math 235:391–402. doi:10.1016/j.cam.2010.05.041
Dimov I, Georgieva R, Ostromsky T, Zlatev Z (2013) Advanced algorithms for multidimensional sensitivity studies of large-scale air pollution models based on Sobol sequences. Comput Math Appl 65(3):338–351. doi:10.1016/j.camwa.2012.07.005
Directive 2008/50/EC of the European Parliament and of the council of 21 May 2008 on ambient air quality and cleaner air for Europe (2008) Official Journal of the European Union L 152/1. http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2008:152:0001:0044:EN:PDF. Accessed 26 April 2013
Dueñas C, Fernández MC, Cañete S, Carretero J, Liger E (2005) Stochastic model to forecast ground-level ozone concentration at urban and rural areas. Chemosphere 61(10):1379–1389. doi:10.1016/j.chemosphere.2005.04.079
EEA Daily Bulletin for air quality in the country (2013) EEA—Executive environment agency, National system for realtime air quality control in Bulgaria. http://pdbase.government.bg/airq/bulletin-en.jsp. Accessed 26 April 2013
EEA Reports, Bulletins (2012) EEA—Executive environment agency. http://eea.government.bg/en/output/index.html. Accessed 26 April 2013
EViews 7 for Windows (2013) http://www.eviews.com. Accessed 26 April 2013
Gardner MW, Dorling SR (1999) Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London. Atmos Environ 33(5):709–719. doi:10.1016/S1352-2310(98)00230-1
Henry RC, Hidy GM (1979) Multivariate analysis of particulate sulfate and other air quality variables by principal components—part I: annual data from Los Angeles and New York. Atmos Environ 13(11):1581–1596. doi:10.1016/0004-6981(79)90068-4
Huang J, Ho M, Du P (2011) Assessment of temporal and spatial variation of coastal water quality and source identification along Macau peninsula. Stoch Environ Res Risk A 25(3):353–361. doi:10.1007/s00477-010-0373-4
ISO/IEC 17025 (2013) Wikipedia. http://en.wikipedia.org/wiki/ISO/IEC_17025. Accessed 26 April 2013
Ivanov A, Voynikova D, Gocheva-Ilieva S, Boyadzhiev D (2012) Parametric time series analysis of daily air pollutants of city of Shumen, Bulgaria. AIP Conf Proc 1487:386–396. American Institute of Physics, Melville, NY. doi:10.1063/1.4758982
Jacobson MZ (2005) Fundamentals of atmospheric modeling, 2nd edn. Cambridge Univ. Press, Cambridge
Jian L, Zhao Y, Zhu YP, Zhang MB, Bertolatti D (2012) An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. Sci Total Environ 426:336–345. doi:10.1016/j.scitotenv.2012.03.025
Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer, New York
Kaplunovsky AS (2005) Factor analysis in environmental studies. HAIT J Sci Eng B 2(1–2):54–94
Kim SE (2010) Tree-based threshold modeling for short-term forecast of daily maximum ozone level. Stoch Environ Res Risk A 24(1):19–28. doi:10.1007/s00477-008-0295-6
Kim SE, Kumar A (2005) Accounting seasonal nonstationarity in time series models for short-term ozone level forecast. Stoch Environ Res Risk A 19:241–248. doi:10.1007/s00477-004-0228-y
Kim J, Mueller CW (1986) Factor analysis: statistical methods and practical issue. Sage, Beverly Hills
Ko FWS, Tam W, Wong TW, Chan DPS, Tung AH, Lai CKW, Hui DSC (2007) Temporal relationship between air pollutants and hospital admissions for chronic obstructive pulmonary disease in Hong Kong. Thorax 62:780–785. doi:10.1136/thx.2006.076166
Kumar U, Jain VK (2010) ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stoch Environ Res Risk A 24:751–760. doi:10.1007/s00477-009-0361-8
Kumar R, Joseph AE (2006) Air pollution concentrations of PM2.5, PM10 and NO2 at Ambient and Kerbsite and their correlation in Metro City—Mumbai. Environ Monit Asses 119(1–3):191–199. doi:10.1007/s10661-005-9022-7
Lengyel A, Heberger K, Paksy L, Banhidi O, Rajko R (2004) Prediction of ozone concentration in ambient air using multivariate methods. Chemosphere 57(8):889–896. doi:10.1016/j.chemosphere.2004.07.043
Liddle AR (2008) Information criteria for astrophysical model selection. Cornell University Library, arHiv.org. http://arxiv.org/PS_cache/astro-ph/pdf/0701/0701113v2.pdf. Accessed 26 April 2013
Liu PWG (2009) Simulation of the daily average PM10 concentrations at Ta-Liao with Box–Jenkins time series models and multivariate analysis. Atmos Environ 43:2104–2113. doi:10.1016/j.atmosenv.2009.01.055
McBerthouex P, Brown LC (2002) Statistics for environmental engineers. Lewis Publishers, Boca Raton
Milionis AE, Davies TD (1994) Regression and stochastic models for air pollution. I. Review, comments and suggestions. Atmos Environ 28:2801–2810. doi:10.1016/1352-2310(94)90083-3
National System for Environmental Monitoring, Bulgaria (2013) http://eea.government.bg/en/nsmos/index.html. Accessed 26 April 2013
Pankratz A (1983) Forecasting with univariate Box–Jenkins models: concepts and cases. Wiley, New York
Petelin D, Grancharova A, Kocijana J (2013) Evolving Gaussian process models for prediction of ozone concentration in the air. Simul Model Pract Th (EUROSIM 2010), 33:68–80. doi:http://dx.doi.org/10.1016/j.simpat.2012.04.005
Polydoras GN, Anagnostopoulos JS, Bergeles GCh (1998) Air quality predictions: dispersion model vs Box-Jenkins stochastic models. An implementation and comparison for Athens, Greece. Appl Therm Eng 18(11):1037–1048. doi:10.1016/S1359-4311(98)00016-7
Program for reducing the harmful air emission of PM10 in the territory of Blagoevgrad (2011) Municipality of Blagoevgrad. http://eea.government.bg/bg/nsmos/air/roukav/obshtini2/KAV-Blagoevgrad.doc (in Bulgarian)
Richman MB (1986) Rotation of principal components. J Climatol 6:293–335. doi:10.1002/joc.3370060305
Said ES, Dickey DA (1984) Testing for unit roots in autoregressive-moving average models of unknown order. Biometrika 71(3):599–607. doi:10.1093/biomet/71.3.599
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464. doi:10.1214/aos/1176344136
Sharma P, Chandra A, Kaushik SC (2009) Forecasts using Box–Jenkins models for the ambient air quality data of Delhi City. Environ Monit Assess 157(1–4):105–112. doi:10.1007/s10661-008-0520-2
Slini Th, Karatzas K, Moussiopoulos N (2002) Statistical analysis of environmental data as the basis of forecasting: an air quality application. Sci Total Environ 288:227–237. doi:10.1016/S0048-9697(01)00991-3
SPSS IBM Statistics (2013) http://www-01.ibm.com/software/analytics/spss/. Accessed 26 April 2013
Tabachnik BG, Fidell LS (2005) Using multivariate statistics, 5th edn. Pearson Int. Edition, Boston
Yeo IK, Johnson RA (2000) A new family of power transformations to improve normality or symmetry. Biometrika 87(4):954–959. doi:10.1093/biomet/87.4.954
Zwozdziak A, Samek L, Sowka I, Furman L, Skrętowicz M (2012) Aerosol pollution from small combustors in a village. Sci World J 2012: Article ID 956401. doi:10.1100/2012/956401
Acknowledgments
This paper is published in cooperation with project of the Bulgarian Ministry of Education, Youth and Science, BG051PO001/3.3-05-0001 “Science and business”, financed under Operational program “Human Resources Development” by the European Social Fund and is also partially supported by the Scientific Research Department of Plovdiv University Paisii Hilendarski under Grant NI13 FMI-002.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gocheva-Ilieva, S.G., Ivanov, A.V., Voynikova, D.S. et al. Time series analysis and forecasting for air pollution in small urban area: an SARIMA and factor analysis approach. Stoch Environ Res Risk Assess 28, 1045–1060 (2014). https://doi.org/10.1007/s00477-013-0800-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-013-0800-4
Keywords
- Air quality modeling
- Air pollution forecast
- Factor analysis
- Time series
- SARIMA
- Seasonal Box–Jenkins models
- Univariate stochastic models