Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A Bayesian approach to forecasting daily air-pollutant levels

Published: 01 December 2018 Publication History

Abstract

Forecasting air-pollutant levels is an important issue, due to their adverse effects on public health, and often a legislative necessity. The advantage of Bayesian methods is their ability to provide density predictions which can easily be transformed into ordinal or binary predictions given a set of thresholds. We develop a Bayesian approach to forecasting PM$$_{10}$$10 and O$$_3$$3 levels that efficiently deals with extensive amounts of input parameters, and test whether it outperforms classical models and experts. The new approach is used to fit models for PM$$_{10}$$10 and O$$_3$$3 level forecasting that can be used in daily practice. We also introduce a novel approach for comparing models to experts based on estimated cost matrices. The results for diverse air quality monitoring sites across Slovenia show that Bayesian models outperform classical models in both PM$$_{10}$$10 and O$$_3$$3 predictions. The proposed models perform better than experts in PM$$_{10}$$10 and are on par with experts in O$$_3$$3 predictions--where experts already base their predictions on predictions from a statistical model. A Bayesian approach--especially using Gaussian processes--offers several advantages: superior performance, robustness to overfitting, more information, and the ability to efficiently adapt to different cost matrices.

References

[1]
Aldrin M, Haff IH (2005) Generalised additive modelling of air pollution, traffic volume and meteorology. Atmos Environ 39(11):2145---2155
[2]
Biancofiore F, Busilacchio M, Verdecchia M, Tomassetti B, Aruffo E, Bianco S, Di Tommaso S, Colangeli C, Rosatelli G, Di Carlo P (2017) Recursive neural network model for analysis and forecast of PM10 and PM2.5. Atmos Pollut Res 4:652---659
[3]
Breiman L (2001) Random forests. Mach Learn 45(1):5---32
[4]
Brunelli U, Piazza V, Pignato L, Sorbello F, Vitabile S (2007) Two-days ahead prediction of daily maximum concentrations of SO$$_2$$2, O$$_3$$3, PM$$_{10}$$10, NO$$_{2}$$2, CO in the urban area of Palermo, Italy. Atmos Environ 41(14):2967---2995
[5]
Chaloulakou A, Grivas G, Spyrellis N (2003) Neural network and multiple regression models for PM10 prediction in Athens: a comparative assessment. J Air Waste Manag Assoc 53(10):1183---1190
[6]
Chelani AB (2010) Prediction of daily maximum ground ozone concentration using support vector machine. Environ Monit Assess 162(1):169---176
[7]
Cobourn WG, Dolcine L, French M, Hubbard MC (2000) A comparison of nonlinear regression and neural network models for ground-level ozone forecasting. J Air Waste Manag Assoc 50(11):1999---2009
[8]
Cocchi D, Greco F, Trivisano C (2007) Hierarchical space-time modelling of PM$$_{10}$$10 pollution. Atmos Environ 41(3):532---542
[9]
de Gennaro G, Trizio L, Di Gilio A, Pey J, Pérez N, Cusack M, Alastuey A, Querol X (2013) Neural network model for the prediction of PM 10 daily concentrations in two sites in the Western Mediterranean. Sci Total Environ 463:875---883
[10]
Dutot A-L, Rynkiewicz J, Steiner FE, Rude J (2007) A 24-h forecast of ozone peaks and exceedance levels using neural classifiers and weather predictions. Environ Model Softw 22(9):1261---1269
[11]
Dye, T. S (2003) Guidelines for developing an air quality (Ozone and PM2. 5) forecasting program
[12]
Elkan, C (2001) The foundations of cost-sensitive learning. In: International joint conference on artificial intelligence, vol 17. Lawrence Erlbaum Associates Ltd, pp 973---978
[13]
Council European (2005) Directive 2004/107/EC of the European Parliament and of the Council of 15 December 2004 relating to arsenic, cadmium, mercury, nickel and polycyclic aromatic hydrocarbons in ambient air. Off J L 23(26/01):0003---0016
[14]
EC Directive (2008) Directive 2008/50/EC of the European Parliament and of the Council of 21 May 2008 on Ambient air quality and cleaner air for Europe. Off J L152:1---44
[15]
European Environment Agency (2014) Air quality in Europe-2016 report. European Environment Agency, Copenhagen
[16]
Friedman J, Hastie T, Tibshirani R (2009) glmnet: Lasso and elastic-net regularized generalized linear models. R Package. https://cran.r-project.org/web/packages/glmnet/index.html
[17]
Gomez-Sanchis J, Martín-Guerrero JD, Soria-Olivas E, Vila-Frances J, Carrasco JL, del Valle-Tascón S (2006) Neural networks for analysing the relevance of input variables in the prediction of tropospheric ozone concentration. Atmos Environ 40(32):6173---6180
[18]
Gramacy RB (2010) monomvn: Estimation for multivariate normal and Student-t data with monotone missingness-Monomvn package manual. R Package
[19]
Grivas G, Chaloulakou A (2006) Artificial neural network models for prediction of PM10 hourly concentrations, in the Greater Area of Athens, Greece. Atmos Environ 40(7):1216---1229
[20]
Grošelj N, Zupan J, Reich S, Dawidowski L, Gomez D, Magallanes J (2004) 2D mapping by Kohonen networks of the air quality data from a large city. J Chem Inf Comput Sci 44(2):339---346
[21]
Hooyberghs J, Mensink C, Dumont G, Fierens F, Brasseur O (2005) A neural network forecast for daily average PM10 concentrations in Belgium. Atmos Environ 39(18):3279---3289
[22]
Horányi A, Ihász I, Radnóti G (1996) ARPEGE/ALADIN: a numerical weather prediction model for Central-Europe with the participation of the Hungarian Meteorological Service. Idojárás 100(4):277---301
[23]
Kalaitzis A, Honkela A, Gao P, Lawrence ND (2013) gptk: Gaussian processes tool-kit. URL http://CRAN. R-project. org/package= gptk, R package version 1
[24]
Kibria BG, Sun L, Zidek JV, Le ND (2002) Bayesian spatial prediction of random space-time fields with application to mapping PM2.5 exposure. J Am Stat Assoc 97(457):112---124
[25]
Liaw, A, Wiener, M (2002) Classification and regression by randomforest. R News 2 (3): 18---22. URL: http://CRAN.R-project.org/doc/Rnews
[26]
Lu W-Z, Wang D (2008) Ground-level ozone prediction by support vector machine approach with a cost-sensitive classification scheme. Sci Total Environ 395(2):109---116
[27]
McMillan N, Bortnick SM, Irwin ME, Berliner LM (2005) A hierarchical Bayesian model to estimate and forecast ozone through space and time. Atmos Environ 39(8):1373---1382
[28]
Molteni F, Buizza R, Palmer TN, Petroliagis T (1996) The ECMWF ensemble prediction system: methodology and validation. Quart J R Meteorol Soc 122(529):73---119
[29]
Muñoz E, Martin M, Turias I, Jimenez-Come M, Trujillo F (2014) Prediction of pm10 and so2 exceedances to control air pollution in the bay of algeciras, spain. Stoch Env Res Risk Assess 28(6):1409---1420
[30]
Papanastasiou D, Melas D, Kioutsioukis I (2007) Development and assessment of neural network and multiple regression models in order to predict PM10 levels in a medium-sized Mediterranean city. Water Air Soil Pollut 182(1---4):325---334
[31]
Park T, Casella G (2008) The Bayesian Lasso. J Am Stat Assoc 103(482):681---686
[32]
Perez P, Reyes J (2002) Prediction of maximum of 24-h average of PM10 concentrations 30h in advance in Santiago, Chile. Atmos Environ 36(28):4555---4561
[33]
Perez P, Reyes J (2006) An integrated neural network model for PM10 forecasting. Atmos Environ 40(16):2845---2851
[34]
Petelin D, Grancharova A, Kocijan J (2013) Evolving Gaussian process models for prediction of ozone concentration in the air. Simul Model Pract Theory 33:68---80
[35]
Poggi J-M, Portier B (2011) PM10 forecasting using clusterwise regression. Atmos Environ 45(38):7005---7014
[36]
Salazar-Ruiz E, Ordieres J, Vergara E, Capuz-Rizo S (2008) Development and comparative analysis of tropospheric ozone prediction models using linear and artificial intelligence-based models in Mexicali, Baja California (Mexico) and Calexico, California (US). Environ Model Softw 23(8):1056---1069
[37]
Slini T, Kaprara A, Karatzas K, Moussiopoulos N (2006) PM$$_{10}$$10 forecasting for Thessaloniki, Greece. Environ Model Softw 21(4):559---565
[38]
Solaiman T, Coulibaly P, Kanaroglou P (2008) Ground-level ozone forecasting using data-driven methods. Air Qual Atmos Health 1(4):179---193
[39]
Stadlober E, Hörmann S, Pfeiler B (2008) Quality and performance of a PM10 daily forecasting model. Atmos Environ 42(6):1098---1109
[40]
Suárez Sánchez A, García Nieto PJ, Fernández PR, del Coz Díaz J, Iglesias-Rodríguez FJ (2011) Application of an SVM-based regression model to the air quality study at local scale in the Avilés urban area (Spain). Math Comput Modell 54(5):1453---1466
[41]
Suárez Sánchez A, García Nieto PJ, Iglesias-Rodríguez FJ, Vilán JA (2013) Nonlinear air quality modeling using support vector machines in Gijón urban area (Northern Spain) at local scale. Int J Nonlinear Sci Numer Simul 14(5):291---305
[42]
Sucar LE, Pérez-Brito J, Ruiz-Suárez JC, Morales E (1997) Learning structure from data and its application to ozone prediction. Appl Intell 7(4):327---338
[43]
Sun L, Zidek JV, Le ND, Özkaynak H (2000) Interpolating Vancouver's daily ambient PM10 field. Environmetrics 11(6):651---663
[44]
Taheri Shahraiyni H, Sodoudi S (2016) Statistical modeling approaches for PM10 prediction in urban areas; a review of 21st-century studies. Atmosphere 7(2):15
[45]
Ul-Saufie AZ, Yahaya AS, Ramli NA, Rosaida N, Hamid HA (2013) Future daily PM10 concentrations prediction by combining regression models and feedforward backpropagation models with principle component analysis (PCA). Atmos Environ 77:621---630
[46]
Varadhan R, Borchers HW (2011) dfoptim: Derivative-free Optimization. R package. https://cran.r-roject.org/web/packages/monomvn/index.html
[47]
Wang D, Lu W-Z (2006) Ground-level ozone prediction using multilayer perceptron trained with an innovative hybrid approach. Ecol Model 198(3):332---340
[48]
World Health Organization (2016) Ambient (outdoor) air quality and health. http://www.who.int/mediacentre/factsheets/fs313/en/. Accessed 16 May 2017
[49]
Yi J, Prybutok VR (1996) A neural network model forecasting for prediction of daily maximum ozone concentration in an industrialized urban area. Environ Pollut 92(3):349---357
[50]
Zhang H, Zhang W, Palazoglu A, Sun W (2012) Prediction of ozone levels using a Hidden Markov Model (HMM) with Gamma distribution. Atmos Environ 62:64---73
[51]
Zhang K, Fan W (2008) Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond. Knowl Inf Syst 14(3):299---326
[52]
Zickus M, Greig A, Niranjan M (2002) Comparison of four machine learning methods for predicting PM10 concentrations in Helsinki, Finland. Water Air Soil Pollut Focus 2(5---6):717---729
[53]
Zolghadri A, Monsion M, Henry D, Marchionini C, Petrique O (2004) Development of an operational model-based warning system for tropospheric ozone concentrations in Bordeaux, France. Environ Model Softw 19(4):369---382

Cited By

View all
  • (2022)Air quality forecasting with artificial intelligence techniquesEnvironmental Modelling & Software10.1016/j.envsoft.2022.105329149:COnline publication date: 9-May-2022
  • (2021)Bayesian combination of probabilistic classifiers using multivariate normal mixturesThe Journal of Machine Learning Research10.5555/3322706.336199220:1(1892-1909)Online publication date: 9-Mar-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Knowledge and Information Systems
Knowledge and Information Systems  Volume 57, Issue 3
December 2018
242 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 December 2018

Author Tags

  1. Air pollutants
  2. Bayesian statistics
  3. Cost matrix
  4. Forecasting
  5. Gaussian processes
  6. Machine learning

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Air quality forecasting with artificial intelligence techniquesEnvironmental Modelling & Software10.1016/j.envsoft.2022.105329149:COnline publication date: 9-May-2022
  • (2021)Bayesian combination of probabilistic classifiers using multivariate normal mixturesThe Journal of Machine Learning Research10.5555/3322706.336199220:1(1892-1909)Online publication date: 9-Mar-2021

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media