A random forest partition model for predicting NO₂ concentrations from traffic flow and meteorological conditions.

Author: Kamińska JA; Department of Mathematics, Wroclaw University of Environmental and Life Sciences, ul. Grunwaldzka 53, 50-357 Wrocław, Poland. Electronic address: joanna.kaminska@upwr.edu.pl.
Language: English
Source: The Science of the total environment [Sci Total Environ] 2019 Feb 15; Vol. 651 (Pt 1), pp. 475-483. Date of Electronic Publication: 2018 Sep 17.
DOI: 10.1016/j.scitotenv.2018.09.196
Abstract: High concentrations of nitrogen dioxide in the air, particularly in heavily urbanised areas, have an adverse effect on many aspects of residents' health (short-term and long-term damage, unpleasant odour, and other effects). A method is proposed for modelling atmospheric NO₂ concentrations in a conurbation, using a partition model M consisting of two separate models: M_L for lower concentration values and M_U for upper values. The models are built with random forests, an advanced data mining technique. This is a machine learning method that combines information from multiple random trees. Using data recorded in Wrocław (Poland) in 2015-2017 as an example, an iterative method was applied to determine the boundary concentration ỹ for which the mean absolute deviation error of the partition model attained its lowest value. The resulting model had an R² value of 0.82, compared with 0.60 for a classical random forest model. The importances of the variables in the model M_L, as in the classical case, indicate that the greatest influence on NO₂ concentrations comes from traffic flow, followed by meteorological factors, in particular wind direction and speed. In the model M_U the importances of the variables are significantly different: while traffic flow still has the greatest impact, the effects of temperature and relative humidity are almost as great. This confirms the justifiability of constructing separate models for low and high pollution concentrations.
(Copyright © 2018 Elsevier B.V. All rights reserved.)
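
The boundary search described in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the paper's implementation. Several things here are assumptions: a simple nearest-neighbour averager (`fit_knn`) stands in for the random forests, the data are synthetic with a single hypothetical `flow` feature, the mean absolute deviation error is taken to be the mean of absolute differences between observed and predicted values, and new observations are routed to M_L or M_U by a global model's preliminary prediction, since the abstract does not say how routing is done. All function names are hypothetical.

```python
import random
from statistics import mean


def fit_knn(xs, ys, k=5):
    """Stand-in regressor (NOT a random forest): mean y of the k nearest x."""
    data = list(zip(xs, ys))

    def predict(x):
        nearest = sorted(data, key=lambda p: abs(p[0] - x))[:k]
        return mean(y for _, y in nearest)

    return predict


def fit_partition(xs, ys, boundary, fit=fit_knn):
    """Fit M_L on targets <= boundary and M_U on targets > boundary."""
    low = [(x, y) for x, y in zip(xs, ys) if y <= boundary]
    high = [(x, y) for x, y in zip(xs, ys) if y > boundary]
    if not low or not high:
        raise ValueError("boundary leaves one partition empty")
    m_l = fit(*zip(*low))
    m_u = fit(*zip(*high))
    # Assumption: a global model's prediction decides which sub-model applies.
    router = fit(xs, ys)

    def predict(x):
        return m_l(x) if router(x) <= boundary else m_u(x)

    return predict


def made(model, xs, ys):
    """Mean absolute difference between observed and predicted values."""
    return mean(abs(y - model(x)) for x, y in zip(xs, ys))


def search_boundary(train, valid, candidates, fit=fit_knn):
    """Try each candidate boundary; keep the one with the lowest validation error."""
    results = {}
    for b in candidates:
        model = fit_partition(train[0], train[1], b, fit)
        results[b] = made(model, valid[0], valid[1])
    best = min(results, key=results.get)
    return best, results


# Synthetic data (assumption): concentration rises with a single "flow" feature.
rng = random.Random(0)
flow = [rng.uniform(0.0, 1.0) for _ in range(300)]
no2 = [20.0 + 60.0 * f + rng.gauss(0.0, 5.0) for f in flow]
train = (flow[:200], no2[:200])
valid = (flow[200:], no2[200:])

candidates = [35.0, 45.0, 55.0, 65.0]
best_boundary, errors = search_boundary(train, valid, candidates)
print("error per candidate boundary:", errors)
print("selected boundary:", best_boundary)
```

To get closer to the setting in the abstract, the `fit` argument could be replaced by a wrapper around a random forest regressor from a machine learning library, and the candidate list by a finer grid of concentrations. The routing rule would still need to be chosen explicitly.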
Database: MEDLINE