Local exposure misclassification in national models: relationships with urban infrastructure and demographics.

Autor: Chambliss SE; Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX, 78712, USA. sechambliss@utexas.edu., Campmier MJ; Department of Civil and Environmental Engineering, University of California, Berkeley, Berkeley, CA, 94720, USA., Audirac M; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA., Apte JS; Department of Civil and Environmental Engineering, University of California, Berkeley, Berkeley, CA, 94720, USA.; School of Public Health, University of California, Berkeley, Berkeley, CA, 94720, USA., Zigler CM; Department of Statistics and Data Sciences, The University of Texas at Austin, Austin, TX, 78712, USA.
Jazyk: angličtina
Zdroj: Journal of exposure science & environmental epidemiology [J Expo Sci Environ Epidemiol] 2024 Sep; Vol. 34 (5), pp. 761-769. Date of Electronic Publication: 2023 Dec 22.
DOI: 10.1038/s41370-023-00624-z
Abstrakt: Background: National-scale linear regression-based modeling may mischaracterize localized patterns, including hyperlocal peaks and neighborhood- to regional-scale gradients. For studies focused on within-city differences, this mischaracterization poses a risk of exposure misclassification, affecting epidemiological and environmental justice conclusions.
Objective: Characterize the difference between intraurban pollution patterns predicted by national-scale land use regression modeling and observation-based estimates within a localized domain and examine the relationship between that difference and urban infrastructure and demographics.
Methods: We compare highly resolved (0.01 km 2 ) observations of NO 2 mixing ratio and ultrafine particle (UFP) count obtained via mobile monitoring with national model predictions in thirteen neighborhoods in the San Francisco Bay Area. Grid cell-level divergence between modeled and observed concentrations is termed "localized difference." We use a flexible machine learning modeling technique, Bayesian Additive Regression Trees, to investigate potentially nonlinear relationships between discrepancy between localized difference and known local emission sources as well as census block group racial/ethnic composition.
Results: We find that observed local pollution extremes are not represented by land use regression predictions and that observed UFP count significantly exceeds regression predictions. Machine learning models show significant nonlinear relationships among localized differences between predictions and observations and the density of several types of pollution-related infrastructure (roadways, commercial and industrial operations). In addition, localized difference was greater in areas with higher population density and a lower share of white non-Hispanic residents, indicating that exposure misclassification by national models differs among subpopulations.
Impact: Comparing national-scale pollution predictions with hyperlocal observations in the San Francisco Bay Area, we find greater discrepancies near major roadways and food service locations and systematic underestimation of concentrations in neighborhoods with a lower share of non-Hispanic white residents. These findings carry implications for using national-scale models in intraurban epidemiological and environmental justice applications and establish the potential utility of supplementing large-scale estimates with publicly available urban infrastructure and pollution source information.
(© 2023. The Author(s).)
Databáze: MEDLINE