TY - JOUR
T1 - Estimating monthly PM2.5 concentrations from satellite remote sensing data, meteorological variables, and land use data using ensemble statistical modeling and a random forest approach
AU - Chen, Chu Chih
AU - Wang, Yin Ru
AU - Yeh, Hung Yi
AU - Lin, Tang Huang
AU - Huang, Chun Sheng
AU - Wu, Chang Fu
N1 - Publisher Copyright:
© 2021 Elsevier Ltd
PY - 2021/12/15
Y1 - 2021/12/15
N2 - Fine particulate matter (PM2.5) is associated with various adverse health outcomes and poses serious concerns for public health. However, ground monitoring stations for PM2.5 measurements are mostly installed in population-dense or urban areas. Thus, satellite retrieved aerosol optical depth (AOD) data, which provide spatial and temporal surrogates of exposure, have become an important tool for PM2.5 estimates in a study area. In this study, we used AOD estimates of surface PM2.5 together with meteorological and land use variables to estimate monthly PM2.5 concentrations at a spatial resolution of 3 km2 over Taiwan Island from 2015 to 2019. An ensemble two-stage estimation procedure was proposed, with a generalized additive model (GAM) for temporal-trend removal in the first stage and a random forest model used to assess residual spatiotemporal variations in the second stage. We obtained a model-fitting R2 of 0.98 with a root mean square error (RMSE) of 1.40 μg/m3. The leave-one-out cross-validation (LOOCV) R2 with seasonal stratification was 0.82, and the RMSE was 3.85 μg/m3, whereas the R2 and RMSE obtained by using the pure random forest approach produced R2 and RMSE values of 0.74 and 4.60 μg/m3, respectively. The results indicated that the ensemble modeling approach had a higher predictive ability than the pure machine learning method and could provide reliable PM2.5 estimates over the entire island, which has complex terrain in terms of land use and topography.
AB - Fine particulate matter (PM2.5) is associated with various adverse health outcomes and poses serious concerns for public health. However, ground monitoring stations for PM2.5 measurements are mostly installed in population-dense or urban areas. Thus, satellite retrieved aerosol optical depth (AOD) data, which provide spatial and temporal surrogates of exposure, have become an important tool for PM2.5 estimates in a study area. In this study, we used AOD estimates of surface PM2.5 together with meteorological and land use variables to estimate monthly PM2.5 concentrations at a spatial resolution of 3 km2 over Taiwan Island from 2015 to 2019. An ensemble two-stage estimation procedure was proposed, with a generalized additive model (GAM) for temporal-trend removal in the first stage and a random forest model used to assess residual spatiotemporal variations in the second stage. We obtained a model-fitting R2 of 0.98 with a root mean square error (RMSE) of 1.40 μg/m3. The leave-one-out cross-validation (LOOCV) R2 with seasonal stratification was 0.82, and the RMSE was 3.85 μg/m3, whereas the R2 and RMSE obtained by using the pure random forest approach produced R2 and RMSE values of 0.74 and 4.60 μg/m3, respectively. The results indicated that the ensemble modeling approach had a higher predictive ability than the pure machine learning method and could provide reliable PM2.5 estimates over the entire island, which has complex terrain in terms of land use and topography.
KW - Aerosol optical depth
KW - Generalized additive model
KW - Inverse distance weighting
KW - Land use regression
KW - Leave-one-out cross-validation
UR - http://www.scopus.com/inward/record.url?scp=85115010456&partnerID=8YFLogxK
U2 - 10.1016/j.envpol.2021.118159
DO - 10.1016/j.envpol.2021.118159
M3 - 期刊論文
C2 - 34543952
AN - SCOPUS:85115010456
SN - 0269-7491
VL - 291
JO - Environmental Pollution
JF - Environmental Pollution
M1 - 118159
ER -