A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina

Autores
Diaz Resquin, Melisa; Lichtig, Pablo; Alessandrello, Diego; De Oto, Marcelo; Gómez, Darío; Rossler, Cristina Elena; Castesana, Paula Soledad; Dawidowski, Laura Elena
Año de publicación
2023
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
Having a prediction model for air quality at a low computational cost can be useful for research, forecasting, regulatory, and monitoring applications. This is of particular importance for Latin America, where rapid urbanization has imposed increasing stress on the air quality of almost all cities. In recent years, machine learning techniques have been increasingly accepted as a useful tool for air quality forecasting. Out of these, random forest has proven to be an approach that is both well-performing and computationally efficient while still providing key components reflecting the nonlinear relationships among emissions, chemical reactions, and meteorological effects. In this work, we employed the random forest methodology to build and test a forecasting model for the city of Buenos Aires. We used this model to study the deep decline in most pollutants during the lockdown imposed by the COVID-19 (COronaVIrus Disease 2019) pandemic by analyzing the effects of the change in emissions, while taking into account the changes in the meteorology, using two different approaches. First, we built random forest models trained with the data from before the beginning of the lockdown periods. We used the data to make predictions of the business-as-usual scenario during the lockdown periods and estimated the changes in concentrations by comparing the model results with the observations. This allowed us to assess the combined effects of the particular weather conditions and the reduction in emissions during the period when restrictions were in place. Second, we used random forest with meteorological normalization to compare the observational data from the lockdown periods with the data from the same dates in 2019, thus decoupling the effects of the meteorology from short-term emission changes. This allowed us to analyze the general effect that restrictions similar to those imposed during the pandemic could have on pollutant concentrations, and this information could be useful to design mitigation strategies. The results during testing showed that the model captured the observed hourly variations and the diurnal cycles of these pollutants with a normalized mean bias of less than 6 % and Pearson correlation coefficients of the diurnal variations between 0.64 and 0.91 for all the pollutants considered. Based on the random forest results, we estimated that the lockdown implied relative changes in concentration of up to −45 % for CO, −75 % for NO, −46 % for NO2, −12 % for SO2, and −33 % for PM10 during the strictest mobility restrictions. O3 had a positive relative change in concentration (up to an 80 %) that is consistent with the response in a volatile-organic-compound-limited chemical regime to the decline in NOx emissions. The relative changes estimated using the meteorological normalization technique show mostly smaller changes than those obtained by the random forest predictive model. The relative changes were up to −26 % for CO, up to −47 % for NO, −36 % for NO2, −20 % for PM10, and up to 27 % for O3. SO2 is the only species that had a larger relative change when the meteorology was normalized (up to 20 %). This points out the need for accounting not only for differences in emissions but also in meteorological variables in order to evaluate the lockdown effects on air quality. The findings of this study may be valuable for formulating emission control strategies that do not disregard their implication on secondary pollutants. We believe that the model itself can also be a valuable contribution to a forecasting system in the city and that the general methodology could also be easily applied to other Latin American cities as well. We also provide the first O3 and SO2 observational dataset in more that a decade for a residential area in Buenos Aires, and it is openly available at https://doi.org/10.17632/h9y4hb8sf8.1 (Diaz Resquin et al., 2021).
Fil: Diaz Resquin, Melisa. Centro de Ciencia del Clima y la Resiliencia; Chile. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; Argentina
Fil: Lichtig, Pablo. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; Argentina
Fil: Alessandrello, Diego. Comisión Nacional de Energía Atómica; Argentina
Fil: De Oto, Marcelo. Comisión Nacional de Energía Atómica; Argentina
Fil: Gómez, Darío. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; Argentina
Fil: Rossler, Cristina Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; Argentina
Fil: Castesana, Paula Soledad. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; Argentina. YPF - Tecnología; Argentina
Fil: Dawidowski, Laura Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; Argentina
Materia
Machine leraning
Random Forest
Air Quality
COVID-19
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/218709

id CONICETDig_03b6ca828d19377539ee736c47a63129
oai_identifier_str oai:ri.conicet.gov.ar:11336/218709
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, ArgentinaDiaz Resquin, MelisaLichtig, PabloAlessandrello, DiegoDe Oto, MarceloGómez, DaríoRossler, Cristina ElenaCastesana, Paula SoledadDawidowski, Laura ElenaMachine leraningRandom ForestAir QualityCOVID-19https://purl.org/becyt/ford/1.5https://purl.org/becyt/ford/1Having a prediction model for air quality at a low computational cost can be useful for research, forecasting, regulatory, and monitoring applications. This is of particular importance for Latin America, where rapid urbanization has imposed increasing stress on the air quality of almost all cities. In recent years, machine learning techniques have been increasingly accepted as a useful tool for air quality forecasting. Out of these, random forest has proven to be an approach that is both well-performing and computationally efficient while still providing key components reflecting the nonlinear relationships among emissions, chemical reactions, and meteorological effects. In this work, we employed the random forest methodology to build and test a forecasting model for the city of Buenos Aires. We used this model to study the deep decline in most pollutants during the lockdown imposed by the COVID-19 (COronaVIrus Disease 2019) pandemic by analyzing the effects of the change in emissions, while taking into account the changes in the meteorology, using two different approaches. First, we built random forest models trained with the data from before the beginning of the lockdown periods. We used the data to make predictions of the business-as-usual scenario during the lockdown periods and estimated the changes in concentrations by comparing the model results with the observations. This allowed us to assess the combined effects of the particular weather conditions and the reduction in emissions during the period when restrictions were in place. Second, we used random forest with meteorological normalization to compare the observational data from the lockdown periods with the data from the same dates in 2019, thus decoupling the effects of the meteorology from short-term emission changes. This allowed us to analyze the general effect that restrictions similar to those imposed during the pandemic could have on pollutant concentrations, and this information could be useful to design mitigation strategies. The results during testing showed that the model captured the observed hourly variations and the diurnal cycles of these pollutants with a normalized mean bias of less than 6 % and Pearson correlation coefficients of the diurnal variations between 0.64 and 0.91 for all the pollutants considered. Based on the random forest results, we estimated that the lockdown implied relative changes in concentration of up to −45 % for CO, −75 % for NO, −46 % for NO2, −12 % for SO2, and −33 % for PM10 during the strictest mobility restrictions. O3 had a positive relative change in concentration (up to an 80 %) that is consistent with the response in a volatile-organic-compound-limited chemical regime to the decline in NOx emissions. The relative changes estimated using the meteorological normalization technique show mostly smaller changes than those obtained by the random forest predictive model. The relative changes were up to −26 % for CO, up to −47 % for NO, −36 % for NO2, −20 % for PM10, and up to 27 % for O3. SO2 is the only species that had a larger relative change when the meteorology was normalized (up to 20 %). This points out the need for accounting not only for differences in emissions but also in meteorological variables in order to evaluate the lockdown effects on air quality. The findings of this study may be valuable for formulating emission control strategies that do not disregard their implication on secondary pollutants. We believe that the model itself can also be a valuable contribution to a forecasting system in the city and that the general methodology could also be easily applied to other Latin American cities as well. We also provide the first O3 and SO2 observational dataset in more that a decade for a residential area in Buenos Aires, and it is openly available at https://doi.org/10.17632/h9y4hb8sf8.1 (Diaz Resquin et al., 2021).Fil: Diaz Resquin, Melisa. Centro de Ciencia del Clima y la Resiliencia; Chile. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; ArgentinaFil: Lichtig, Pablo. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; ArgentinaFil: Alessandrello, Diego. Comisión Nacional de Energía Atómica; ArgentinaFil: De Oto, Marcelo. Comisión Nacional de Energía Atómica; ArgentinaFil: Gómez, Darío. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; ArgentinaFil: Rossler, Cristina Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; ArgentinaFil: Castesana, Paula Soledad. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; Argentina. YPF - Tecnología; ArgentinaFil: Dawidowski, Laura Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; ArgentinaCopernicus Publications2023-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/218709Diaz Resquin, Melisa; Lichtig, Pablo; Alessandrello, Diego; De Oto, Marcelo; Gómez, Darío; et al.; A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina; Copernicus Publications; Earth System Science Data; 15; 1; 1-2023; 189-2091866-35081866-3516CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/https://essd.copernicus.org/articles/15/189/2023/info:eu-repo/semantics/altIdentifier/doi/10.5194/essd-15-189-2023info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-10T13:09:12Zoai:ri.conicet.gov.ar:11336/218709instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-10 13:09:12.537CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
title A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
spellingShingle A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
Diaz Resquin, Melisa
Machine leraning
Random Forest
Air Quality
COVID-19
title_short A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
title_full A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
title_fullStr A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
title_full_unstemmed A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
title_sort A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
dc.creator.none.fl_str_mv Diaz Resquin, Melisa
Lichtig, Pablo
Alessandrello, Diego
De Oto, Marcelo
Gómez, Darío
Rossler, Cristina Elena
Castesana, Paula Soledad
Dawidowski, Laura Elena
author Diaz Resquin, Melisa
author_facet Diaz Resquin, Melisa
Lichtig, Pablo
Alessandrello, Diego
De Oto, Marcelo
Gómez, Darío
Rossler, Cristina Elena
Castesana, Paula Soledad
Dawidowski, Laura Elena
author_role author
author2 Lichtig, Pablo
Alessandrello, Diego
De Oto, Marcelo
Gómez, Darío
Rossler, Cristina Elena
Castesana, Paula Soledad
Dawidowski, Laura Elena
author2_role author
author
author
author
author
author
author
dc.subject.none.fl_str_mv Machine leraning
Random Forest
Air Quality
COVID-19
topic Machine leraning
Random Forest
Air Quality
COVID-19
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.5
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv Having a prediction model for air quality at a low computational cost can be useful for research, forecasting, regulatory, and monitoring applications. This is of particular importance for Latin America, where rapid urbanization has imposed increasing stress on the air quality of almost all cities. In recent years, machine learning techniques have been increasingly accepted as a useful tool for air quality forecasting. Out of these, random forest has proven to be an approach that is both well-performing and computationally efficient while still providing key components reflecting the nonlinear relationships among emissions, chemical reactions, and meteorological effects. In this work, we employed the random forest methodology to build and test a forecasting model for the city of Buenos Aires. We used this model to study the deep decline in most pollutants during the lockdown imposed by the COVID-19 (COronaVIrus Disease 2019) pandemic by analyzing the effects of the change in emissions, while taking into account the changes in the meteorology, using two different approaches. First, we built random forest models trained with the data from before the beginning of the lockdown periods. We used the data to make predictions of the business-as-usual scenario during the lockdown periods and estimated the changes in concentrations by comparing the model results with the observations. This allowed us to assess the combined effects of the particular weather conditions and the reduction in emissions during the period when restrictions were in place. Second, we used random forest with meteorological normalization to compare the observational data from the lockdown periods with the data from the same dates in 2019, thus decoupling the effects of the meteorology from short-term emission changes. This allowed us to analyze the general effect that restrictions similar to those imposed during the pandemic could have on pollutant concentrations, and this information could be useful to design mitigation strategies. The results during testing showed that the model captured the observed hourly variations and the diurnal cycles of these pollutants with a normalized mean bias of less than 6 % and Pearson correlation coefficients of the diurnal variations between 0.64 and 0.91 for all the pollutants considered. Based on the random forest results, we estimated that the lockdown implied relative changes in concentration of up to −45 % for CO, −75 % for NO, −46 % for NO2, −12 % for SO2, and −33 % for PM10 during the strictest mobility restrictions. O3 had a positive relative change in concentration (up to an 80 %) that is consistent with the response in a volatile-organic-compound-limited chemical regime to the decline in NOx emissions. The relative changes estimated using the meteorological normalization technique show mostly smaller changes than those obtained by the random forest predictive model. The relative changes were up to −26 % for CO, up to −47 % for NO, −36 % for NO2, −20 % for PM10, and up to 27 % for O3. SO2 is the only species that had a larger relative change when the meteorology was normalized (up to 20 %). This points out the need for accounting not only for differences in emissions but also in meteorological variables in order to evaluate the lockdown effects on air quality. The findings of this study may be valuable for formulating emission control strategies that do not disregard their implication on secondary pollutants. We believe that the model itself can also be a valuable contribution to a forecasting system in the city and that the general methodology could also be easily applied to other Latin American cities as well. We also provide the first O3 and SO2 observational dataset in more that a decade for a residential area in Buenos Aires, and it is openly available at https://doi.org/10.17632/h9y4hb8sf8.1 (Diaz Resquin et al., 2021).
Fil: Diaz Resquin, Melisa. Centro de Ciencia del Clima y la Resiliencia; Chile. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; Argentina
Fil: Lichtig, Pablo. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; Argentina
Fil: Alessandrello, Diego. Comisión Nacional de Energía Atómica; Argentina
Fil: De Oto, Marcelo. Comisión Nacional de Energía Atómica; Argentina
Fil: Gómez, Darío. Comisión Nacional de Energía Atómica; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería; Argentina
Fil: Rossler, Cristina Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; Argentina
Fil: Castesana, Paula Soledad. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Comisión Nacional de Energía Atómica; Argentina. YPF - Tecnología; Argentina
Fil: Dawidowski, Laura Elena. Comisión Nacional de Energía Atómica; Argentina. Universidad Nacional de San Martín. Instituto de Investigación e Ingeniería Ambiental. - Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigación e Ingeniería Ambiental; Argentina
description Having a prediction model for air quality at a low computational cost can be useful for research, forecasting, regulatory, and monitoring applications. This is of particular importance for Latin America, where rapid urbanization has imposed increasing stress on the air quality of almost all cities. In recent years, machine learning techniques have been increasingly accepted as a useful tool for air quality forecasting. Out of these, random forest has proven to be an approach that is both well-performing and computationally efficient while still providing key components reflecting the nonlinear relationships among emissions, chemical reactions, and meteorological effects. In this work, we employed the random forest methodology to build and test a forecasting model for the city of Buenos Aires. We used this model to study the deep decline in most pollutants during the lockdown imposed by the COVID-19 (COronaVIrus Disease 2019) pandemic by analyzing the effects of the change in emissions, while taking into account the changes in the meteorology, using two different approaches. First, we built random forest models trained with the data from before the beginning of the lockdown periods. We used the data to make predictions of the business-as-usual scenario during the lockdown periods and estimated the changes in concentrations by comparing the model results with the observations. This allowed us to assess the combined effects of the particular weather conditions and the reduction in emissions during the period when restrictions were in place. Second, we used random forest with meteorological normalization to compare the observational data from the lockdown periods with the data from the same dates in 2019, thus decoupling the effects of the meteorology from short-term emission changes. This allowed us to analyze the general effect that restrictions similar to those imposed during the pandemic could have on pollutant concentrations, and this information could be useful to design mitigation strategies. The results during testing showed that the model captured the observed hourly variations and the diurnal cycles of these pollutants with a normalized mean bias of less than 6 % and Pearson correlation coefficients of the diurnal variations between 0.64 and 0.91 for all the pollutants considered. Based on the random forest results, we estimated that the lockdown implied relative changes in concentration of up to −45 % for CO, −75 % for NO, −46 % for NO2, −12 % for SO2, and −33 % for PM10 during the strictest mobility restrictions. O3 had a positive relative change in concentration (up to an 80 %) that is consistent with the response in a volatile-organic-compound-limited chemical regime to the decline in NOx emissions. The relative changes estimated using the meteorological normalization technique show mostly smaller changes than those obtained by the random forest predictive model. The relative changes were up to −26 % for CO, up to −47 % for NO, −36 % for NO2, −20 % for PM10, and up to 27 % for O3. SO2 is the only species that had a larger relative change when the meteorology was normalized (up to 20 %). This points out the need for accounting not only for differences in emissions but also in meteorological variables in order to evaluate the lockdown effects on air quality. The findings of this study may be valuable for formulating emission control strategies that do not disregard their implication on secondary pollutants. We believe that the model itself can also be a valuable contribution to a forecasting system in the city and that the general methodology could also be easily applied to other Latin American cities as well. We also provide the first O3 and SO2 observational dataset in more that a decade for a residential area in Buenos Aires, and it is openly available at https://doi.org/10.17632/h9y4hb8sf8.1 (Diaz Resquin et al., 2021).
publishDate 2023
dc.date.none.fl_str_mv 2023-01
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/218709
Diaz Resquin, Melisa; Lichtig, Pablo; Alessandrello, Diego; De Oto, Marcelo; Gómez, Darío; et al.; A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina; Copernicus Publications; Earth System Science Data; 15; 1; 1-2023; 189-209
1866-3508
1866-3516
CONICET Digital
CONICET
url http://hdl.handle.net/11336/218709
identifier_str_mv Diaz Resquin, Melisa; Lichtig, Pablo; Alessandrello, Diego; De Oto, Marcelo; Gómez, Darío; et al.; A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina; Copernicus Publications; Earth System Science Data; 15; 1; 1-2023; 189-209
1866-3508
1866-3516
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/https://essd.copernicus.org/articles/15/189/2023/
info:eu-repo/semantics/altIdentifier/doi/10.5194/essd-15-189-2023
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Copernicus Publications
publisher.none.fl_str_mv Copernicus Publications
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1842980448596131840
score 12.993085