A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology

Autores
Wallner, Markus; Müller, Omar Vicente; Gomez, Andrea Alejandra; Joost, Ingeborg; Düker, Urda; Klawonn, Frank; Nogueira, Regina
Año de publicación
2025
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to analyse pathogen concentrations and to relate these measurements to cases from clinical data. This prediction of cases is subject to large errors, due to various factors such as dilution effects, decay or wastewater matrix and inhibitors. In this study we used different models to identify the most important, what we call, wastewater-based epidemiologically relevant parameters (WBERP) to describe these errors. We used linear regression and random forest regression as base models for predicting cases and random forest regression also to analyse the importance of different WBERP.Two catchments, one with a large proportion of combined sewers and one with separate sewers, served as study areas. Our results show that the most important information to be included in any model are the variants of concern (VOCs), a time-variable parameter. The performance for both catchments is improved by ~30 % in terms of root mean square error when the VOCs are used as additional information. For practical applications, this is a real drawback as it means that every time a new pathogen variant becomes dominant, we need to know the specific behaviour of the variant in the wastewater and its detection in order to interpret the WBE data correctly. This limits the predictive capabilities of such systems, perhaps not in terms of dynamics but for quantitative statements. The addition of other physicochemical parameters and faecal markers only marginally improved the results. Furthermore, there were differences in the importance of the parameters between the catchments, which limits the generalisability of the conclusions. The results show that more complex wastewater matrices (high proportion of combined sewer system) influence the relationship between pathogen concentration and medical cases more than those of less complex wastewater matrices (separate sewer system).
Fil: Wallner, Markus. Ostfalia University Of Applied Sciences; Alemania
Fil: Müller, Omar Vicente. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; Argentina
Fil: Gomez, Andrea Alejandra. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; Argentina
Fil: Joost, Ingeborg. Ostfalia University of Applied Science; Alemania
Fil: Düker, Urda. Leibniz Universitat Hannover.; Alemania
Fil: Klawonn, Frank. Ostfalia University Of Applied Sciences; Alemania
Fil: Nogueira, Regina. Leibniz Universitat Hannover.; Alemania
Materia
COVID-19
WASTEWATER
EPIDEMIOLOGY
PREDICTION
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/267048

id CONICETDig_eaa88e25aa19c590adaef702708f6e21
oai_identifier_str oai:ri.conicet.gov.ar:11336/267048
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiologyWallner, MarkusMüller, Omar VicenteGomez, Andrea AlejandraJoost, IngeborgDüker, UrdaKlawonn, FrankNogueira, ReginaCOVID-19WASTEWATEREPIDEMIOLOGYPREDICTIONhttps://purl.org/becyt/ford/3.3https://purl.org/becyt/ford/3With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to analyse pathogen concentrations and to relate these measurements to cases from clinical data. This prediction of cases is subject to large errors, due to various factors such as dilution effects, decay or wastewater matrix and inhibitors. In this study we used different models to identify the most important, what we call, wastewater-based epidemiologically relevant parameters (WBERP) to describe these errors. We used linear regression and random forest regression as base models for predicting cases and random forest regression also to analyse the importance of different WBERP.Two catchments, one with a large proportion of combined sewers and one with separate sewers, served as study areas. Our results show that the most important information to be included in any model are the variants of concern (VOCs), a time-variable parameter. The performance for both catchments is improved by ~30 % in terms of root mean square error when the VOCs are used as additional information. For practical applications, this is a real drawback as it means that every time a new pathogen variant becomes dominant, we need to know the specific behaviour of the variant in the wastewater and its detection in order to interpret the WBE data correctly. This limits the predictive capabilities of such systems, perhaps not in terms of dynamics but for quantitative statements. The addition of other physicochemical parameters and faecal markers only marginally improved the results. Furthermore, there were differences in the importance of the parameters between the catchments, which limits the generalisability of the conclusions. The results show that more complex wastewater matrices (high proportion of combined sewer system) influence the relationship between pathogen concentration and medical cases more than those of less complex wastewater matrices (separate sewer system).Fil: Wallner, Markus. Ostfalia University Of Applied Sciences; AlemaniaFil: Müller, Omar Vicente. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; ArgentinaFil: Gomez, Andrea Alejandra. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; ArgentinaFil: Joost, Ingeborg. Ostfalia University of Applied Science; AlemaniaFil: Düker, Urda. Leibniz Universitat Hannover.; AlemaniaFil: Klawonn, Frank. Ostfalia University Of Applied Sciences; AlemaniaFil: Nogueira, Regina. Leibniz Universitat Hannover.; AlemaniaElsevier2025-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/267048Wallner, Markus; Müller, Omar Vicente; Gomez, Andrea Alejandra; Joost, Ingeborg; Düker, Urda; et al.; A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology; Elsevier; Science of the Total Environment; 959; 1-2025; 1-150048-9697CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/https://linkinghub.elsevier.com/retrieve/pii/S0048969724083074info:eu-repo/semantics/altIdentifier/doi/10.1016/j.scitotenv.2024.178149info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-03T09:44:40Zoai:ri.conicet.gov.ar:11336/267048instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-03 09:44:41.194CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
title A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
spellingShingle A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
Wallner, Markus
COVID-19
WASTEWATER
EPIDEMIOLOGY
PREDICTION
title_short A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
title_full A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
title_fullStr A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
title_full_unstemmed A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
title_sort A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology
dc.creator.none.fl_str_mv Wallner, Markus
Müller, Omar Vicente
Gomez, Andrea Alejandra
Joost, Ingeborg
Düker, Urda
Klawonn, Frank
Nogueira, Regina
author Wallner, Markus
author_facet Wallner, Markus
Müller, Omar Vicente
Gomez, Andrea Alejandra
Joost, Ingeborg
Düker, Urda
Klawonn, Frank
Nogueira, Regina
author_role author
author2 Müller, Omar Vicente
Gomez, Andrea Alejandra
Joost, Ingeborg
Düker, Urda
Klawonn, Frank
Nogueira, Regina
author2_role author
author
author
author
author
author
dc.subject.none.fl_str_mv COVID-19
WASTEWATER
EPIDEMIOLOGY
PREDICTION
topic COVID-19
WASTEWATER
EPIDEMIOLOGY
PREDICTION
purl_subject.fl_str_mv https://purl.org/becyt/ford/3.3
https://purl.org/becyt/ford/3
dc.description.none.fl_txt_mv With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to analyse pathogen concentrations and to relate these measurements to cases from clinical data. This prediction of cases is subject to large errors, due to various factors such as dilution effects, decay or wastewater matrix and inhibitors. In this study we used different models to identify the most important, what we call, wastewater-based epidemiologically relevant parameters (WBERP) to describe these errors. We used linear regression and random forest regression as base models for predicting cases and random forest regression also to analyse the importance of different WBERP.Two catchments, one with a large proportion of combined sewers and one with separate sewers, served as study areas. Our results show that the most important information to be included in any model are the variants of concern (VOCs), a time-variable parameter. The performance for both catchments is improved by ~30 % in terms of root mean square error when the VOCs are used as additional information. For practical applications, this is a real drawback as it means that every time a new pathogen variant becomes dominant, we need to know the specific behaviour of the variant in the wastewater and its detection in order to interpret the WBE data correctly. This limits the predictive capabilities of such systems, perhaps not in terms of dynamics but for quantitative statements. The addition of other physicochemical parameters and faecal markers only marginally improved the results. Furthermore, there were differences in the importance of the parameters between the catchments, which limits the generalisability of the conclusions. The results show that more complex wastewater matrices (high proportion of combined sewer system) influence the relationship between pathogen concentration and medical cases more than those of less complex wastewater matrices (separate sewer system).
Fil: Wallner, Markus. Ostfalia University Of Applied Sciences; Alemania
Fil: Müller, Omar Vicente. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; Argentina
Fil: Gomez, Andrea Alejandra. Universidad Nacional del Litoral. Facultad de Ingenieria y Ciencias Hidricas. Centro de Estudios de Variabilidad y Cambio Climatico.; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe; Argentina
Fil: Joost, Ingeborg. Ostfalia University of Applied Science; Alemania
Fil: Düker, Urda. Leibniz Universitat Hannover.; Alemania
Fil: Klawonn, Frank. Ostfalia University Of Applied Sciences; Alemania
Fil: Nogueira, Regina. Leibniz Universitat Hannover.; Alemania
description With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to analyse pathogen concentrations and to relate these measurements to cases from clinical data. This prediction of cases is subject to large errors, due to various factors such as dilution effects, decay or wastewater matrix and inhibitors. In this study we used different models to identify the most important, what we call, wastewater-based epidemiologically relevant parameters (WBERP) to describe these errors. We used linear regression and random forest regression as base models for predicting cases and random forest regression also to analyse the importance of different WBERP.Two catchments, one with a large proportion of combined sewers and one with separate sewers, served as study areas. Our results show that the most important information to be included in any model are the variants of concern (VOCs), a time-variable parameter. The performance for both catchments is improved by ~30 % in terms of root mean square error when the VOCs are used as additional information. For practical applications, this is a real drawback as it means that every time a new pathogen variant becomes dominant, we need to know the specific behaviour of the variant in the wastewater and its detection in order to interpret the WBE data correctly. This limits the predictive capabilities of such systems, perhaps not in terms of dynamics but for quantitative statements. The addition of other physicochemical parameters and faecal markers only marginally improved the results. Furthermore, there were differences in the importance of the parameters between the catchments, which limits the generalisability of the conclusions. The results show that more complex wastewater matrices (high proportion of combined sewer system) influence the relationship between pathogen concentration and medical cases more than those of less complex wastewater matrices (separate sewer system).
publishDate 2025
dc.date.none.fl_str_mv 2025-01
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/267048
Wallner, Markus; Müller, Omar Vicente; Gomez, Andrea Alejandra; Joost, Ingeborg; Düker, Urda; et al.; A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology; Elsevier; Science of the Total Environment; 959; 1-2025; 1-15
0048-9697
CONICET Digital
CONICET
url http://hdl.handle.net/11336/267048
identifier_str_mv Wallner, Markus; Müller, Omar Vicente; Gomez, Andrea Alejandra; Joost, Ingeborg; Düker, Urda; et al.; A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology; Elsevier; Science of the Total Environment; 959; 1-2025; 1-15
0048-9697
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/https://linkinghub.elsevier.com/retrieve/pii/S0048969724083074
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.scitotenv.2024.178149
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Elsevier
publisher.none.fl_str_mv Elsevier
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1842268682844110848
score 13.13397