Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences

Autores
Buccella, Agustina; Cechich, Alejandra; Garrido, Walter; Montenegro, Ayelen
Año de publicación
2026
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
EEA Alto Valle
Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina
Fuente
Applied Sciences 16 (3) : 1650. (February 2026)
Materia
Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing
Nivel de accesibilidad
acceso abierto
Condiciones de uso
http://creativecommons.org/licenses/by-nc-sa/4.0/
Repositorio
INTA Digital (INTA)
Institución
Instituto Nacional de Tecnología Agropecuaria
OAI Identificador
oai:localhost:20.500.12123/25203

id INTADig_c14036331d8b5f429916452efa6b6773
oai_identifier_str oai:localhost:20.500.12123/25203
network_acronym_str INTADig
repository_id_str l
network_name_str INTA Digital (INTA)
spelling Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge InfluencesBuccella, AgustinaCechich, AlejandraGarrido, WalterMontenegro, AyelenMacrodatoAgua SubterráneaAgricultura de PrecisiónProcesamiento de DatosBig DataGroundwaterPrecision AgricultureData ProcessingThe process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.EEA Alto ValleFil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; ArgentinaMDPI2026-02-13T14:05:38Z2026-02-13T14:05:38Z2026-02info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfhttp://hdl.handle.net/20.500.12123/25203https://www.mdpi.com/2076-3417/16/3/16502076-3417https://doi.org/10.3390/app16031650Applied Sciences 16 (3) : 1650. (February 2026)reponame:INTA Digital (INTA)instname:Instituto Nacional de Tecnología Agropecuariaenginfo:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)2026-02-26T11:47:42Zoai:localhost:20.500.12123/25203instacron:INTAInstitucionalhttp://repositorio.inta.gob.ar/Organismo científico-tecnológicoNo correspondehttp://repositorio.inta.gob.ar/oai/requesttripaldi.nicolas@inta.gob.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:l2026-02-26 11:47:42.571INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuariafalse
dc.title.none.fl_str_mv Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
spellingShingle Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
Buccella, Agustina
Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing
title_short Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_full Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_fullStr Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_full_unstemmed Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_sort Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
dc.creator.none.fl_str_mv Buccella, Agustina
Cechich, Alejandra
Garrido, Walter
Montenegro, Ayelen
author Buccella, Agustina
author_facet Buccella, Agustina
Cechich, Alejandra
Garrido, Walter
Montenegro, Ayelen
author_role author
author2 Cechich, Alejandra
Garrido, Walter
Montenegro, Ayelen
author2_role author
author
author
dc.subject.none.fl_str_mv Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing
topic Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing
dc.description.none.fl_txt_mv The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
EEA Alto Valle
Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina
description The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
publishDate 2026
dc.date.none.fl_str_mv 2026-02-13T14:05:38Z
2026-02-13T14:05:38Z
2026-02
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/20.500.12123/25203
https://www.mdpi.com/2076-3417/16/3/1650
2076-3417
https://doi.org/10.3390/app16031650
url http://hdl.handle.net/20.500.12123/25203
https://www.mdpi.com/2076-3417/16/3/1650
https://doi.org/10.3390/app16031650
identifier_str_mv 2076-3417
dc.language.none.fl_str_mv eng
language eng
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv Applied Sciences 16 (3) : 1650. (February 2026)
reponame:INTA Digital (INTA)
instname:Instituto Nacional de Tecnología Agropecuaria
reponame_str INTA Digital (INTA)
collection INTA Digital (INTA)
instname_str Instituto Nacional de Tecnología Agropecuaria
repository.name.fl_str_mv INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuaria
repository.mail.fl_str_mv tripaldi.nicolas@inta.gob.ar
_version_ 1858207933869850624
score 13.176822