Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences

Autores: Buccella, Agustina; Cechich, Alejandra; Garrido, Walter; Montenegro, Ayelen
Año de publicación: 2026
Idioma: inglés
Tipo de recurso: artículo
Estado: versión publicada
Descripción: The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
EEA Alto Valle
Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina
Fuente: Applied Sciences 16 (3) : 1650. (February 2026)
Materia: Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing
Nivel de accesibilidad: acceso abierto
Condiciones de uso: http://creativecommons.org/licenses/by-nc-sa/4.0/
Repositorio
Institución: Instituto Nacional de Tecnología Agropecuaria
OAI Identificador: oai:localhost:20.500.12123/25203

Acceder

id	INTADig_c14036331d8b5f429916452efa6b6773
oai_identifier_str	oai:localhost:20.500.12123/25203
network_acronym_str	INTADig
repository_id_str	l
network_name_str	INTA Digital (INTA)
spelling	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge InfluencesBuccella, AgustinaCechich, AlejandraGarrido, WalterMontenegro, AyelenMacrodatoAgua SubterráneaAgricultura de PrecisiónProcesamiento de DatosBig DataGroundwaterPrecision AgricultureData ProcessingThe process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.EEA Alto ValleFil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; ArgentinaMDPI2026-02-13T14:05:38Z2026-02-13T14:05:38Z2026-02info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfhttp://hdl.handle.net/20.500.12123/25203https://www.mdpi.com/2076-3417/16/3/16502076-3417https://doi.org/10.3390/app16031650Applied Sciences 16 (3) : 1650. (February 2026)reponame:INTA Digital (INTA)instname:Instituto Nacional de Tecnología Agropecuariaenginfo:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)2026-06-04T09:46:15Zoai:localhost:20.500.12123/25203instacron:INTAInstitucionalhttp://repositorio.inta.gob.ar/Organismo científico-tecnológicoNo correspondehttp://repositorio.inta.gob.ar/oai/requesttripaldi.nicolas@inta.gob.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:l2026-06-04 09:46:15.613INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuariafalse
dc.title.none.fl_str_mv	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
spellingShingle	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences Buccella, Agustina Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing
title_short	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_full	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_fullStr	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_full_unstemmed	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
title_sort	Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
dc.creator.none.fl_str_mv	Buccella, Agustina Cechich, Alejandra Garrido, Walter Montenegro, Ayelen
author	Buccella, Agustina
author_facet	Buccella, Agustina Cechich, Alejandra Garrido, Walter Montenegro, Ayelen
author_role	author
author2	Cechich, Alejandra Garrido, Walter Montenegro, Ayelen
author2_role	author author author
dc.subject.none.fl_str_mv	Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing
topic	Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing
dc.description.none.fl_txt_mv	The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition. EEA Alto Valle Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina
description	The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
publishDate	2026
dc.date.none.fl_str_mv	2026-02-13T14:05:38Z 2026-02-13T14:05:38Z 2026-02
dc.type.none.fl_str_mv	info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo
format	article
status_str	publishedVersion
dc.identifier.none.fl_str_mv	http://hdl.handle.net/20.500.12123/25203 https://www.mdpi.com/2076-3417/16/3/1650 2076-3417 https://doi.org/10.3390/app16031650
url	http://hdl.handle.net/20.500.12123/25203 https://www.mdpi.com/2076-3417/16/3/1650 https://doi.org/10.3390/app16031650
identifier_str_mv	2076-3417
dc.language.none.fl_str_mv	eng
language	eng
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv	openAccess
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	MDPI
publisher.none.fl_str_mv	MDPI
dc.source.none.fl_str_mv	Applied Sciences 16 (3) : 1650. (February 2026) reponame:INTA Digital (INTA) instname:Instituto Nacional de Tecnología Agropecuaria
reponame_str	INTA Digital (INTA)
collection	INTA Digital (INTA)
instname_str	Instituto Nacional de Tecnología Agropecuaria
repository.name.fl_str_mv	INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuaria
repository.mail.fl_str_mv	tripaldi.nicolas@inta.gob.ar
_version_	1867091743224102912
score	12.8315525

Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences

Publicaciones similares