Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences
- Autores
- Buccella, Agustina; Cechich, Alejandra; Garrido, Walter; Montenegro, Ayelen
- Año de publicación
- 2026
- Idioma
- inglés
- Tipo de recurso
- artículo
- Estado
- versión publicada
- Descripción
- The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.
EEA Alto Valle
Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina
Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina - Fuente
- Applied Sciences 16 (3) : 1650. (February 2026)
- Materia
-
Macrodato
Agua Subterránea
Agricultura de Precisión
Procesamiento de Datos
Big Data
Groundwater
Precision Agriculture
Data Processing - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc-sa/4.0/
- Repositorio
.jpg)
- Institución
- Instituto Nacional de Tecnología Agropecuaria
- OAI Identificador
- oai:localhost:20.500.12123/25203
Ver los metadatos del registro completo
| id |
INTADig_c14036331d8b5f429916452efa6b6773 |
|---|---|
| oai_identifier_str |
oai:localhost:20.500.12123/25203 |
| network_acronym_str |
INTADig |
| repository_id_str |
l |
| network_name_str |
INTA Digital (INTA) |
| spelling |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge InfluencesBuccella, AgustinaCechich, AlejandraGarrido, WalterMontenegro, AyelenMacrodatoAgua SubterráneaAgricultura de PrecisiónProcesamiento de DatosBig DataGroundwaterPrecision AgricultureData ProcessingThe process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition.EEA Alto ValleFil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); ArgentinaFil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; ArgentinaMDPI2026-02-13T14:05:38Z2026-02-13T14:05:38Z2026-02info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfhttp://hdl.handle.net/20.500.12123/25203https://www.mdpi.com/2076-3417/16/3/16502076-3417https://doi.org/10.3390/app16031650Applied Sciences 16 (3) : 1650. (February 2026)reponame:INTA Digital (INTA)instname:Instituto Nacional de Tecnología Agropecuariaenginfo:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)2026-02-26T11:47:42Zoai:localhost:20.500.12123/25203instacron:INTAInstitucionalhttp://repositorio.inta.gob.ar/Organismo científico-tecnológicoNo correspondehttp://repositorio.inta.gob.ar/oai/requesttripaldi.nicolas@inta.gob.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:l2026-02-26 11:47:42.571INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuariafalse |
| dc.title.none.fl_str_mv |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| title |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| spellingShingle |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences Buccella, Agustina Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing |
| title_short |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| title_full |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| title_fullStr |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| title_full_unstemmed |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| title_sort |
Contextual Reuse of Big Data Systems: A Case Study Assessing Groundwater Recharge Influences |
| dc.creator.none.fl_str_mv |
Buccella, Agustina Cechich, Alejandra Garrido, Walter Montenegro, Ayelen |
| author |
Buccella, Agustina |
| author_facet |
Buccella, Agustina Cechich, Alejandra Garrido, Walter Montenegro, Ayelen |
| author_role |
author |
| author2 |
Cechich, Alejandra Garrido, Walter Montenegro, Ayelen |
| author2_role |
author author author |
| dc.subject.none.fl_str_mv |
Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing |
| topic |
Macrodato Agua Subterránea Agricultura de Precisión Procesamiento de Datos Big Data Groundwater Precision Agriculture Data Processing |
| dc.description.none.fl_txt_mv |
The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition. EEA Alto Valle Fil: Buccella, Agustina. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Cechich, Alejandra. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Garrido, Walter. Universidad Nacional del Comahue. Facultad de Informática. Departamento de Ingeniería de Sistemas. Grupo de Investigación en Ingeniería de Software del Comahue (GIISCo); Argentina Fil: Montenegro, Ayelen. Instituto Nacional de Tecnología Agropecuaria (INTA). Estación Experimental Agropecuaria Alto Valle; Argentina |
| description |
The process of building data analytics systems, including big data systems, is currently being investigated from various perspectives that generally focus on specific aspects, such as data security or privacy, to the detriment of an engineering perspective on systems development. To address this limitation, our proposal focuses on developing analytics systems through a reuse-based approach, including stages ranging from problem definition to results analysis by identifying variations and building reusable, context-based assets. This study presents the reuse process by constructing two case studies that address the water table level prediction problem in two different contexts: the irrigated period and the non-irrigated period in the same study area. The objective of this study is to demonstrate the influence of context on the performance of widely used predictive models for this problem, including long short-term memory (LSTM), artificial neural networks (ANNs), and support vector machines (SVMs), as well as the potential for reusing the developed analytics system. Additionally, we applied the permutation feature importance (PFI) to determine the contribution of individual variables to the prediction. The results confirm that the same problem hypotheses yield different performance in each case in terms of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean square error (MSE). They also show that the best-performing predictive models differ for some of the hypotheses (ANN in one case and LSTM in another), supporting the assumption that context can influence model selection and performance. Reusing assets allows for more efficient evaluation of these alternatives during development time, resulting in analytics systems that are more closely aligned with reality, while also offering the advantages of software system composition. |
| publishDate |
2026 |
| dc.date.none.fl_str_mv |
2026-02-13T14:05:38Z 2026-02-13T14:05:38Z 2026-02 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
| format |
article |
| status_str |
publishedVersion |
| dc.identifier.none.fl_str_mv |
http://hdl.handle.net/20.500.12123/25203 https://www.mdpi.com/2076-3417/16/3/1650 2076-3417 https://doi.org/10.3390/app16031650 |
| url |
http://hdl.handle.net/20.500.12123/25203 https://www.mdpi.com/2076-3417/16/3/1650 https://doi.org/10.3390/app16031650 |
| identifier_str_mv |
2076-3417 |
| dc.language.none.fl_str_mv |
eng |
| language |
eng |
| dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| eu_rights_str_mv |
openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
MDPI |
| publisher.none.fl_str_mv |
MDPI |
| dc.source.none.fl_str_mv |
Applied Sciences 16 (3) : 1650. (February 2026) reponame:INTA Digital (INTA) instname:Instituto Nacional de Tecnología Agropecuaria |
| reponame_str |
INTA Digital (INTA) |
| collection |
INTA Digital (INTA) |
| instname_str |
Instituto Nacional de Tecnología Agropecuaria |
| repository.name.fl_str_mv |
INTA Digital (INTA) - Instituto Nacional de Tecnología Agropecuaria |
| repository.mail.fl_str_mv |
tripaldi.nicolas@inta.gob.ar |
| _version_ |
1858207933869850624 |
| score |
13.176822 |