A prototypical tool for analyzing functional dependencies induced from spreadsheets
- Authors
- Gómez, Sergio Alejandro; Fillottrani, Pablo Rubén
- Publication year
- 2023
- Language
- English
- Resource type
- conference paper
- Status
- published version
- Description
- We present an extension to the GF framework for Ontology-Based Data Access that aims to determine the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE data-mining algorithm. This set is then presented to the user, who acts as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet that justify it. The user can revise the validity of the functional dependency with the help of our system, which generates tuples not present in the dataset by using values already present in the table. The user can then add to the table any of the new records they consider feasible and rerun the miner to see whether the functional dependency still holds. We present a running example along with a downloadable Java-based application, with the source code of the miner in the C programming language and the files used in our experiments, to support the reproducibility of our results.
Red de Universidades con Carreras en Informática
- Subject
- Computer Science; Spreadsheets; TANE; Functional dependencies; Databases
- Access level
- open access
- Terms of use
- http://creativecommons.org/licenses/by-nc-sa/4.0/
- Repository
- SEDICI (UNLP)
- Institution
- Universidad Nacional de La Plata
- OAI identifier
- oai:sedici.unlp.edu.ar:10915/164976
Full record metadata
| Field | Value |
|---|---|
| id | SEDICI_2ba79046caf4bafa2659c9e218245554 |
| oai_identifier_str | oai:sedici.unlp.edu.ar:10915/164976 |
| network_acronym_str | SEDICI |
| repository_id_str | 1329 |
| network_name_str | SEDICI (UNLP) |
| dc.title.none.fl_str_mv | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_short | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_full | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_fullStr | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_full_unstemmed | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| title_sort | A prototypical tool for analyzing functional dependencies induced from spreadsheets |
| dc.creator.none.fl_str_mv | Gómez, Sergio Alejandro; Fillottrani, Pablo Rubén |
| author | Gómez, Sergio Alejandro |
| author_facet | Gómez, Sergio Alejandro; Fillottrani, Pablo Rubén |
| author_role | author |
| author2 | Fillottrani, Pablo Rubén |
| author2_role | author |
| dc.subject.none.fl_str_mv | Computer Science; Spreadsheets; TANE; Functional dependencies; Databases |
| topic | Computer Science; Spreadsheets; TANE; Functional dependencies; Databases |
| dc.description.none.fl_txt_mv | We present an extension to the GF framework for Ontology-Based Data Access that aims to determine the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE data-mining algorithm. This set is then presented to the user, who acts as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet that justify it. The user can revise the validity of the functional dependency with the help of our system, which generates tuples not present in the dataset by using values already present in the table. The user can then add to the table any of the new records they consider feasible and rerun the miner to see whether the functional dependency still holds. We present a running example along with a downloadable Java-based application, with the source code of the miner in the C programming language and the files used in our experiments, to support the reproducibility of our results. Red de Universidades con Carreras en Informática |
| description | We present an extension to the GF framework for Ontology-Based Data Access that aims to determine the functional dependencies that hold in a spreadsheet. Spreadsheets are restricted to a single table expressed as a CSV text file. An initial set of tentative functional dependencies is computed using the TANE data-mining algorithm. This set is then presented to the user, who acts as an oracle to revise it. Given a functional dependency, the user can see the tuples from the spreadsheet that justify it. The user can revise the validity of the functional dependency with the help of our system, which generates tuples not present in the dataset by using values already present in the table. The user can then add to the table any of the new records they consider feasible and rerun the miner to see whether the functional dependency still holds. We present a running example along with a downloadable Java-based application, with the source code of the miner in the C programming language and the files used in our experiments, to support the reproducibility of our results. |
| publishDate | 2023 |
| dc.date.none.fl_str_mv | 2023-10 |
| dc.type.none.fl_str_mv | info:eu-repo/semantics/conferenceObject; info:eu-repo/semantics/publishedVersion; Conference object; http://purl.org/coar/resource_type/c_5794; info:ar-repo/semantics/documentoDeConferencia |
| format | conferenceObject |
| status_str | publishedVersion |
| dc.identifier.none.fl_str_mv | http://sedici.unlp.edu.ar/handle/10915/164976 |
| url | http://sedici.unlp.edu.ar/handle/10915/164976 |
| dc.language.none.fl_str_mv | eng |
| language | eng |
| dc.relation.none.fl_str_mv | info:eu-repo/semantics/altIdentifier/isbn/978-987-9285-51-0; info:eu-repo/semantics/reference/url/https://sedici.unlp.edu.ar/handle/10915/163107 |
| dc.rights.none.fl_str_mv | info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by-nc-sa/4.0/; Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| eu_rights_str_mv | openAccess |
| rights_invalid_str_mv | http://creativecommons.org/licenses/by-nc-sa/4.0/; Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.format.none.fl_str_mv | application/pdf; 409-417 |
| dc.source.none.fl_str_mv | reponame:SEDICI (UNLP); instname:Universidad Nacional de La Plata; instacron:UNLP |
| reponame_str | SEDICI (UNLP) |
| collection | SEDICI (UNLP) |
| instname_str | Universidad Nacional de La Plata |
| instacron_str | UNLP |
| institution | UNLP |
| repository.name.fl_str_mv | SEDICI (UNLP) - Universidad Nacional de La Plata |
| repository.mail.fl_str_mv | alira@sedici.unlp.edu.ar |
| _version_ | 1846783701819588608 |
| score | 12.982451 |
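
The workflow summarized in the description (check a functional dependency against a single CSV table, generate tuples built from values already present, add one the user accepts, and re-check) can be illustrated with a minimal Python sketch. This is not the authors' tool and it does not implement TANE: the check below is a naive group-by test, the candidate generator is simply the cross product of the observed column values, and the sample table, the column names, and the functions `load_rows`, `fd_violations`, and `candidate_tuples` are all hypothetical names chosen for illustration.

```python
import csv
import io
from collections import defaultdict
from itertools import product


def load_rows(csv_text):
    """Parse a single-table CSV (header row first) into a list of dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def fd_violations(rows, lhs, rhs):
    """Return groups of rows that agree on `lhs` but disagree on `rhs`.

    An empty result means the dependency lhs -> rhs holds in `rows`.
    This is a naive check, not the TANE algorithm.
    """
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[a] for a in lhs)].append(row)
    return [g for g in groups.values() if len({r[rhs] for r in g}) > 1]


def candidate_tuples(rows, attributes):
    """Yield combinations of already-present values that are not rows yet.

    Assumption: the cross product of column values stands in for whatever
    generation strategy the real tool uses.
    """
    existing = {tuple(r[a] for a in attributes) for r in rows}
    domains = [sorted({r[a] for r in rows}) for a in attributes]
    for combo in product(*domains):
        if combo not in existing:
            yield dict(zip(attributes, combo))


# Hypothetical sample spreadsheet, invented for this sketch.
SAMPLE = """emp,dept,city
ana,sales,La Plata
luis,sales,La Plata
eva,it,Bahia Blanca
"""

rows = load_rows(SAMPLE)
attrs = ["emp", "dept", "city"]

# dept -> city holds in the sample: every dept maps to exactly one city.
holds_before = not fd_violations(rows, ["dept"], "city")

# Simulate the user (the oracle) accepting one generated tuple as feasible.
accepted = next(t for t in candidate_tuples(rows, attrs)
                if t["dept"] == "sales" and t["city"] == "Bahia Blanca")
rows.append(accepted)

# After adding the accepted tuple, the dependency no longer holds.
holds_after = not fd_violations(rows, ["dept"], "city")
print(holds_before, holds_after)
```

In this sketch the dependency `dept -> city` holds on the original three rows and stops holding once the accepted tuple is added, which mirrors the revise-and-rerun loop the description outlines. The groups returned by `fd_violations` play the role of the tuples shown to the user as evidence for or against a dependency.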