A wrapper&mediator prototype for web data warehouses
- Autores
- Giaudrone, Verónica; Guerra, Marcelo; Vaccaro, Marcelo; Motz, Regina; Marotta, Adriana
- Año de publicación
- 2005
- Idioma
- inglés
- Tipo de recurso
- documento de conferencia
- Estado
- versión publicada
- Descripción
- There is a lot of information published on the Web that can be useful for decision-making. The work reported in this paper focuses on how to extract and integrate this information in order to construct a Data Warehouse that makes it available. The manual process of extracting and integrating information is expensive and complex. That is the reason why we suggest the development of a tool, based on Wrappers and Mediators, which allows the extraction of information from the Web and integrate it automatically. Wrappers are in charge of information extraction, which is based on page’s structure and a query enriched using a domain ontology. Mediators perform data integration in order to combine information from several sources, solving the conflicts that may appear due to contradictory information, taking into account the trust of the sources. An important characteristic we consider in our proposal is that information contained in the Web changes constantly; therefore a mechanism that supports system evolution becomes essential. For this reason we propose the generation of metadata that keeps the traceability of the process, allowing managing the impact of the source changes on the whole system.
II Workshop de Ingeniería de Software y Bases de Datos (WISBD)
Red de Universidades con Carreras en Informática (RedUNCI) - Materia
-
Ciencias Informáticas
Data warehouse and repository
wrapper
mediator
ontology - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc-sa/2.5/ar/
- Repositorio
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/23160
Ver los metadatos del registro completo
id |
SEDICI_f2140a9b49ea01a4df1d70aa0534d832 |
---|---|
oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/23160 |
network_acronym_str |
SEDICI |
repository_id_str |
1329 |
network_name_str |
SEDICI (UNLP) |
spelling |
A wrapper&mediator prototype for web data warehousesGiaudrone, VerónicaGuerra, MarceloVaccaro, MarceloMotz, ReginaMarotta, AdrianaCiencias InformáticasData warehouse and repositorywrappermediatorontologyThere is a lot of information published on the Web that can be useful for decision-making. The work reported in this paper focuses on how to extract and integrate this information in order to construct a Data Warehouse that makes it available. The manual process of extracting and integrating information is expensive and complex. That is the reason why we suggest the development of a tool, based on Wrappers and Mediators, which allows the extraction of information from the Web and integrate it automatically. Wrappers are in charge of information extraction, which is based on page’s structure and a query enriched using a domain ontology. Mediators perform data integration in order to combine information from several sources, solving the conflicts that may appear due to contradictory information, taking into account the trust of the sources. An important characteristic we consider in our proposal is that information contained in the Web changes constantly; therefore a mechanism that supports system evolution becomes essential. For this reason we propose the generation of metadata that keeps the traceability of the process, allowing managing the impact of the source changes on the whole system.II Workshop de Ingeniería de Software y Bases de Datos (WISBD)Red de Universidades con Carreras en Informática (RedUNCI)2005-10info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdfhttp://sedici.unlp.edu.ar/handle/10915/23160enginfo:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/2.5/ar/Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2025-09-29T10:55:21Zoai:sedici.unlp.edu.ar:10915/23160Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292025-09-29 10:55:21.775SEDICI (UNLP) - Universidad Nacional de La Platafalse |
dc.title.none.fl_str_mv |
A wrapper&mediator prototype for web data warehouses |
title |
A wrapper&mediator prototype for web data warehouses |
spellingShingle |
A wrapper&mediator prototype for web data warehouses Giaudrone, Verónica Ciencias Informáticas Data warehouse and repository wrapper mediator ontology |
title_short |
A wrapper&mediator prototype for web data warehouses |
title_full |
A wrapper&mediator prototype for web data warehouses |
title_fullStr |
A wrapper&mediator prototype for web data warehouses |
title_full_unstemmed |
A wrapper&mediator prototype for web data warehouses |
title_sort |
A wrapper&mediator prototype for web data warehouses |
dc.creator.none.fl_str_mv |
Giaudrone, Verónica Guerra, Marcelo Vaccaro, Marcelo Motz, Regina Marotta, Adriana |
author |
Giaudrone, Verónica |
author_facet |
Giaudrone, Verónica Guerra, Marcelo Vaccaro, Marcelo Motz, Regina Marotta, Adriana |
author_role |
author |
author2 |
Guerra, Marcelo Vaccaro, Marcelo Motz, Regina Marotta, Adriana |
author2_role |
author author author author |
dc.subject.none.fl_str_mv |
Ciencias Informáticas Data warehouse and repository wrapper mediator ontology |
topic |
Ciencias Informáticas Data warehouse and repository wrapper mediator ontology |
dc.description.none.fl_txt_mv |
There is a lot of information published on the Web that can be useful for decision-making. The work reported in this paper focuses on how to extract and integrate this information in order to construct a Data Warehouse that makes it available. The manual process of extracting and integrating information is expensive and complex. That is the reason why we suggest the development of a tool, based on Wrappers and Mediators, which allows the extraction of information from the Web and integrate it automatically. Wrappers are in charge of information extraction, which is based on page’s structure and a query enriched using a domain ontology. Mediators perform data integration in order to combine information from several sources, solving the conflicts that may appear due to contradictory information, taking into account the trust of the sources. An important characteristic we consider in our proposal is that information contained in the Web changes constantly; therefore a mechanism that supports system evolution becomes essential. For this reason we propose the generation of metadata that keeps the traceability of the process, allowing managing the impact of the source changes on the whole system. II Workshop de Ingeniería de Software y Bases de Datos (WISBD) Red de Universidades con Carreras en Informática (RedUNCI) |
description |
There is a lot of information published on the Web that can be useful for decision-making. The work reported in this paper focuses on how to extract and integrate this information in order to construct a Data Warehouse that makes it available. The manual process of extracting and integrating information is expensive and complex. That is the reason why we suggest the development of a tool, based on Wrappers and Mediators, which allows the extraction of information from the Web and integrate it automatically. Wrappers are in charge of information extraction, which is based on page’s structure and a query enriched using a domain ontology. Mediators perform data integration in order to combine information from several sources, solving the conflicts that may appear due to contradictory information, taking into account the trust of the sources. An important characteristic we consider in our proposal is that information contained in the Web changes constantly; therefore a mechanism that supports system evolution becomes essential. For this reason we propose the generation of metadata that keeps the traceability of the process, allowing managing the impact of the source changes on the whole system. |
publishDate |
2005 |
dc.date.none.fl_str_mv |
2005-10 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/23160 |
url |
http://sedici.unlp.edu.ar/handle/10915/23160 |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/2.5/ar/ Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5) |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-sa/2.5/ar/ Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5) |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
reponame_str |
SEDICI (UNLP) |
collection |
SEDICI (UNLP) |
instname_str |
Universidad Nacional de La Plata |
instacron_str |
UNLP |
institution |
UNLP |
repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
_version_ |
1844615812302766080 |
score |
13.070432 |