An instantiation for sequences of hierarchical distance-based conceptual clustering
- Autores
- Funes, Ana; Ramírez-Quintana, María José; Hernández-Orallo, Jose; Ferri, Cèsar
- Año de publicación
- 2011
- Idioma
- inglés
- Tipo de recurso
- documento de conferencia
- Estado
- versión publicada
- Descripción
- In this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalizations of the discovered clusters. Since the approach is general, it allows combining the flexibility of changing distances for different data types at the same time that we take advantage of the interpretability offered by the obtained concepts, which is central for descriptive data mining tasks. We propose here different generalization operators for sequences and analyze how they work together with the edit and linkage distances in HDCC. This analysis is carried out based on three different properties for generalization operators and three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalization operators.
Sociedad Argentina de Informática e Investigación Operativa - Materia
-
Ciencias Informáticas
conceptual clustering
distance based clustering
Linked lists
sequences
edit distance - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc-sa/4.0/
- Repositorio
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/125251
Ver los metadatos del registro completo
id |
SEDICI_475c1ed3367262c4af937d3c83e686cf |
---|---|
oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/125251 |
network_acronym_str |
SEDICI |
repository_id_str |
1329 |
network_name_str |
SEDICI (UNLP) |
spelling |
An instantiation for sequences of hierarchical distance-based conceptual clusteringFunes, AnaRamírez-Quintana, María JoséHernández-Orallo, JoseFerri, CèsarCiencias Informáticasconceptual clusteringdistance based clusteringLinked listssequencesedit distanceIn this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalizations of the discovered clusters. Since the approach is general, it allows combining the flexibility of changing distances for different data types at the same time that we take advantage of the interpretability offered by the obtained concepts, which is central for descriptive data mining tasks. We propose here different generalization operators for sequences and analyze how they work together with the edit and linkage distances in HDCC. This analysis is carried out based on three different properties for generalization operators and three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalization operators.Sociedad Argentina de Informática e Investigación Operativa2011-08info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf128-139http://sedici.unlp.edu.ar/handle/10915/125251enginfo:eu-repo/semantics/altIdentifier/issn/1850-2784info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2025-09-29T11:30:08Zoai:sedici.unlp.edu.ar:10915/125251Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292025-09-29 11:30:09.218SEDICI (UNLP) - Universidad Nacional de La Platafalse |
dc.title.none.fl_str_mv |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
title |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
spellingShingle |
An instantiation for sequences of hierarchical distance-based conceptual clustering Funes, Ana Ciencias Informáticas conceptual clustering distance based clustering Linked lists sequences edit distance |
title_short |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
title_full |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
title_fullStr |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
title_full_unstemmed |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
title_sort |
An instantiation for sequences of hierarchical distance-based conceptual clustering |
dc.creator.none.fl_str_mv |
Funes, Ana Ramírez-Quintana, María José Hernández-Orallo, Jose Ferri, Cèsar |
author |
Funes, Ana |
author_facet |
Funes, Ana Ramírez-Quintana, María José Hernández-Orallo, Jose Ferri, Cèsar |
author_role |
author |
author2 |
Ramírez-Quintana, María José Hernández-Orallo, Jose Ferri, Cèsar |
author2_role |
author author author |
dc.subject.none.fl_str_mv |
Ciencias Informáticas conceptual clustering distance based clustering Linked lists sequences edit distance |
topic |
Ciencias Informáticas conceptual clustering distance based clustering Linked lists sequences edit distance |
dc.description.none.fl_txt_mv |
In this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalizations of the discovered clusters. Since the approach is general, it allows combining the flexibility of changing distances for different data types at the same time that we take advantage of the interpretability offered by the obtained concepts, which is central for descriptive data mining tasks. We propose here different generalization operators for sequences and analyze how they work together with the edit and linkage distances in HDCC. This analysis is carried out based on three different properties for generalization operators and three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalization operators. Sociedad Argentina de Informática e Investigación Operativa |
description |
In this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalizations of the discovered clusters. Since the approach is general, it allows combining the flexibility of changing distances for different data types at the same time that we take advantage of the interpretability offered by the obtained concepts, which is central for descriptive data mining tasks. We propose here different generalization operators for sequences and analyze how they work together with the edit and linkage distances in HDCC. This analysis is carried out based on three different properties for generalization operators and three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalization operators. |
publishDate |
2011 |
dc.date.none.fl_str_mv |
2011-08 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/125251 |
url |
http://sedici.unlp.edu.ar/handle/10915/125251 |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/issn/1850-2784 |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.format.none.fl_str_mv |
application/pdf 128-139 |
dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
reponame_str |
SEDICI (UNLP) |
collection |
SEDICI (UNLP) |
instname_str |
Universidad Nacional de La Plata |
instacron_str |
UNLP |
institution |
UNLP |
repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
_version_ |
1844616180869890048 |
score |
13.070432 |