An optimization of the dynamic set of clusters for similarity searches

Autores
Samat, Pablo; Ludueña, Verónica; Reyes, Nora Susana; Paredes, Rodrigo; Figueroa, Karina; Lagos, Miguel
Año de publicación
2025
Idioma
inglés
Tipo de recurso
documento de conferencia
Estado
versión publicada
Descripción
While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.
Red de Universidades con Carreras en Informática
Materia
Ciencias Informáticas
similarity searches
dynamic indexes
non conventional databases
Nivel de accesibilidad
acceso abierto
Condiciones de uso
http://creativecommons.org/licenses/by-nc-sa/4.0/
Repositorio
SEDICI (UNLP)
Institución
Universidad Nacional de La Plata
OAI Identificador
oai:sedici.unlp.edu.ar:10915/191274

id SEDICI_960b549f16b315beb9971b1be006b2ca
oai_identifier_str oai:sedici.unlp.edu.ar:10915/191274
network_acronym_str SEDICI
repository_id_str 1329
network_name_str SEDICI (UNLP)
spelling An optimization of the dynamic set of clusters for similarity searchesSamat, PabloLudueña, VerónicaReyes, Nora SusanaParedes, RodrigoFigueroa, KarinaLagos, MiguelCiencias Informáticassimilarity searchesdynamic indexesnon conventional databasesWhile metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.Red de Universidades con Carreras en Informática2025-10info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf477-486http://sedici.unlp.edu.ar/handle/10915/191274enginfo:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7info:eu-repo/semantics/reference/hdl/10915/189846info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-03-31T12:41:46Zoai:sedici.unlp.edu.ar:10915/191274Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-03-31 12:41:46.884SEDICI (UNLP) - Universidad Nacional de La Platafalse
dc.title.none.fl_str_mv An optimization of the dynamic set of clusters for similarity searches
title An optimization of the dynamic set of clusters for similarity searches
spellingShingle An optimization of the dynamic set of clusters for similarity searches
Samat, Pablo
Ciencias Informáticas
similarity searches
dynamic indexes
non conventional databases
title_short An optimization of the dynamic set of clusters for similarity searches
title_full An optimization of the dynamic set of clusters for similarity searches
title_fullStr An optimization of the dynamic set of clusters for similarity searches
title_full_unstemmed An optimization of the dynamic set of clusters for similarity searches
title_sort An optimization of the dynamic set of clusters for similarity searches
dc.creator.none.fl_str_mv Samat, Pablo
Ludueña, Verónica
Reyes, Nora Susana
Paredes, Rodrigo
Figueroa, Karina
Lagos, Miguel
author Samat, Pablo
author_facet Samat, Pablo
Ludueña, Verónica
Reyes, Nora Susana
Paredes, Rodrigo
Figueroa, Karina
Lagos, Miguel
author_role author
author2 Ludueña, Verónica
Reyes, Nora Susana
Paredes, Rodrigo
Figueroa, Karina
Lagos, Miguel
author2_role author
author
author
author
author
dc.subject.none.fl_str_mv Ciencias Informáticas
similarity searches
dynamic indexes
non conventional databases
topic Ciencias Informáticas
similarity searches
dynamic indexes
non conventional databases
dc.description.none.fl_txt_mv While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.
Red de Universidades con Carreras en Informática
description While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.
publishDate 2025
dc.date.none.fl_str_mv 2025-10
dc.type.none.fl_str_mv info:eu-repo/semantics/conferenceObject
info:eu-repo/semantics/publishedVersion
Objeto de conferencia
http://purl.org/coar/resource_type/c_5794
info:ar-repo/semantics/documentoDeConferencia
format conferenceObject
status_str publishedVersion
dc.identifier.none.fl_str_mv http://sedici.unlp.edu.ar/handle/10915/191274
url http://sedici.unlp.edu.ar/handle/10915/191274
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7
info:eu-repo/semantics/reference/hdl/10915/189846
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
eu_rights_str_mv openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-sa/4.0/
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.format.none.fl_str_mv application/pdf
477-486
dc.source.none.fl_str_mv reponame:SEDICI (UNLP)
instname:Universidad Nacional de La Plata
instacron:UNLP
reponame_str SEDICI (UNLP)
collection SEDICI (UNLP)
instname_str Universidad Nacional de La Plata
instacron_str UNLP
institution UNLP
repository.name.fl_str_mv SEDICI (UNLP) - Universidad Nacional de La Plata
repository.mail.fl_str_mv alira@sedici.unlp.edu.ar
_version_ 1861199752954118144
score 13.332987