An optimization of the dynamic set of clusters for similarity searches
- Autores
- Samat, Pablo; Ludueña, Verónica; Reyes, Nora Susana; Paredes, Rodrigo; Figueroa, Karina; Lagos, Miguel
- Año de publicación
- 2025
- Idioma
- inglés
- Tipo de recurso
- documento de conferencia
- Estado
- versión publicada
- Descripción
- While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.
Red de Universidades con Carreras en Informática - Materia
-
Ciencias Informáticas
similarity searches
dynamic indexes
non conventional databases - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc-sa/4.0/
- Repositorio
.jpg)
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/191274
Ver los metadatos del registro completo
| id |
SEDICI_960b549f16b315beb9971b1be006b2ca |
|---|---|
| oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/191274 |
| network_acronym_str |
SEDICI |
| repository_id_str |
1329 |
| network_name_str |
SEDICI (UNLP) |
| spelling |
An optimization of the dynamic set of clusters for similarity searchesSamat, PabloLudueña, VerónicaReyes, Nora SusanaParedes, RodrigoFigueroa, KarinaLagos, MiguelCiencias Informáticassimilarity searchesdynamic indexesnon conventional databasesWhile metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications.Red de Universidades con Carreras en Informática2025-10info:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/publishedVersionObjeto de conferenciahttp://purl.org/coar/resource_type/c_5794info:ar-repo/semantics/documentoDeConferenciaapplication/pdf477-486http://sedici.unlp.edu.ar/handle/10915/191274enginfo:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7info:eu-repo/semantics/reference/hdl/10915/189846info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc-sa/4.0/Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-03-31T12:41:46Zoai:sedici.unlp.edu.ar:10915/191274Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-03-31 12:41:46.884SEDICI (UNLP) - Universidad Nacional de La Platafalse |
| dc.title.none.fl_str_mv |
An optimization of the dynamic set of clusters for similarity searches |
| title |
An optimization of the dynamic set of clusters for similarity searches |
| spellingShingle |
An optimization of the dynamic set of clusters for similarity searches Samat, Pablo Ciencias Informáticas similarity searches dynamic indexes non conventional databases |
| title_short |
An optimization of the dynamic set of clusters for similarity searches |
| title_full |
An optimization of the dynamic set of clusters for similarity searches |
| title_fullStr |
An optimization of the dynamic set of clusters for similarity searches |
| title_full_unstemmed |
An optimization of the dynamic set of clusters for similarity searches |
| title_sort |
An optimization of the dynamic set of clusters for similarity searches |
| dc.creator.none.fl_str_mv |
Samat, Pablo Ludueña, Verónica Reyes, Nora Susana Paredes, Rodrigo Figueroa, Karina Lagos, Miguel |
| author |
Samat, Pablo |
| author_facet |
Samat, Pablo Ludueña, Verónica Reyes, Nora Susana Paredes, Rodrigo Figueroa, Karina Lagos, Miguel |
| author_role |
author |
| author2 |
Ludueña, Verónica Reyes, Nora Susana Paredes, Rodrigo Figueroa, Karina Lagos, Miguel |
| author2_role |
author author author author author |
| dc.subject.none.fl_str_mv |
Ciencias Informáticas similarity searches dynamic indexes non conventional databases |
| topic |
Ciencias Informáticas similarity searches dynamic indexes non conventional databases |
| dc.description.none.fl_txt_mv |
While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications. Red de Universidades con Carreras en Informática |
| description |
While metric space searching offers a promising solution for efficient similarity searches, the approach lags behind traditional databases in several key areas. A primary weakness is that most indexing methods are static; very few can handle dynamic insertions and deletions cost-effectively without degrading performance. Additionally, even fewer schemes are optimized to operate efficiently on secondary storage. The Dynamic Set of Clusters (DSC) is a dynamic index built for secondary memory, but its performance is hindered by a lack of cluster compactness. To overcome this, we propose an optimization that uses "cut regions” established by global pivots, to more accurately define the data-containing zones within each cluster. This enhanced method is competitive with state-of-the-art approaches, performs well in secondary memory, and serves as a practical alternative for numerous database applications. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025-10 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject info:eu-repo/semantics/publishedVersion Objeto de conferencia http://purl.org/coar/resource_type/c_5794 info:ar-repo/semantics/documentoDeConferencia |
| format |
conferenceObject |
| status_str |
publishedVersion |
| dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/191274 |
| url |
http://sedici.unlp.edu.ar/handle/10915/191274 |
| dc.language.none.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/isbn/978-987-8258-99-7 info:eu-repo/semantics/reference/hdl/10915/189846 |
| dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| eu_rights_str_mv |
openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-sa/4.0/ Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.format.none.fl_str_mv |
application/pdf 477-486 |
| dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
| reponame_str |
SEDICI (UNLP) |
| collection |
SEDICI (UNLP) |
| instname_str |
Universidad Nacional de La Plata |
| instacron_str |
UNLP |
| institution |
UNLP |
| repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
| repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
| _version_ |
1861199752954118144 |
| score |
13.332987 |