LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope

Autores
Alberca, Lucas Nicolás; Prada Gori, Denis Nihuel; Fallico, Maximiliano José; Fassio, Alexandre V.; Talevi, Alan; Bellera, Carolina Leticia
Año de publicación
2025
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.
Laboratorio de Investigación y Desarrollo de Bioactivos
Materia
Biología
Decoys
Decoy generation
Retrospective screening
Cheminformatics
Open-source
Nivel de accesibilidad
acceso abierto
Condiciones de uso
http://creativecommons.org/licenses/by-nc/4.0/
Repositorio
SEDICI (UNLP)
Institución
Universidad Nacional de La Plata
OAI Identificador
oai:sedici.unlp.edu.ar:10915/189520

id SEDICI_844ef6fb81db14b5d20d75b44f24968b
oai_identifier_str oai:sedici.unlp.edu.ar:10915/189520
network_acronym_str SEDICI
repository_id_str 1329
network_name_str SEDICI (UNLP)
spelling LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scopeAlberca, Lucas NicolásPrada Gori, Denis NihuelFallico, Maximiliano JoséFassio, Alexandre V.Talevi, AlanBellera, Carolina LeticiaBiologíaDecoysDecoy generationRetrospective screeningCheminformaticsOpen-sourceIn the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.Laboratorio de Investigación y Desarrollo de Bioactivos2025-02info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionArticulohttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfhttp://sedici.unlp.edu.ar/handle/10915/189520enginfo:eu-repo/semantics/altIdentifier/issn/2667-3185info:eu-repo/semantics/altIdentifier/doi/10.1016/j.ailsci.2025.100129info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc/4.0/Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-01-07T13:36:27Zoai:sedici.unlp.edu.ar:10915/189520Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-01-07 13:36:28.293SEDICI (UNLP) - Universidad Nacional de La Platafalse
dc.title.none.fl_str_mv LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
title LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
spellingShingle LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
Alberca, Lucas Nicolás
Biología
Decoys
Decoy generation
Retrospective screening
Cheminformatics
Open-source
title_short LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
title_full LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
title_fullStr LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
title_full_unstemmed LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
title_sort LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
dc.creator.none.fl_str_mv Alberca, Lucas Nicolás
Prada Gori, Denis Nihuel
Fallico, Maximiliano José
Fassio, Alexandre V.
Talevi, Alan
Bellera, Carolina Leticia
author Alberca, Lucas Nicolás
author_facet Alberca, Lucas Nicolás
Prada Gori, Denis Nihuel
Fallico, Maximiliano José
Fassio, Alexandre V.
Talevi, Alan
Bellera, Carolina Leticia
author_role author
author2 Prada Gori, Denis Nihuel
Fallico, Maximiliano José
Fassio, Alexandre V.
Talevi, Alan
Bellera, Carolina Leticia
author2_role author
author
author
author
author
dc.subject.none.fl_str_mv Biología
Decoys
Decoy generation
Retrospective screening
Cheminformatics
Open-source
topic Biología
Decoys
Decoy generation
Retrospective screening
Cheminformatics
Open-source
dc.description.none.fl_txt_mv In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.
Laboratorio de Investigación y Desarrollo de Bioactivos
description In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.
publishDate 2025
dc.date.none.fl_str_mv 2025-02
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Articulo
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://sedici.unlp.edu.ar/handle/10915/189520
url http://sedici.unlp.edu.ar/handle/10915/189520
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/issn/2667-3185
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.ailsci.2025.100129
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
http://creativecommons.org/licenses/by-nc/4.0/
Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
eu_rights_str_mv openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc/4.0/
Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:SEDICI (UNLP)
instname:Universidad Nacional de La Plata
instacron:UNLP
reponame_str SEDICI (UNLP)
collection SEDICI (UNLP)
instname_str Universidad Nacional de La Plata
instacron_str UNLP
institution UNLP
repository.name.fl_str_mv SEDICI (UNLP) - Universidad Nacional de La Plata
repository.mail.fl_str_mv alira@sedici.unlp.edu.ar
_version_ 1853683336049852416
score 13.25844