LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope
- Autores
- Alberca, Lucas Nicolás; Prada Gori, Denis Nihuel; Fallico, Maximiliano José; Fassio, Alexandre V.; Talevi, Alan; Bellera, Carolina Leticia
- Año de publicación
- 2025
- Idioma
- inglés
- Tipo de recurso
- artículo
- Estado
- versión publicada
- Descripción
- In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.
Laboratorio de Investigación y Desarrollo de Bioactivos - Materia
-
Biología
Decoys
Decoy generation
Retrospective screening
Cheminformatics
Open-source - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- http://creativecommons.org/licenses/by-nc/4.0/
- Repositorio
.jpg)
- Institución
- Universidad Nacional de La Plata
- OAI Identificador
- oai:sedici.unlp.edu.ar:10915/189520
Ver los metadatos del registro completo
| id |
SEDICI_844ef6fb81db14b5d20d75b44f24968b |
|---|---|
| oai_identifier_str |
oai:sedici.unlp.edu.ar:10915/189520 |
| network_acronym_str |
SEDICI |
| repository_id_str |
1329 |
| network_name_str |
SEDICI (UNLP) |
| spelling |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scopeAlberca, Lucas NicolásPrada Gori, Denis NihuelFallico, Maximiliano JoséFassio, Alexandre V.Talevi, AlanBellera, Carolina LeticiaBiologíaDecoysDecoy generationRetrospective screeningCheminformaticsOpen-sourceIn the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches.Laboratorio de Investigación y Desarrollo de Bioactivos2025-02info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionArticulohttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfhttp://sedici.unlp.edu.ar/handle/10915/189520enginfo:eu-repo/semantics/altIdentifier/issn/2667-3185info:eu-repo/semantics/altIdentifier/doi/10.1016/j.ailsci.2025.100129info:eu-repo/semantics/openAccesshttp://creativecommons.org/licenses/by-nc/4.0/Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)reponame:SEDICI (UNLP)instname:Universidad Nacional de La Platainstacron:UNLP2026-01-07T13:36:27Zoai:sedici.unlp.edu.ar:10915/189520Institucionalhttp://sedici.unlp.edu.ar/Universidad públicaNo correspondehttp://sedici.unlp.edu.ar/oai/snrdalira@sedici.unlp.edu.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:13292026-01-07 13:36:28.293SEDICI (UNLP) - Universidad Nacional de La Platafalse |
| dc.title.none.fl_str_mv |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| title |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| spellingShingle |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope Alberca, Lucas Nicolás Biología Decoys Decoy generation Retrospective screening Cheminformatics Open-source |
| title_short |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| title_full |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| title_fullStr |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| title_full_unstemmed |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| title_sort |
LIDEB’s Useful Decoys (LUDe): A freely available decoy-generation tool: Benchmarking and scope |
| dc.creator.none.fl_str_mv |
Alberca, Lucas Nicolás Prada Gori, Denis Nihuel Fallico, Maximiliano José Fassio, Alexandre V. Talevi, Alan Bellera, Carolina Leticia |
| author |
Alberca, Lucas Nicolás |
| author_facet |
Alberca, Lucas Nicolás Prada Gori, Denis Nihuel Fallico, Maximiliano José Fassio, Alexandre V. Talevi, Alan Bellera, Carolina Leticia |
| author_role |
author |
| author2 |
Prada Gori, Denis Nihuel Fallico, Maximiliano José Fassio, Alexandre V. Talevi, Alan Bellera, Carolina Leticia |
| author2_role |
author author author author author |
| dc.subject.none.fl_str_mv |
Biología Decoys Decoy generation Retrospective screening Cheminformatics Open-source |
| topic |
Biología Decoys Decoy generation Retrospective screening Cheminformatics Open-source |
| dc.description.none.fl_txt_mv |
In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches. Laboratorio de Investigación y Desarrollo de Bioactivos |
| description |
In the field of chemoinformatics, and in particular, when developing models to be applied in virtual screening campaigns, it is essential to run retrospective virtual screening experiments that evaluate the performance of such models in a scenario similar to the real one. That is, the ability to recover a small number of active compounds dispersed among a much larger number of compounds without the desired activity. However, such a retrospective experiment is often limited by the relative scarcity of known inactive compounds against the pharmacological target of interest. In these cases, automatic decoy (putative inactive compound) generation tools are often of great importance. Their basic goal is to generate decoys that are similar enough to the known active compounds to challenge the models, but different enough so that the probability that the decoys modulate the molecular target of interest is small. In this article, we report the latest version of our open-source decoy generation tool LUDe, inspired by the well-known DUD-E but designed to reduce the probability of generating decoys topologically similar to known active compounds. We have carried out a benchmarking exercise against DUD-E through 102 pharmacological targets, using the DOE score and the Doppelganger score as comparison criteria. LUDe decoys obtained better DOE scores across most of the targets, indicating a lower risk of artificial enrichment. The mean Doppelganger score, in contrast, was similar for LUDe and DUD-E decoys, exhibiting a slight improvement for LUDe decoys for most of the targets. Simulation experiments were performed to verify whether the generated decoys are unsuitable to validate ligand-based models. Our results suggest that LUDe decoys are apt to be used to validate and compare machine learning ligand-based screening approaches. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025-02 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion Articulo http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
| format |
article |
| status_str |
publishedVersion |
| dc.identifier.none.fl_str_mv |
http://sedici.unlp.edu.ar/handle/10915/189520 |
| url |
http://sedici.unlp.edu.ar/handle/10915/189520 |
| dc.language.none.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/issn/2667-3185 info:eu-repo/semantics/altIdentifier/doi/10.1016/j.ailsci.2025.100129 |
| dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) |
| eu_rights_str_mv |
openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.source.none.fl_str_mv |
reponame:SEDICI (UNLP) instname:Universidad Nacional de La Plata instacron:UNLP |
| reponame_str |
SEDICI (UNLP) |
| collection |
SEDICI (UNLP) |
| instname_str |
Universidad Nacional de La Plata |
| instacron_str |
UNLP |
| institution |
UNLP |
| repository.name.fl_str_mv |
SEDICI (UNLP) - Universidad Nacional de La Plata |
| repository.mail.fl_str_mv |
alira@sedici.unlp.edu.ar |
| _version_ |
1853683336049852416 |
| score |
13.25844 |