The Development of More Accurate QSAR Techniques

Autores
Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román
Año de publicación
2012
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.
Fil: A. Lee.
Fil: Mercader, Andrew Gustavo.
Fil: E. A. Castro.
Fil: Duchowicz, Pablo Román.
Materia
Qsar Theory
Replacement Method
K-Means Clustering
Molecular Descriptors
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/103247

id CONICETDig_baebd49f1173e08d04e41b5e5ea80449
oai_identifier_str oai:ri.conicet.gov.ar:11336/103247
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling The Development of More Accurate QSAR TechniquesLee, AdamMercader, Andrew GustavoCastro, Eduardo AlbertoDuchowicz, Pablo RománQsar TheoryReplacement MethodK-Means ClusteringMolecular Descriptorshttps://purl.org/becyt/ford/1.4https://purl.org/becyt/ford/1QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.Fil: A. Lee.Fil: Mercader, Andrew Gustavo.Fil: E. A. Castro.Fil: Duchowicz, Pablo Román.The SciTech Publishers2012-05info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/103247Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-392278-53292348-098XCONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/https://sites.google.com/a/thescitechpub.com/thescitech/issuesinfo:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-03T09:49:15Zoai:ri.conicet.gov.ar:11336/103247instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-03 09:49:15.423CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv The Development of More Accurate QSAR Techniques
title The Development of More Accurate QSAR Techniques
spellingShingle The Development of More Accurate QSAR Techniques
Lee, Adam
Qsar Theory
Replacement Method
K-Means Clustering
Molecular Descriptors
title_short The Development of More Accurate QSAR Techniques
title_full The Development of More Accurate QSAR Techniques
title_fullStr The Development of More Accurate QSAR Techniques
title_full_unstemmed The Development of More Accurate QSAR Techniques
title_sort The Development of More Accurate QSAR Techniques
dc.creator.none.fl_str_mv Lee, Adam
Mercader, Andrew Gustavo
Castro, Eduardo Alberto
Duchowicz, Pablo Román
author Lee, Adam
author_facet Lee, Adam
Mercader, Andrew Gustavo
Castro, Eduardo Alberto
Duchowicz, Pablo Román
author_role author
author2 Mercader, Andrew Gustavo
Castro, Eduardo Alberto
Duchowicz, Pablo Román
author2_role author
author
author
dc.subject.none.fl_str_mv Qsar Theory
Replacement Method
K-Means Clustering
Molecular Descriptors
topic Qsar Theory
Replacement Method
K-Means Clustering
Molecular Descriptors
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.4
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.
Fil: A. Lee.
Fil: Mercader, Andrew Gustavo.
Fil: E. A. Castro.
Fil: Duchowicz, Pablo Román.
description QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.
publishDate 2012
dc.date.none.fl_str_mv 2012-05
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/103247
Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-39
2278-5329
2348-098X
CONICET Digital
CONICET
url http://hdl.handle.net/11336/103247
identifier_str_mv Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-39
2278-5329
2348-098X
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/https://sites.google.com/a/thescitechpub.com/thescitech/issues
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv The SciTech Publishers
publisher.none.fl_str_mv The SciTech Publishers
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1842268962550710272
score 13.13397