The Development of More Accurate QSAR Techniques
- Autores
- Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román
- Año de publicación
- 2012
- Idioma
- inglés
- Tipo de recurso
- artículo
- Estado
- versión publicada
- Descripción
- QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.
Fil: A. Lee.
Fil: Mercader, Andrew Gustavo.
Fil: E. A. Castro.
Fil: Duchowicz, Pablo Román. - Materia
-
Qsar Theory
Replacement Method
K-Means Clustering
Molecular Descriptors - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
- Repositorio
- Institución
- Consejo Nacional de Investigaciones Científicas y Técnicas
- OAI Identificador
- oai:ri.conicet.gov.ar:11336/103247
Ver los metadatos del registro completo
id |
CONICETDig_baebd49f1173e08d04e41b5e5ea80449 |
---|---|
oai_identifier_str |
oai:ri.conicet.gov.ar:11336/103247 |
network_acronym_str |
CONICETDig |
repository_id_str |
3498 |
network_name_str |
CONICET Digital (CONICET) |
spelling |
The Development of More Accurate QSAR TechniquesLee, AdamMercader, Andrew GustavoCastro, Eduardo AlbertoDuchowicz, Pablo RománQsar TheoryReplacement MethodK-Means ClusteringMolecular Descriptorshttps://purl.org/becyt/ford/1.4https://purl.org/becyt/ford/1QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method.Fil: A. Lee.Fil: Mercader, Andrew Gustavo.Fil: E. A. Castro.Fil: Duchowicz, Pablo Román.The SciTech Publishers2012-05info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/103247Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-392278-53292348-098XCONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/https://sites.google.com/a/thescitechpub.com/thescitech/issuesinfo:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-03T09:49:15Zoai:ri.conicet.gov.ar:11336/103247instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-03 09:49:15.423CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse |
dc.title.none.fl_str_mv |
The Development of More Accurate QSAR Techniques |
title |
The Development of More Accurate QSAR Techniques |
spellingShingle |
The Development of More Accurate QSAR Techniques Lee, Adam Qsar Theory Replacement Method K-Means Clustering Molecular Descriptors |
title_short |
The Development of More Accurate QSAR Techniques |
title_full |
The Development of More Accurate QSAR Techniques |
title_fullStr |
The Development of More Accurate QSAR Techniques |
title_full_unstemmed |
The Development of More Accurate QSAR Techniques |
title_sort |
The Development of More Accurate QSAR Techniques |
dc.creator.none.fl_str_mv |
Lee, Adam Mercader, Andrew Gustavo Castro, Eduardo Alberto Duchowicz, Pablo Román |
author |
Lee, Adam |
author_facet |
Lee, Adam Mercader, Andrew Gustavo Castro, Eduardo Alberto Duchowicz, Pablo Román |
author_role |
author |
author2 |
Mercader, Andrew Gustavo Castro, Eduardo Alberto Duchowicz, Pablo Román |
author2_role |
author author author |
dc.subject.none.fl_str_mv |
Qsar Theory Replacement Method K-Means Clustering Molecular Descriptors |
topic |
Qsar Theory Replacement Method K-Means Clustering Molecular Descriptors |
purl_subject.fl_str_mv |
https://purl.org/becyt/ford/1.4 https://purl.org/becyt/ford/1 |
dc.description.none.fl_txt_mv |
QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method. Fil: A. Lee. Fil: Mercader, Andrew Gustavo. Fil: E. A. Castro. Fil: Duchowicz, Pablo Román. |
description |
QSAR is a very effective starting step in the development of compounds for vast numbers of industries. Its scale and importance, especially in the medicinal field means it is a dynamic area to research. The size of QSAR also presents problems; there are many different methods in use for each of the steps in a study, from the descriptors in use, to the type of linear regression to apply to the descriptors. The idea was to put forward models that improved upon the existing methods to such a degree that it could become a universal method for QSAR modelling. This project successfully investigated in detail an improvement to the existing methods to choose the correct number of descriptors to include in the model by using Rloo analysis; this resulted in a simpler model compared to previous methods. K – Means clustering was also investigated as part of a novel, variable independent method. This methodology only uses one descriptor as opposed to general QSAR studies which use several. The results for 12 out of the 14 sets were at least as accurate as the results obtained by existing linear methods. An example using PERM; the Stest obtained using the novel method was 0.46 compared to the Stest of 0.53 obtained by using current linear methods. The simplicity associated with the K - Means clustering method and the fact it shows improved predictive potential could lead to an overhaul of all current, more complicated methods in favour of the simpler K- Means based method. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012-05 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
format |
article |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://hdl.handle.net/11336/103247 Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-39 2278-5329 2348-098X CONICET Digital CONICET |
url |
http://hdl.handle.net/11336/103247 |
identifier_str_mv |
Lee, Adam; Mercader, Andrew Gustavo; Castro, Eduardo Alberto; Duchowicz, Pablo Román; The Development of More Accurate QSAR Techniques; The SciTech Publishers; The SciTech, Journal of Science & Technology; 1; 1; 5-2012; 3-39 2278-5329 2348-098X CONICET Digital CONICET |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/url/https://sites.google.com/a/thescitechpub.com/thescitech/issues |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
dc.format.none.fl_str_mv |
application/pdf application/pdf application/pdf |
dc.publisher.none.fl_str_mv |
The SciTech Publishers |
publisher.none.fl_str_mv |
The SciTech Publishers |
dc.source.none.fl_str_mv |
reponame:CONICET Digital (CONICET) instname:Consejo Nacional de Investigaciones Científicas y Técnicas |
reponame_str |
CONICET Digital (CONICET) |
collection |
CONICET Digital (CONICET) |
instname_str |
Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.name.fl_str_mv |
CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.mail.fl_str_mv |
dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar |
_version_ |
1842268962550710272 |
score |
13.13397 |