Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction

Autores
Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina
Año de publicación
2012
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.
Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; Argentina
Fil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados Unidos
Fil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; Dinamarca
Fil: Marino Buslje, Cristina. Fundación Instituto Leloir; Argentina
Materia
CATALYTIC RESIDUES
COEVOLUTION
FUNCTIONAL SITES
MUTUAL INFORMATION
SEQUENCE ANALYSIS
SPECIFICITY DETERMINING POSITION
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/100979

id CONICETDig_a201a104747b077fe80f2b5713cca578
oai_identifier_str oai:ri.conicet.gov.ar:11336/100979
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue predictionTeppa, Roxana ElinWilkins, Angela D.Nielsen, MortenMarino Buslje, CristinaCATALYTIC RESIDUESCOEVOLUTIONFUNCTIONAL SITESMUTUAL INFORMATIONSEQUENCE ANALYSISSPECIFICITY DETERMINING POSITIONhttps://purl.org/becyt/ford/1.6https://purl.org/becyt/ford/1Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; ArgentinaFil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados UnidosFil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; DinamarcaFil: Marino Buslje, Cristina. Fundación Instituto Leloir; ArgentinaBioMed Central2012-09info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/100979Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-81471-21051471-2105CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/http://www.biomedcentral.com/1471-2105/13/235info:eu-repo/semantics/altIdentifier/doi/10.1186/1471-2105-13-235info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-29T10:35:43Zoai:ri.conicet.gov.ar:11336/100979instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-29 10:35:43.692CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
title Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
spellingShingle Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
Teppa, Roxana Elin
CATALYTIC RESIDUES
COEVOLUTION
FUNCTIONAL SITES
MUTUAL INFORMATION
SEQUENCE ANALYSIS
SPECIFICITY DETERMINING POSITION
title_short Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
title_full Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
title_fullStr Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
title_full_unstemmed Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
title_sort Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
dc.creator.none.fl_str_mv Teppa, Roxana Elin
Wilkins, Angela D.
Nielsen, Morten
Marino Buslje, Cristina
author Teppa, Roxana Elin
author_facet Teppa, Roxana Elin
Wilkins, Angela D.
Nielsen, Morten
Marino Buslje, Cristina
author_role author
author2 Wilkins, Angela D.
Nielsen, Morten
Marino Buslje, Cristina
author2_role author
author
author
dc.subject.none.fl_str_mv CATALYTIC RESIDUES
COEVOLUTION
FUNCTIONAL SITES
MUTUAL INFORMATION
SEQUENCE ANALYSIS
SPECIFICITY DETERMINING POSITION
topic CATALYTIC RESIDUES
COEVOLUTION
FUNCTIONAL SITES
MUTUAL INFORMATION
SEQUENCE ANALYSIS
SPECIFICITY DETERMINING POSITION
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.6
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.
Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; Argentina
Fil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados Unidos
Fil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; Dinamarca
Fil: Marino Buslje, Cristina. Fundación Instituto Leloir; Argentina
description Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.
publishDate 2012
dc.date.none.fl_str_mv 2012-09
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/100979
Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-8
1471-2105
1471-2105
CONICET Digital
CONICET
url http://hdl.handle.net/11336/100979
identifier_str_mv Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-8
1471-2105
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/http://www.biomedcentral.com/1471-2105/13/235
info:eu-repo/semantics/altIdentifier/doi/10.1186/1471-2105-13-235
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv BioMed Central
publisher.none.fl_str_mv BioMed Central
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1844614376385937408
score 13.069144