Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction
- Autores
- Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina
- Año de publicación
- 2012
- Idioma
- inglés
- Tipo de recurso
- artículo
- Estado
- versión publicada
- Descripción
- Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.
Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; Argentina
Fil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados Unidos
Fil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; Dinamarca
Fil: Marino Buslje, Cristina. Fundación Instituto Leloir; Argentina - Materia
-
CATALYTIC RESIDUES
COEVOLUTION
FUNCTIONAL SITES
MUTUAL INFORMATION
SEQUENCE ANALYSIS
SPECIFICITY DETERMINING POSITION - Nivel de accesibilidad
- acceso abierto
- Condiciones de uso
- https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
- Repositorio
- Institución
- Consejo Nacional de Investigaciones Científicas y Técnicas
- OAI Identificador
- oai:ri.conicet.gov.ar:11336/100979
Ver los metadatos del registro completo
id |
CONICETDig_a201a104747b077fe80f2b5713cca578 |
---|---|
oai_identifier_str |
oai:ri.conicet.gov.ar:11336/100979 |
network_acronym_str |
CONICETDig |
repository_id_str |
3498 |
network_name_str |
CONICET Digital (CONICET) |
spelling |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue predictionTeppa, Roxana ElinWilkins, Angela D.Nielsen, MortenMarino Buslje, CristinaCATALYTIC RESIDUESCOEVOLUTIONFUNCTIONAL SITESMUTUAL INFORMATIONSEQUENCE ANALYSISSPECIFICITY DETERMINING POSITIONhttps://purl.org/becyt/ford/1.6https://purl.org/becyt/ford/1Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd.Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; ArgentinaFil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados UnidosFil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; DinamarcaFil: Marino Buslje, Cristina. Fundación Instituto Leloir; ArgentinaBioMed Central2012-09info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/100979Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-81471-21051471-2105CONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/http://www.biomedcentral.com/1471-2105/13/235info:eu-repo/semantics/altIdentifier/doi/10.1186/1471-2105-13-235info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-29T10:35:43Zoai:ri.conicet.gov.ar:11336/100979instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-29 10:35:43.692CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse |
dc.title.none.fl_str_mv |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
title |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
spellingShingle |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction Teppa, Roxana Elin CATALYTIC RESIDUES COEVOLUTION FUNCTIONAL SITES MUTUAL INFORMATION SEQUENCE ANALYSIS SPECIFICITY DETERMINING POSITION |
title_short |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
title_full |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
title_fullStr |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
title_full_unstemmed |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
title_sort |
Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction |
dc.creator.none.fl_str_mv |
Teppa, Roxana Elin Wilkins, Angela D. Nielsen, Morten Marino Buslje, Cristina |
author |
Teppa, Roxana Elin |
author_facet |
Teppa, Roxana Elin Wilkins, Angela D. Nielsen, Morten Marino Buslje, Cristina |
author_role |
author |
author2 |
Wilkins, Angela D. Nielsen, Morten Marino Buslje, Cristina |
author2_role |
author author author |
dc.subject.none.fl_str_mv |
CATALYTIC RESIDUES COEVOLUTION FUNCTIONAL SITES MUTUAL INFORMATION SEQUENCE ANALYSIS SPECIFICITY DETERMINING POSITION |
topic |
CATALYTIC RESIDUES COEVOLUTION FUNCTIONAL SITES MUTUAL INFORMATION SEQUENCE ANALYSIS SPECIFICITY DETERMINING POSITION |
purl_subject.fl_str_mv |
https://purl.org/becyt/ford/1.6 https://purl.org/becyt/ford/1 |
dc.description.none.fl_txt_mv |
Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd. Fil: Teppa, Roxana Elin. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Fundación Instituto Leloir; Argentina Fil: Wilkins, Angela D.. Baylor College of Medicine. Department of Molecular and Human Genetics; Estados Unidos Fil: Nielsen, Morten. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata. Instituto de Investigaciones Biotecnológicas. Universidad Nacional de San Martín. Instituto de Investigaciones Biotecnológicas; Argentina. Technical University of Denmark; Dinamarca Fil: Marino Buslje, Cristina. Fundación Instituto Leloir; Argentina |
description |
Background: A large panel of methods exists that aim to identify residues with critical impact on protein function based on evolutionary signals, sequence and structure information. However, it is not clear to what extent these different methods overlap, and if any of the methods have higher predictive potential compared to others when it comes to, in particular, the identification of catalytic residues (CR) in proteins. Using a large set of enzymatic protein families and measures based on different evolutionary signals, we sought to break up the different components of the information content within a multiple sequence alignment to investigate their predictive potential and degree of overlap.Results: Our results demonstrate that the different methods included in the benchmark in general can be divided into three groups with a limited mutual overlap. One group containing real-value Evolutionary Trace (rvET) methods and conservation, another containing mutual information (MI) methods, and the last containing methods designed explicitly for the identification of specificity determining positions (SDPs): integer-value Evolutionary Trace (ivET), SDPfox, and XDET. In terms of prediction of CR, we find using a proximity score integrating structural information (as the sum of the scores of residues located within a given distance of the residue in question) that only the methods from the first two groups displayed a reliable performance. Next, we investigated to what degree proximity scores for conservation, rvET and cumulative MI (cMI) provide complementary information capable of improving the performance for CR identification. We found that integrating conservation with proximity scores for rvET and cMI achieved the highest performance. The proximity conservation score contained no complementary information when integrated with proximity rvET. Moreover, the signal from rvET provided only a limited gain in predictive performance when integrated with mutual information and conservation proximity scores. Combined, these observations demonstrate that the rvET and cMI scores add complementary information to the prediction system.Conclusions: This work contributes to the understanding of the different signals of evolution and also shows that it is possible to improve the detection of catalytic residues by integrating structural and higher order sequence evolutionary information with sequence conservation. © 2012 Teppa et al.; licensee BioMed Central Ltd. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012-09 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion http://purl.org/coar/resource_type/c_6501 info:ar-repo/semantics/articulo |
format |
article |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://hdl.handle.net/11336/100979 Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-8 1471-2105 1471-2105 CONICET Digital CONICET |
url |
http://hdl.handle.net/11336/100979 |
identifier_str_mv |
Teppa, Roxana Elin; Wilkins, Angela D.; Nielsen, Morten; Marino Buslje, Cristina; Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction; BioMed Central; BMC Bioinformatics; 13; 1; 9-2012; 1-8 1471-2105 CONICET Digital CONICET |
dc.language.none.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
info:eu-repo/semantics/altIdentifier/url/http://www.biomedcentral.com/1471-2105/13/235 info:eu-repo/semantics/altIdentifier/doi/10.1186/1471-2105-13-235 |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ |
dc.format.none.fl_str_mv |
application/pdf application/pdf application/pdf application/pdf |
dc.publisher.none.fl_str_mv |
BioMed Central |
publisher.none.fl_str_mv |
BioMed Central |
dc.source.none.fl_str_mv |
reponame:CONICET Digital (CONICET) instname:Consejo Nacional de Investigaciones Científicas y Técnicas |
reponame_str |
CONICET Digital (CONICET) |
collection |
CONICET Digital (CONICET) |
instname_str |
Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.name.fl_str_mv |
CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas |
repository.mail.fl_str_mv |
dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar |
_version_ |
1844614376385937408 |
score |
13.069144 |