MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms

Autores
Tomasini, Nicolás; Lauthier, Juan José; Llewellyn, Martin S.; Diosque, Patricio
Año de publicación
2013
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
Multi-locus sequence typing (MLST) is a frequently used genotyping method whose goal is the unambiguous assignment of microorganisms to genetic clusters. MLST typically involves analysis of DNA sequence results generated from several house-keeping gene loci. MLST remains the gold standard for molecular typing of many bacterial pathogens. Eukaryotic pathogens have also been the subject of MLST, however, few tools are available to deal with diploid sequence data. Here we present novel software for MLST data analysis tailored towards diploid Eukaryotes: MLSTest. This software meets various methods used in MLST and introduces some novel methodologies for the evaluation of the data set. In addition to construction of allelic profiles and basic clustering analysis, the MLSTest looks for network structures that suggest genetic exchange in BURST graphs. Additionally, it uses several simple methods for tree construction with the advantage of managing heterozygous or three-state sites. Additionally, the software analyses whether concatenation of fragments from different genes is suitable for the data set using different tests (bionj-incongruence length difference test, Templeton test). It evaluates how the incongruence is distributed across the tree using a variation of the localized incongruence length difference test based on a modified neighbour joining algorithm. We tested the last method in simulated datasets. We showed that is conservative (adequate type I error rate) and moderately to highly powerful as well as useful to localize incongruences in two bacterial and two eukaryotic MLST datasets. MLSTest was also designed for developing MLST schemes. It thus has tools to optimize locus combinations and to reduce the number of targets required for typing. MLSTest also analyses whether the discriminatory power of the typing scheme is increased by including more loci. We evaluated the software over simulated and real datasets from bacterial and eukaryotic microorganisms. The software is freely available at http://www.ipe.unsa.edu.ar/software.
Fil: Tomasini, Nicolás. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
Fil: Lauthier, Juan José. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
Fil: Llewellyn, Martin S.. University of Wales. School of Biological Sciences; Reino Unido
Fil: Diosque, Patricio. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
Materia
Mlstest
Mlst
Incongruence
Concatenation
Scheme Optimization
Multilocus Sequence Typing
Software
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/7619

id CONICETDig_4c9b60d9e1d6a34e8fbb8c0e61337ac6
oai_identifier_str oai:ri.conicet.gov.ar:11336/7619
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organismsTomasini, NicolásLauthier, Juan JoséLlewellyn, Martin S.Diosque, PatricioMlstestMlstIncongruenceConcatenationScheme OptimizationMultilocus Sequence TypingSoftwarehttps://purl.org/becyt/ford/3.3https://purl.org/becyt/ford/3Multi-locus sequence typing (MLST) is a frequently used genotyping method whose goal is the unambiguous assignment of microorganisms to genetic clusters. MLST typically involves analysis of DNA sequence results generated from several house-keeping gene loci. MLST remains the gold standard for molecular typing of many bacterial pathogens. Eukaryotic pathogens have also been the subject of MLST, however, few tools are available to deal with diploid sequence data. Here we present novel software for MLST data analysis tailored towards diploid Eukaryotes: MLSTest. This software meets various methods used in MLST and introduces some novel methodologies for the evaluation of the data set. In addition to construction of allelic profiles and basic clustering analysis, the MLSTest looks for network structures that suggest genetic exchange in BURST graphs. Additionally, it uses several simple methods for tree construction with the advantage of managing heterozygous or three-state sites. Additionally, the software analyses whether concatenation of fragments from different genes is suitable for the data set using different tests (bionj-incongruence length difference test, Templeton test). It evaluates how the incongruence is distributed across the tree using a variation of the localized incongruence length difference test based on a modified neighbour joining algorithm. We tested the last method in simulated datasets. We showed that is conservative (adequate type I error rate) and moderately to highly powerful as well as useful to localize incongruences in two bacterial and two eukaryotic MLST datasets. MLSTest was also designed for developing MLST schemes. It thus has tools to optimize locus combinations and to reduce the number of targets required for typing. MLSTest also analyses whether the discriminatory power of the typing scheme is increased by including more loci. We evaluated the software over simulated and real datasets from bacterial and eukaryotic microorganisms. The software is freely available at http://www.ipe.unsa.edu.ar/software.Fil: Tomasini, Nicolás. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; ArgentinaFil: Lauthier, Juan José. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; ArgentinaFil: Llewellyn, Martin S.. University of Wales. School of Biological Sciences; Reino UnidoFil: Diosque, Patricio. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; ArgentinaElsevier Science2013-09-08info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/7619Tomasini, Nicolás; Lauthier, Juan José; Llewellyn, Martin S.; Diosque, Patricio; MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms; Elsevier Science; Infection, Genetics And Evolution; 20; 8-9-2013; 188-1961567-1348enginfo:eu-repo/semantics/altIdentifier/doi/10.1016/j.meegid.2013.08.029info:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S1567134813003419info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-nd/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-10-15T14:56:42Zoai:ri.conicet.gov.ar:11336/7619instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-10-15 14:56:42.764CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
title MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
spellingShingle MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
Tomasini, Nicolás
Mlstest
Mlst
Incongruence
Concatenation
Scheme Optimization
Multilocus Sequence Typing
Software
title_short MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
title_full MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
title_fullStr MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
title_full_unstemmed MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
title_sort MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms
dc.creator.none.fl_str_mv Tomasini, Nicolás
Lauthier, Juan José
Llewellyn, Martin S.
Diosque, Patricio
author Tomasini, Nicolás
author_facet Tomasini, Nicolás
Lauthier, Juan José
Llewellyn, Martin S.
Diosque, Patricio
author_role author
author2 Lauthier, Juan José
Llewellyn, Martin S.
Diosque, Patricio
author2_role author
author
author
dc.subject.none.fl_str_mv Mlstest
Mlst
Incongruence
Concatenation
Scheme Optimization
Multilocus Sequence Typing
Software
topic Mlstest
Mlst
Incongruence
Concatenation
Scheme Optimization
Multilocus Sequence Typing
Software
purl_subject.fl_str_mv https://purl.org/becyt/ford/3.3
https://purl.org/becyt/ford/3
dc.description.none.fl_txt_mv Multi-locus sequence typing (MLST) is a frequently used genotyping method whose goal is the unambiguous assignment of microorganisms to genetic clusters. MLST typically involves analysis of DNA sequence results generated from several house-keeping gene loci. MLST remains the gold standard for molecular typing of many bacterial pathogens. Eukaryotic pathogens have also been the subject of MLST, however, few tools are available to deal with diploid sequence data. Here we present novel software for MLST data analysis tailored towards diploid Eukaryotes: MLSTest. This software meets various methods used in MLST and introduces some novel methodologies for the evaluation of the data set. In addition to construction of allelic profiles and basic clustering analysis, the MLSTest looks for network structures that suggest genetic exchange in BURST graphs. Additionally, it uses several simple methods for tree construction with the advantage of managing heterozygous or three-state sites. Additionally, the software analyses whether concatenation of fragments from different genes is suitable for the data set using different tests (bionj-incongruence length difference test, Templeton test). It evaluates how the incongruence is distributed across the tree using a variation of the localized incongruence length difference test based on a modified neighbour joining algorithm. We tested the last method in simulated datasets. We showed that is conservative (adequate type I error rate) and moderately to highly powerful as well as useful to localize incongruences in two bacterial and two eukaryotic MLST datasets. MLSTest was also designed for developing MLST schemes. It thus has tools to optimize locus combinations and to reduce the number of targets required for typing. MLSTest also analyses whether the discriminatory power of the typing scheme is increased by including more loci. We evaluated the software over simulated and real datasets from bacterial and eukaryotic microorganisms. The software is freely available at http://www.ipe.unsa.edu.ar/software.
Fil: Tomasini, Nicolás. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
Fil: Lauthier, Juan José. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
Fil: Llewellyn, Martin S.. University of Wales. School of Biological Sciences; Reino Unido
Fil: Diosque, Patricio. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Salta. Instituto de Patología Experimental; Argentina. Universidad Nacional de Salta; Argentina
description Multi-locus sequence typing (MLST) is a frequently used genotyping method whose goal is the unambiguous assignment of microorganisms to genetic clusters. MLST typically involves analysis of DNA sequence results generated from several house-keeping gene loci. MLST remains the gold standard for molecular typing of many bacterial pathogens. Eukaryotic pathogens have also been the subject of MLST, however, few tools are available to deal with diploid sequence data. Here we present novel software for MLST data analysis tailored towards diploid Eukaryotes: MLSTest. This software meets various methods used in MLST and introduces some novel methodologies for the evaluation of the data set. In addition to construction of allelic profiles and basic clustering analysis, the MLSTest looks for network structures that suggest genetic exchange in BURST graphs. Additionally, it uses several simple methods for tree construction with the advantage of managing heterozygous or three-state sites. Additionally, the software analyses whether concatenation of fragments from different genes is suitable for the data set using different tests (bionj-incongruence length difference test, Templeton test). It evaluates how the incongruence is distributed across the tree using a variation of the localized incongruence length difference test based on a modified neighbour joining algorithm. We tested the last method in simulated datasets. We showed that is conservative (adequate type I error rate) and moderately to highly powerful as well as useful to localize incongruences in two bacterial and two eukaryotic MLST datasets. MLSTest was also designed for developing MLST schemes. It thus has tools to optimize locus combinations and to reduce the number of targets required for typing. MLSTest also analyses whether the discriminatory power of the typing scheme is increased by including more loci. We evaluated the software over simulated and real datasets from bacterial and eukaryotic microorganisms. The software is freely available at http://www.ipe.unsa.edu.ar/software.
publishDate 2013
dc.date.none.fl_str_mv 2013-09-08
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/7619
Tomasini, Nicolás; Lauthier, Juan José; Llewellyn, Martin S.; Diosque, Patricio; MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms; Elsevier Science; Infection, Genetics And Evolution; 20; 8-9-2013; 188-196
1567-1348
url http://hdl.handle.net/11336/7619
identifier_str_mv Tomasini, Nicolás; Lauthier, Juan José; Llewellyn, Martin S.; Diosque, Patricio; MLSTest: Novel software for multi-locus sequence data analysis in eukaryotic organisms; Elsevier Science; Infection, Genetics And Evolution; 20; 8-9-2013; 188-196
1567-1348
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/doi/10.1016/j.meegid.2013.08.029
info:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S1567134813003419
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Elsevier Science
publisher.none.fl_str_mv Elsevier Science
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1846083103185960960
score 13.221938