Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)

Autores
Rozadilla, Gastón; Mccarthy, Cristina Beryl
Año de publicación
2024
Idioma
inglés
Tipo de recurso
conjunto de datos
Estado
Descripción
Spodoptera frugiperda is a noctuid moth that devastates various crops including corn, rice and cotton, and is found in most of the American continent. The purpose of this study was to integrate gene expression data from S. frugiperda guts and their associated metatranscriptomes, under natural and controlled conditions. For this, four S. frugiperda samples from the province of Tucumán (Argentina; subtropical region) were analysed. Specimens were obtained from different environments, altitudes and food sources, namely: 1) a transgenic maize (Zea mays) field at 495 m.a.s.l. where insecticides and fertilisers were applied (named MM; 26o49’50”S; 65o16’59.4”W); 2) Sorghum halepense at 495 m.a.s.l. (MS; 26o49’50”S; 65o16’59.4”W); 3) a maize field at 2283 m.a.s.l. where no insecticides or fertilisers were used (TV; 26o55’40.75”S; 65o45’19.90”W) ; and 4) a colony established from larvae originally collected from the same transgenic maize field as Sf_MM, reared for 9 generations under controlled conditions on an artificial diet adapted from [8], without the addition of antibiotics (BT). For all samples, total RNA extracted from fifth instar larvae guts (two digestive tracts per sample), was submitted to a modified one-step reverse transcription and polymerase chain reaction sequence-independent amplification procedure, as described previously. High-throughput pyrosequencing of the samples was performed using a Roche GS FLX (Macrogen Inc., Korea), yielding ~1Gb of metatranscriptomic reads with lengths of 50 to 1600 bases (nt) (652 nt average). Raw sequence reads were trimmed to remove nucleotides derived from the amplification primers using a custom application. Below follows an outline of the main steps we followed to create the uploaded databases: I.Sequences were compared locally to a combined nucleotide database (nt16SLep = “Non-redundant” nucleotide sequence (nt) database + 16S rRNA gene (16S) database + Lepidopteran whole genome shotgun (Lep) projects completed at the time of the analysis) using BLASTN (Altschul et al., 1990) with a 1e-50 cutoff E-value, and to the protein database (nr = non-redundant protein sequence) using Diamond (Buchfink et al., 2014) with a 1e-17 cutoff E-value. II.The homology search results were then processed as follows: Step A: The output files from both homology searches were processed with MEGAN, a software which performs taxonomic binning and assigns sequences to taxa using the Lowest Common Ancestor (LCA)-assignment algorithm (Huson et al., 2007). Taxonomic and functional assignments performed by MEGAN for each sequence were then exported using a MEGAN functionality. Note: MEGAN computes a “species profile” by finding the lowest node in the NCBI taxonomy that encompasses the set of hit taxa and assigns the sequence to the taxon represented by that lowest node. With this approach, every sequence is assigned to some taxon; if the sequence aligns very specifically only to a single taxon, then it is assigned to that taxon; the less specifically a sequence hits taxa, the higher up in the taxonomy it is placed. Step B: The output files from both homology searches were also processed with a custom bash script. This script parses the homology search output files and generates two files (one for each homology search) containing the name of each sequence, its best hit (or no hit) and the corresponding E-value. III. Create local database (Step C): All this information (from the exported MEGAN files and from the bash script output files) was then used to create a local SQLite database which included all the available information for each sequence (from both homology searches).
Fil: Rozadilla, Gastón. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; Argentina
Fil: Mccarthy, Cristina Beryl. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; Argentina
Nivel de accesibilidad
acceso restringido
Condiciones de uso
Datos sujetos al derecho de propiedad intelectual
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/234791

id CONICETDig_ceb5dfdd5dded358ab95ebe0c872763b
oai_identifier_str oai:ri.conicet.gov.ar:11336/234791
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)Rozadilla, GastónMccarthy, Cristina Berylhttps://purl.org/becyt/ford/1.6https://purl.org/becyt/ford/1Spodoptera frugiperda is a noctuid moth that devastates various crops including corn, rice and cotton, and is found in most of the American continent. The purpose of this study was to integrate gene expression data from S. frugiperda guts and their associated metatranscriptomes, under natural and controlled conditions. For this, four S. frugiperda samples from the province of Tucumán (Argentina; subtropical region) were analysed. Specimens were obtained from different environments, altitudes and food sources, namely: 1) a transgenic maize (Zea mays) field at 495 m.a.s.l. where insecticides and fertilisers were applied (named MM; 26o49’50”S; 65o16’59.4”W); 2) Sorghum halepense at 495 m.a.s.l. (MS; 26o49’50”S; 65o16’59.4”W); 3) a maize field at 2283 m.a.s.l. where no insecticides or fertilisers were used (TV; 26o55’40.75”S; 65o45’19.90”W) ; and 4) a colony established from larvae originally collected from the same transgenic maize field as Sf_MM, reared for 9 generations under controlled conditions on an artificial diet adapted from [8], without the addition of antibiotics (BT). For all samples, total RNA extracted from fifth instar larvae guts (two digestive tracts per sample), was submitted to a modified one-step reverse transcription and polymerase chain reaction sequence-independent amplification procedure, as described previously. High-throughput pyrosequencing of the samples was performed using a Roche GS FLX (Macrogen Inc., Korea), yielding ~1Gb of metatranscriptomic reads with lengths of 50 to 1600 bases (nt) (652 nt average). Raw sequence reads were trimmed to remove nucleotides derived from the amplification primers using a custom application. Below follows an outline of the main steps we followed to create the uploaded databases: I.Sequences were compared locally to a combined nucleotide database (nt16SLep = “Non-redundant” nucleotide sequence (nt) database + 16S rRNA gene (16S) database + Lepidopteran whole genome shotgun (Lep) projects completed at the time of the analysis) using BLASTN (Altschul et al., 1990) with a 1e-50 cutoff E-value, and to the protein database (nr = non-redundant protein sequence) using Diamond (Buchfink et al., 2014) with a 1e-17 cutoff E-value. II.The homology search results were then processed as follows: Step A: The output files from both homology searches were processed with MEGAN, a software which performs taxonomic binning and assigns sequences to taxa using the Lowest Common Ancestor (LCA)-assignment algorithm (Huson et al., 2007). Taxonomic and functional assignments performed by MEGAN for each sequence were then exported using a MEGAN functionality. Note: MEGAN computes a “species profile” by finding the lowest node in the NCBI taxonomy that encompasses the set of hit taxa and assigns the sequence to the taxon represented by that lowest node. With this approach, every sequence is assigned to some taxon; if the sequence aligns very specifically only to a single taxon, then it is assigned to that taxon; the less specifically a sequence hits taxa, the higher up in the taxonomy it is placed. Step B: The output files from both homology searches were also processed with a custom bash script. This script parses the homology search output files and generates two files (one for each homology search) containing the name of each sequence, its best hit (or no hit) and the corresponding E-value. III. Create local database (Step C): All this information (from the exported MEGAN files and from the bash script output files) was then used to create a local SQLite database which included all the available information for each sequence (from both homology searches).Fil: Rozadilla, Gastón. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; ArgentinaFil: Mccarthy, Cristina Beryl. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; Argentina2024info:ar-repo/semantics/conjuntoDeDatosv1.0info:eu-repo/semantics/dataSetapplication/octet-streamapplication/octet-streamapplication/octet-streamapplication/octet-streamhttp://hdl.handle.net/11336/234791Rozadilla, Gastón; Mccarthy, Cristina Beryl; (2024): Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province). Consejo Nacional de Investigaciones Científicas y Técnicas. (dataset). http://hdl.handle.net/11336/234791CONICET DigitalCONICETenginfo:eu-repo/semantics/restrictedAccessDatos sujetos al derecho de propiedad intelectualreponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-29T09:49:33Zoai:ri.conicet.gov.ar:11336/234791instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-29 09:49:33.376CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
title Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
spellingShingle Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
Rozadilla, Gastón
title_short Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
title_full Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
title_fullStr Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
title_full_unstemmed Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
title_sort Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province)
dc.creator.none.fl_str_mv Rozadilla, Gastón
Mccarthy, Cristina Beryl
author Rozadilla, Gastón
author_facet Rozadilla, Gastón
Mccarthy, Cristina Beryl
author_role author
author2 Mccarthy, Cristina Beryl
author2_role author
purl_subject.fl_str_mv https://purl.org/becyt/ford/1.6
https://purl.org/becyt/ford/1
dc.description.none.fl_txt_mv Spodoptera frugiperda is a noctuid moth that devastates various crops including corn, rice and cotton, and is found in most of the American continent. The purpose of this study was to integrate gene expression data from S. frugiperda guts and their associated metatranscriptomes, under natural and controlled conditions. For this, four S. frugiperda samples from the province of Tucumán (Argentina; subtropical region) were analysed. Specimens were obtained from different environments, altitudes and food sources, namely: 1) a transgenic maize (Zea mays) field at 495 m.a.s.l. where insecticides and fertilisers were applied (named MM; 26o49’50”S; 65o16’59.4”W); 2) Sorghum halepense at 495 m.a.s.l. (MS; 26o49’50”S; 65o16’59.4”W); 3) a maize field at 2283 m.a.s.l. where no insecticides or fertilisers were used (TV; 26o55’40.75”S; 65o45’19.90”W) ; and 4) a colony established from larvae originally collected from the same transgenic maize field as Sf_MM, reared for 9 generations under controlled conditions on an artificial diet adapted from [8], without the addition of antibiotics (BT). For all samples, total RNA extracted from fifth instar larvae guts (two digestive tracts per sample), was submitted to a modified one-step reverse transcription and polymerase chain reaction sequence-independent amplification procedure, as described previously. High-throughput pyrosequencing of the samples was performed using a Roche GS FLX (Macrogen Inc., Korea), yielding ~1Gb of metatranscriptomic reads with lengths of 50 to 1600 bases (nt) (652 nt average). Raw sequence reads were trimmed to remove nucleotides derived from the amplification primers using a custom application. Below follows an outline of the main steps we followed to create the uploaded databases: I.Sequences were compared locally to a combined nucleotide database (nt16SLep = “Non-redundant” nucleotide sequence (nt) database + 16S rRNA gene (16S) database + Lepidopteran whole genome shotgun (Lep) projects completed at the time of the analysis) using BLASTN (Altschul et al., 1990) with a 1e-50 cutoff E-value, and to the protein database (nr = non-redundant protein sequence) using Diamond (Buchfink et al., 2014) with a 1e-17 cutoff E-value. II.The homology search results were then processed as follows: Step A: The output files from both homology searches were processed with MEGAN, a software which performs taxonomic binning and assigns sequences to taxa using the Lowest Common Ancestor (LCA)-assignment algorithm (Huson et al., 2007). Taxonomic and functional assignments performed by MEGAN for each sequence were then exported using a MEGAN functionality. Note: MEGAN computes a “species profile” by finding the lowest node in the NCBI taxonomy that encompasses the set of hit taxa and assigns the sequence to the taxon represented by that lowest node. With this approach, every sequence is assigned to some taxon; if the sequence aligns very specifically only to a single taxon, then it is assigned to that taxon; the less specifically a sequence hits taxa, the higher up in the taxonomy it is placed. Step B: The output files from both homology searches were also processed with a custom bash script. This script parses the homology search output files and generates two files (one for each homology search) containing the name of each sequence, its best hit (or no hit) and the corresponding E-value. III. Create local database (Step C): All this information (from the exported MEGAN files and from the bash script output files) was then used to create a local SQLite database which included all the available information for each sequence (from both homology searches).
Fil: Rozadilla, Gastón. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; Argentina
Fil: Mccarthy, Cristina Beryl. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - La Plata; Argentina. Universidad Nacional de La Plata. Centro Regional de Estudios Genómicos; Argentina
description Spodoptera frugiperda is a noctuid moth that devastates various crops including corn, rice and cotton, and is found in most of the American continent. The purpose of this study was to integrate gene expression data from S. frugiperda guts and their associated metatranscriptomes, under natural and controlled conditions. For this, four S. frugiperda samples from the province of Tucumán (Argentina; subtropical region) were analysed. Specimens were obtained from different environments, altitudes and food sources, namely: 1) a transgenic maize (Zea mays) field at 495 m.a.s.l. where insecticides and fertilisers were applied (named MM; 26o49’50”S; 65o16’59.4”W); 2) Sorghum halepense at 495 m.a.s.l. (MS; 26o49’50”S; 65o16’59.4”W); 3) a maize field at 2283 m.a.s.l. where no insecticides or fertilisers were used (TV; 26o55’40.75”S; 65o45’19.90”W) ; and 4) a colony established from larvae originally collected from the same transgenic maize field as Sf_MM, reared for 9 generations under controlled conditions on an artificial diet adapted from [8], without the addition of antibiotics (BT). For all samples, total RNA extracted from fifth instar larvae guts (two digestive tracts per sample), was submitted to a modified one-step reverse transcription and polymerase chain reaction sequence-independent amplification procedure, as described previously. High-throughput pyrosequencing of the samples was performed using a Roche GS FLX (Macrogen Inc., Korea), yielding ~1Gb of metatranscriptomic reads with lengths of 50 to 1600 bases (nt) (652 nt average). Raw sequence reads were trimmed to remove nucleotides derived from the amplification primers using a custom application. Below follows an outline of the main steps we followed to create the uploaded databases: I.Sequences were compared locally to a combined nucleotide database (nt16SLep = “Non-redundant” nucleotide sequence (nt) database + 16S rRNA gene (16S) database + Lepidopteran whole genome shotgun (Lep) projects completed at the time of the analysis) using BLASTN (Altschul et al., 1990) with a 1e-50 cutoff E-value, and to the protein database (nr = non-redundant protein sequence) using Diamond (Buchfink et al., 2014) with a 1e-17 cutoff E-value. II.The homology search results were then processed as follows: Step A: The output files from both homology searches were processed with MEGAN, a software which performs taxonomic binning and assigns sequences to taxa using the Lowest Common Ancestor (LCA)-assignment algorithm (Huson et al., 2007). Taxonomic and functional assignments performed by MEGAN for each sequence were then exported using a MEGAN functionality. Note: MEGAN computes a “species profile” by finding the lowest node in the NCBI taxonomy that encompasses the set of hit taxa and assigns the sequence to the taxon represented by that lowest node. With this approach, every sequence is assigned to some taxon; if the sequence aligns very specifically only to a single taxon, then it is assigned to that taxon; the less specifically a sequence hits taxa, the higher up in the taxonomy it is placed. Step B: The output files from both homology searches were also processed with a custom bash script. This script parses the homology search output files and generates two files (one for each homology search) containing the name of each sequence, its best hit (or no hit) and the corresponding E-value. III. Create local database (Step C): All this information (from the exported MEGAN files and from the bash script output files) was then used to create a local SQLite database which included all the available information for each sequence (from both homology searches).
publishDate 2024
dc.date.none.fl_str_mv 2024
dc.type.none.fl_str_mv info:ar-repo/semantics/conjuntoDeDatos
v1.0
info:eu-repo/semantics/dataSet
format dataSet
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/234791
Rozadilla, Gastón; Mccarthy, Cristina Beryl; (2024): Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province). Consejo Nacional de Investigaciones Científicas y Técnicas. (dataset). http://hdl.handle.net/11336/234791
CONICET Digital
CONICET
url http://hdl.handle.net/11336/234791
identifier_str_mv Rozadilla, Gastón; Mccarthy, Cristina Beryl; (2024): Metatranscriptomic and transcriptomic databases (DB4S) of Spodoptera frugiperda larvae guts from Northern Argentina (Tucumán province). Consejo Nacional de Investigaciones Científicas y Técnicas. (dataset). http://hdl.handle.net/11336/234791
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.rights.none.fl_str_mv info:eu-repo/semantics/restrictedAccess
Datos sujetos al derecho de propiedad intelectual
eu_rights_str_mv restrictedAccess
rights_invalid_str_mv Datos sujetos al derecho de propiedad intelectual
dc.format.none.fl_str_mv application/octet-stream
application/octet-stream
application/octet-stream
application/octet-stream
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1844613533040377856
score 13.070432