Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish

Autores
Allés Torrent, Susanna; del Rio, María Gimena; Bonnell, Jerry; Song, Dieyun; Hernández, Nidia
Año de publicación
2021
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
Digital Narratives of COVID-19 (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanishspeaking areas spanning North and Central America (Mexico, Columbia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.
Fil: Allés Torrent, Susanna. University of Miami; Estados Unidos
Fil: del Rio, María Gimena. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Instituto de Investigaciones Bibliográficas y Crítica Textual. IIBICRIT - Subsede "Seminario Orduna"; Argentina
Fil: Bonnell, Jerry. University of Miami; Estados Unidos
Fil: Song, Dieyun. University of Miami; Estados Unidos
Fil: Hernández, Nidia. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Centro Argentino de Información Científica y Tecnológica; Argentina
Materia
NARRATIVAS
TWITTER
MINERIA DE TEXTOS
VISUALIZACION DE DATOS
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/163331

id CONICETDig_90e907ab5e3f433ab0bef23d54f8d776
oai_identifier_str oai:ri.conicet.gov.ar:11336/163331
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in SpanishAllés Torrent, Susannadel Rio, María GimenaBonnell, JerrySong, DieyunHernández, NidiaNARRATIVASTWITTERMINERIA DE TEXTOSVISUALIZACION DE DATOShttps://purl.org/becyt/ford/6.2https://purl.org/becyt/ford/6Digital Narratives of COVID-19 (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanishspeaking areas spanning North and Central America (Mexico, Columbia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.Fil: Allés Torrent, Susanna. University of Miami; Estados UnidosFil: del Rio, María Gimena. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Instituto de Investigaciones Bibliográficas y Crítica Textual. IIBICRIT - Subsede "Seminario Orduna"; ArgentinaFil: Bonnell, Jerry. University of Miami; Estados UnidosFil: Song, Dieyun. University of Miami; Estados UnidosFil: Hernández, Nidia. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Centro Argentino de Información Científica y Tecnológica; ArgentinaUbiquity press2021-06info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/163331Allés Torrent, Susanna; del Rio, María Gimena; Bonnell, Jerry; Song, Dieyun; Hernández, Nidia; Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish; Ubiquity press; Journal of Open Humanities Data; 7; 6-2021; 1-72059-481XCONICET DigitalCONICETenginfo:eu-repo/semantics/altIdentifier/url/http://openhumanitiesdata.metajnl.com/articles/10.5334/johd.28/info:eu-repo/semantics/altIdentifier/doi/10.5334/johd.28info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-sa/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-03T10:04:30Zoai:ri.conicet.gov.ar:11336/163331instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-03 10:04:30.965CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
title Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
spellingShingle Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
Allés Torrent, Susanna
NARRATIVAS
TWITTER
MINERIA DE TEXTOS
VISUALIZACION DE DATOS
title_short Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
title_full Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
title_fullStr Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
title_full_unstemmed Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
title_sort Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
dc.creator.none.fl_str_mv Allés Torrent, Susanna
del Rio, María Gimena
Bonnell, Jerry
Song, Dieyun
Hernández, Nidia
author Allés Torrent, Susanna
author_facet Allés Torrent, Susanna
del Rio, María Gimena
Bonnell, Jerry
Song, Dieyun
Hernández, Nidia
author_role author
author2 del Rio, María Gimena
Bonnell, Jerry
Song, Dieyun
Hernández, Nidia
author2_role author
author
author
author
dc.subject.none.fl_str_mv NARRATIVAS
TWITTER
MINERIA DE TEXTOS
VISUALIZACION DE DATOS
topic NARRATIVAS
TWITTER
MINERIA DE TEXTOS
VISUALIZACION DE DATOS
purl_subject.fl_str_mv https://purl.org/becyt/ford/6.2
https://purl.org/becyt/ford/6
dc.description.none.fl_txt_mv Digital Narratives of COVID-19 (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanishspeaking areas spanning North and Central America (Mexico, Columbia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.
Fil: Allés Torrent, Susanna. University of Miami; Estados Unidos
Fil: del Rio, María Gimena. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Instituto de Investigaciones Bibliográficas y Crítica Textual. IIBICRIT - Subsede "Seminario Orduna"; Argentina
Fil: Bonnell, Jerry. University of Miami; Estados Unidos
Fil: Song, Dieyun. University of Miami; Estados Unidos
Fil: Hernández, Nidia. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Saavedra 15. Centro Argentino de Información Científica y Tecnológica; Argentina
description Digital Narratives of COVID-19 (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanishspeaking areas spanning North and Central America (Mexico, Columbia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.
publishDate 2021
dc.date.none.fl_str_mv 2021-06
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/163331
Allés Torrent, Susanna; del Rio, María Gimena; Bonnell, Jerry; Song, Dieyun; Hernández, Nidia; Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish; Ubiquity press; Journal of Open Humanities Data; 7; 6-2021; 1-7
2059-481X
CONICET Digital
CONICET
url http://hdl.handle.net/11336/163331
identifier_str_mv Allés Torrent, Susanna; del Rio, María Gimena; Bonnell, Jerry; Song, Dieyun; Hernández, Nidia; Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish; Ubiquity press; Journal of Open Humanities Data; 7; 6-2021; 1-7
2059-481X
CONICET Digital
CONICET
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/url/http://openhumanitiesdata.metajnl.com/articles/10.5334/johd.28/
info:eu-repo/semantics/altIdentifier/doi/10.5334/johd.28
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-sa/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
dc.publisher.none.fl_str_mv Ubiquity press
publisher.none.fl_str_mv Ubiquity press
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1842269859914711040
score 13.13397