Dataset on dynamics of coronavirus on Twitter

Authors:
Resource type:
Research article
Publication date:
2020
Institution:
Universidad de Bogotá Jorge Tadeo Lozano
Repository:
Expeditio: repositorio UTadeo
Language:
eng
OAI Identifier:
oai:expeditiorepositorio.utadeo.edu.co:20.500.12010/13290
Online access:
https://doi.org/10.1016/j.dib.2020.105684
http://hdl.handle.net/20.500.12010/13290
Keywords:
COVID-19
Pandemic
Infodemiology
Social media
Twitter
Retweets
Social Network Analysis
Hashtags
Severe acute respiratory syndrome
SARS-CoV-2
Coronavirus
Rights
License
Open (full text)
Description
Summary: In this data article, we provide a dataset of 8,982,694 Twitter posts about the global coronavirus health crisis. The data were collected through the Twitter REST API search, using the rtweet R package to download the raw data. The search term was “Coronavirus”, which covered both the word itself and its hashtag version. We collected the data over 23 days, from January 21 to February 12, 2020. The dataset is multilingual, with English, Spanish, and Portuguese predominating. We include a new variable, derived from four other variables, called the “type” of tweet; it is useful for showing the diversity of tweets and the dynamics of users on Twitter. The dataset comprises seven databases that can be analysed separately. They can also be cross-referenced to support further research on, among other things, trends and the relevance of different topics, types of tweets, the embeddedness of users and their profiles, retweet dynamics, and hashtag analysis, as well as social network analysis. This dataset may interest researchers in different fields of knowledge, such as data science, social science, network science, health informatics, tourism, and infodemiology.
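The summary mentions a “type” variable derived from four other variables, but this record does not name them or list the resulting categories. As a rough illustration only, the sketch below assumes hypothetical fields (`is_retweet`, `is_quote`, `reply_to_status_id`) and hypothetical labels (`retweet`, `quote`, `reply`, `original`), and it uses three fields rather than four; the dataset's actual variables and categories may differ.

```python
from collections import Counter


def classify_tweet(tweet):
    """Return an illustrative 'type' label for one tweet record (a dict).

    The field names and labels are assumptions made for this sketch;
    they are not taken from the dataset's documentation.
    """
    if tweet.get("is_retweet"):
        return "retweet"
    if tweet.get("is_quote"):
        return "quote"
    if tweet.get("reply_to_status_id") is not None:
        return "reply"
    return "original"


# Made-up records, used only to exercise the function.
sample = [
    {"status_id": "1", "is_retweet": False, "is_quote": False, "reply_to_status_id": None},
    {"status_id": "2", "is_retweet": True, "is_quote": False, "reply_to_status_id": None},
    {"status_id": "3", "is_retweet": False, "is_quote": True, "reply_to_status_id": None},
    {"status_id": "4", "is_retweet": False, "is_quote": False, "reply_to_status_id": "1"},
]

types = [classify_tweet(t) for t in sample]
type_counts = Counter(types)  # how many tweets fall under each label
```

Counting the labels this way gives a quick view of how a collection splits across tweet types, which is the kind of diversity the summary says the variable is meant to show.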