COVID-19: A scholarly production dataset report for research analysis

COVID-19 has been recognized as a global threat, and many studies are under way to help fight and prevent this pandemic. This work presents a scholarly production dataset focused on COVID-19, providing an overview of scientific research activities, making it po...

Authors:
Santana Santos, Breno
Silva, Ivanovitch
Câmara Ribeiro-Dantas, Marcel da
Alves, Gisliany
Takako Endo, Patricia
Lima, Luciana
Resource type:
Research article
Publication date:
2020
Institution:
Universidad de Bogotá Jorge Tadeo Lozano
Repository:
Expeditio: repositorio UTadeo
Language:
eng
OAI Identifier:
oai:expeditiorepositorio.utadeo.edu.co:20.500.12010/14112
Online access:
https://doi.org/10.1016/j.dib.2020.106178
http://hdl.handle.net/20.500.12010/14112
Keywords:
COVID-19
SARS-CoV-2
Pandemic
Data Science
Bibliometrics
Scientometrics
Severe acute respiratory syndrome
Coronavirus
Rights
License
Open access (full text)
id UTADEO2_649d7764e32850bb92d618419a783102
oai_identifier_str oai:expeditiorepositorio.utadeo.edu.co:20.500.12010/14112
network_acronym_str UTADEO2
network_name_str Expeditio: repositorio UTadeo
repository_id_str
dc.title.spa.fl_str_mv COVID-19: A scholarly production dataset report for research analysis
dc.subject.spa.fl_str_mv COVID-19
SARS-CoV-2
Pandemic
Data Science
Bibliometrics
Scientometrics
dc.subject.lemb.spa.fl_str_mv Severe acute respiratory syndrome
COVID-19
SARS-CoV-2
Coronavirus
description COVID-19 has been recognized as a global threat, and many studies are under way to help fight and prevent this pandemic. This work presents a scholarly production dataset focused on COVID-19. It provides an overview of scientific research activity and makes it possible to identify the countries, scientists and research groups most active in the effort to combat the coronavirus disease. The dataset comprises 40,212 records of article metadata collected from the Scopus, PubMed, arXiv and bioRxiv databases, covering January 2019 to July 2020. The data were extracted by web scraping with Python and preprocessed (data wrangling) with Pandas. In addition, the pipeline that preprocesses and generates the dataset is versioned with the Data Version Control (DVC) tool and is thus easily reproducible and auditable.
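The preprocessing step mentioned in the description (combining article metadata from several sources with Pandas) might look roughly like the sketch below. This is an illustration under stated assumptions, not the authors' pipeline: the column names (`title`, `doi`, `source`), the sample rows, and the helper `merge_sources` are all hypothetical, and the deduplication rule (DOI first, normalized title as fallback) is one plausible choice among several.

```python
import pandas as pd


def merge_sources(frames):
    """Concatenate per-source metadata and drop duplicate articles.

    Assumption (not from the paper): every frame has `title`, `doi`
    and `source` columns. Duplicates are detected by DOI when one is
    present, otherwise by a normalized title.
    """
    combined = pd.concat(frames, ignore_index=True)
    # Normalize the fields used as deduplication keys.
    combined["doi"] = combined["doi"].str.strip().str.lower()
    combined["title_key"] = (
        combined["title"]
        .str.lower()
        .str.replace(r"\W+", " ", regex=True)
        .str.strip()
    )
    # Prefer the DOI as key; fall back to the normalized title.
    combined["dedup_key"] = combined["doi"].fillna(combined["title_key"])
    deduped = combined.drop_duplicates(subset="dedup_key", keep="first")
    return deduped.drop(columns=["title_key", "dedup_key"]).reset_index(drop=True)


# Hypothetical sample rows standing in for scraped metadata.
scopus = pd.DataFrame(
    {"title": ["Paper A", "Paper B"], "doi": ["10.1/A", None], "source": ["Scopus", "Scopus"]}
)
pubmed = pd.DataFrame(
    {"title": ["Paper A (PubMed copy)", "paper b"], "doi": ["10.1/a ", None], "source": ["PubMed", "PubMed"]}
)
arxiv = pd.DataFrame({"title": ["Paper C"], "doi": [None], "source": ["arXiv"]})

merged = merge_sources([scopus, pubmed, arxiv])
print(merged[["title", "source"]])
```

Keeping the first occurrence means the order of the input frames decides which source "wins" for a duplicated article; a real pipeline would likely make that priority explicit.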
publishDate 2020
dc.date.accessioned.none.fl_str_mv 2020-10-01T19:47:26Z
dc.date.available.none.fl_str_mv 2020-10-01T19:47:26Z
dc.date.created.none.fl_str_mv 2020
dc.type.local.spa.fl_str_mv Article
dc.type.coar.spa.fl_str_mv http://purl.org/coar/resource_type/c_2df8fbb1
format http://purl.org/coar/resource_type/c_2df8fbb1
dc.identifier.issn.spa.fl_str_mv 2352-3409
dc.identifier.other.spa.fl_str_mv https://doi.org/10.1016/j.dib.2020.106178
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/20.500.12010/14112
dc.identifier.doi.spa.fl_str_mv https://doi.org/10.1016/j.dib.2020.106178
identifier_str_mv 2352-3409
url https://doi.org/10.1016/j.dib.2020.106178
http://hdl.handle.net/20.500.12010/14112
dc.language.iso.spa.fl_str_mv eng
language eng
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.rights.local.spa.fl_str_mv Open access (full text)
rights_invalid_str_mv Open access (full text)
http://purl.org/coar/access_right/c_abf2
dc.format.extent.spa.fl_str_mv 6 pages
dc.format.mimetype.spa.fl_str_mv application/pdf
dc.publisher.spa.fl_str_mv Data in Brief
dc.source.spa.fl_str_mv reponame:Expeditio Repositorio Institucional UJTL
instname:Universidad de Bogotá Jorge Tadeo Lozano
instname_str Universidad de Bogotá Jorge Tadeo Lozano
institution Universidad de Bogotá Jorge Tadeo Lozano
reponame_str Expeditio Repositorio Institucional UJTL
collection Expeditio Repositorio Institucional UJTL
bitstream.url.fl_str_mv https://expeditiorepositorio.utadeo.edu.co/bitstream/20.500.12010/14112/2/license.txt
https://expeditiorepositorio.utadeo.edu.co/bitstream/20.500.12010/14112/3/COVID-19--A-scholarly-production-dataset-report-for-researc_2020_Data-in-Bri.pdf.jpg
bitstream.checksum.fl_str_mv abceeb1c943c50d3343516f9dbfc110f
ca72f920b75c456d6a948eb75c2f31cc
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
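The MD5 values listed above can be used to check that a downloaded bitstream arrived intact. A minimal sketch using Python's standard `hashlib` follows; the helper name `md5_matches` and the local file name in the usage comment are hypothetical. MD5 is adequate here only as a check against accidental corruption, not as protection against deliberate tampering.

```python
import hashlib


def md5_matches(path, expected_hex, chunk_size=65536):
    """Return True if the MD5 digest of the file at `path` equals `expected_hex`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large files need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    # Compare case-insensitively and ignore surrounding whitespace.
    return digest.hexdigest() == expected_hex.strip().lower()


# Usage, assuming a hypothetical local copy of the license bitstream:
# md5_matches("license.txt", "abceeb1c943c50d3343516f9dbfc110f")
```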
repository.name.fl_str_mv Repositorio Institucional - Universidad Jorge Tadeo Lozano
repository.mail.fl_str_mv expeditio@utadeo.edu.co
_version_ 1814213803596840960
spelling Authors: Santana Santos, Breno; Silva, Ivanovitch; Câmara Ribeiro-Dantas, Marcel da; Alves, Gisliany; Takako Endo, Patricia; Lima, Luciana
LICENSE bitstream: license.txt, text/plain; charset=utf-8, size 2938, MD5 abceeb1c943c50d3343516f9dbfc110f, open access
THUMBNAIL bitstream: COVID-19--A-scholarly-production-dataset-report-for-researc_2020_Data-in-Bri.pdf.jpg, image/jpeg, size 13263, MD5 ca72f920b75c456d6a948eb75c2f31cc, open access
Record: 20.500.12010/14112, 2021-03-17 19:10:38.745, metadata only access