Dataset
COVID 19 literature NLP models – Viral outbreak topic tuning Open Access Deposited
The data sets were derived from coronavirus related scientific literature using the CORD-19 dataset released by the Allen Institute of Artificial Intelligence as of July 14, 2020, using the Elasticsearch engine hosted by the Digital Scholarship Center (DSC). Through indexing the full-text and the metadata of the article corpus, the research team generated a full-corpus model and 7 different models corresponding to key viral outbreaks from the past several decades' coronaviruses (SARS-CoV, MERS-CoV, and SARS- CoV-2) and non-coronaviruses (HIV, Zika, H1N1, and Ebola). The targeted subsets of the articles used two or more occurrences of virus-specific keywords drawn from conventions established by the World Health Organization.
- Creator
- License
- Subject
- Submitter
- College
- Department
- Date Created
- Publisher
- Language
-
- In Collection:
Relationships
Items
Thumbnail | Title | Date Uploaded | Visibility | Actions |
---|---|---|---|---|
![]() |
covid_q2_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
ebola_m2_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
10k_eval_avgs.txt | 2020-10-30 | Open Access |
|
![]() |
covid_q1_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
hiv_m2_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
hyperparam_tuning_on_10k_rand.csv | 2020-10-30 | Open Access |
|
![]() |
full_tuning_results.csv | 2020-10-30 | Open Access |
|
![]() |
h1n1_m2_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
zika_m2_topics_tuning.csv | 2020-10-30 | Open Access |
|
![]() |
mers_m2_topics_tuning.csv | 2020-10-30 | Open Access |
|
Permanent link to this page: https://scholar.uc.edu/show/pk02cc123