Dataset
COVID 19 literature NLP models – Viral outbreak topic tuning 开放存取 Deposited
The data sets were derived from coronavirus related scientific literature using the CORD-19 dataset released by the Allen Institute of Artificial Intelligence as of July 14, 2020, using the Elasticsearch engine hosted by the Digital Scholarship Center (DSC). Through indexing the full-text and the metadata of the article corpus, the research team generated a full-corpus model and 7 different models corresponding to key viral outbreaks from the past several decades' coronaviruses (SARS-CoV, MERS-CoV, and SARS- CoV-2) and non-coronaviruses (HIV, Zika, H1N1, and Ebola). The targeted subsets of the articles used two or more occurrences of virus-specific keywords drawn from conventions established by the World Health Organization.
- 创建者
- 证书
- 学科
- 提交
- 学
- 部门
- 创建日期
- 出版者
- 语言
永久链接到此页面: https://scholar.uc.edu/show/pk02cc123
