Dataset

 

Coherence_Evaluations - nws_earth.csv 开放存取 Deposited

不可预览

下载文件

Date Uploaded: 11/03/2022
Date Modified: 11/03/2022

CSV files containing the coherence scoring pertaining to datasets of:
DocumentCount = 5,000
Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus

Coherence was scored across every combination of:
TopicCount: 10-40
Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]

The columns in this file include:
Validation_Set: Which search term this scoring pertains to
Topics: Number of topics in the model
Alpha: Hyperparameter alpha selection from the 6 options above
Beta: Hyperparameter beta selection from the 6 options above
Coherence: The topic coherence score for the given model-row
Perplexity: The perplexity score for the given model-row

创建者
证书
学科
时间段
  • 21st century
提交
部门
创建日期
出版者
语言

Digital Object Identifier (DOI)

识别码: doi:10.7945/dpd9-zk65
链接: https://doi.org/10.7945/dpd9-zk65

这个DOI链接是其他人引用您工作的最佳方式。

关系

在收集:

单件

永久链接到此页面: https://scholar.uc.edu/show/5m60qt424