Box-and-Whisker visualization of coherence scores for three corpora types: Caselaw (cas), Pubmed Abstracts (pma), Pubmed Central (pmc).
This figure is for models matching search-term "climate". Visualizations for other search terms and additional interactive elements available at the related URL below.
Coherence was scored across every combination of:
- TopicCount: 10-40
- Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
- Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
Box-and-Whisker visualization of topic coherence scores for three corpora types: Caselaw (cas), Pubmed Abstracts (pma), Pubmed Central (pmc). This figure is for models matching search-term "climate". Visualizations for other search terms and additional interactive elements available at related URL below.
Coherence was scored across every combination of:
- TopicCount: 10-40
- Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
- Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
Heat map visualization of median coherence scores for three corpora: Caselaw (cas), Pubmed Abstracts (pma), Pubmed Central (pmc).
Median coherence scores across all search-term based models ("climate", "earth", "environmental" "pollution")
The median is found from 1,116 total coherence scores. Coherence was scored across every combination of:
- TopicCount: 10-40
- Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
- Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]