Search Constraints
Filtering by:
Department
Digital Scholarship Center (DSC)
Search Results
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
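The hyperparameter grid described in this record can be enumerated as a rough sketch. The alpha and beta options are copied from the description; the assumption that TopicCount covers every integer from 10 through 40 is not confirmed by the record, and the names `TOPIC_COUNTS`, `ALPHAS`, `BETAS`, and `hyperparameter_grid` are hypothetical.

```python
from itertools import product

# Option lists copied from the record's description.
ALPHAS = [0.01, 0.31, 0.61, 0.91, "symmetric", "asymmetric"]
BETAS = [0.01, 0.31, 0.61, 0.91, "automatic", "symmetric"]

# ASSUMPTION: "TopicCount: 10-40" means every integer from 10 to 40 inclusive.
# The record does not state the step size.
TOPIC_COUNTS = range(10, 41)


def hyperparameter_grid(topic_counts=TOPIC_COUNTS, alphas=ALPHAS, betas=BETAS):
    """Yield one (topics, alpha, beta) tuple per model configuration."""
    yield from product(topic_counts, alphas, betas)


grid = list(hyperparameter_grid())
# Under the step-of-1 assumption: 31 topic counts * 6 alphas * 6 betas.
print(len(grid))
```

If the topic counts were actually sampled at a coarser step, only `TOPIC_COUNTS` would need to change.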
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- Text and Metadata for 14,399 newspaper articles.
  Transcripts collected from Internet Archive
  Date Range: 2010-2022
  File includes meta/data:
    - Unique-id (uid)
    - Title (incl. search term)
    - Date
    - Link (url)
    - Abstract
    - Text
  Text matching the following terms:
    - space explor*
    - space mission
    - space science
    - spaceship
    - space tour*
    - space transport*
    - spacecraft
    - space shuttle
    - outer space
    - astronom*
    - astrop*
    - astrona*
    - planet
    - NASA
    - star trek
    - star wars
    - lunar
    - space flight
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
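As an illustration of how the term list in this record might be applied to article text, the sketch below compiles the terms into a single case-insensitive regular expression. It assumes a trailing `*` is a truncation wildcard matching any run of word characters and that terms without `*` match whole words only; the record does not say how the original search engine interpreted the terms, so results may differ from the published dataset. The helper names `term_to_regex` and `matches_search_terms` are hypothetical.

```python
import re

# Search terms copied from the record's description.
TERMS = [
    "space explor*", "space mission", "space science", "spaceship",
    "space tour*", "space transport*", "spacecraft", "space shuttle",
    "outer space", "astronom*", "astrop*", "astrona*", "planet", "NASA",
    "star trek", "star wars", "lunar", "space flight",
]


def term_to_regex(term):
    """Translate one search term into a regex fragment.

    ASSUMPTION: a trailing '*' matches zero or more word characters;
    terms without '*' must match as whole words (no stemming).
    """
    if term.endswith("*"):
        return r"\b" + re.escape(term[:-1]) + r"\w*"
    return r"\b" + re.escape(term) + r"\b"


PATTERN = re.compile("|".join(term_to_regex(t) for t in TERMS), re.IGNORECASE)


def matches_search_terms(text):
    """Return True if any search term occurs in the text."""
    return PATTERN.search(text) is not None


print(matches_search_terms("The astronauts returned to Earth."))
print(matches_search_terms("Stock markets closed higher today."))
```

Under these assumptions the first call matches through `astrona*` and the second finds no term.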
-
- Type:
- Dataset
- Description/Abstract:
- Text and Metadata for 9,061 newspaper articles.
  Newspapers included: New York Times, Wall Street Journal, & Washington Post
  Date Range: 2017-2022
  File includes meta/data:
    - Unique-id (uid)
    - Title (incl. source paper & section name)
    - Date
    - Link (url)
    - Author
    - Text
  Text matching the following terms:
    - space explor*
    - space mission
    - space science
    - spaceship
    - space tour*
    - space transport*
    - spacecraft
    - space shuttle
    - outer space
    - astronom*
    - astrop*
    - astrona*
    - planet
    - NASA
    - star trek
    - star wars
    - lunar
    - space flight
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/12/2022
- Date Modified:
- 11/12/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the topic coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one of) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc]
    SearchTerm[s] = (one of) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/05/2022
- Date Modified:
- 11/11/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
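A file with the columns listed in this record could be scanned for the highest-coherence configuration per validation set roughly as follows. This is a sketch only: the sample rows are invented purely for illustration and are not taken from the dataset, the header spelling is assumed to match the column names in the description exactly, and it assumes a higher Coherence value indicates a better model, which depends on the coherence measure used. The function name `best_rows_by_coherence` is hypothetical.

```python
import csv
import io

# INVENTED sample rows, for illustration only; not values from the dataset.
SAMPLE_CSV = """Validation_Set,Topics,Alpha,Beta,Coherence,Perplexity
Climate,10,0.01,0.01,0.41,-8.2
Climate,20,symmetric,automatic,0.52,-8.9
Pollution,10,0.31,0.61,0.47,-7.9
Pollution,40,asymmetric,symmetric,0.44,-9.4
"""


def best_rows_by_coherence(csv_file):
    """Return {validation_set: row} keeping the row with the highest Coherence.

    ASSUMPTION: higher coherence is better for the measure that was used.
    """
    best = {}
    for row in csv.DictReader(csv_file):
        key = row["Validation_Set"]
        score = float(row["Coherence"])
        if key not in best or score > float(best[key]["Coherence"]):
            best[key] = row
    return best


best = best_rows_by_coherence(io.StringIO(SAMPLE_CSV))
for name, row in best.items():
    print(name, row["Topics"], row["Alpha"], row["Beta"], row["Coherence"])
```

To run it against a downloaded file, an opened file handle (for example `open(path, newline="")`) can be passed in place of the `io.StringIO` sample.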
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the topic coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one of) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc]
    SearchTerm[s] = (one of) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/04/2022
- Date Modified:
- 11/04/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/04/2022
- Date Modified:
- 11/04/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)
-
- Type:
- Dataset
- Description/Abstract:
- CSV files containing the coherence scoring pertaining to datasets of:
    DocumentCount = 5,000
    Corpus = (one from) Federal Caselaw [cas] / Pubmed-Abstracts [pma] / Pubmed-Central [pmc] / News [nws]
    SearchTerm[s] = (one from) Earth / Environmental / Climate / Pollution / Random 5k documents of a specific corpus
  Coherence was scored across every combination of:
    TopicCount: 10-40
    Hyperparameter-Alpha: [0.01, 0.31, 0.61, 0.91, symmetric, asymmetric]
    Hyperparameter-Beta: [0.01, 0.31, 0.61, 0.91, automatic, symmetric]
  The columns in this file include:
    Validation_Set: Which search term this scoring pertains to
    Topics: Number of topics in the model
    Alpha: Hyperparameter alpha selection from the 6 options above
    Beta: Hyperparameter beta selection from the 6 options above
    Coherence: The topic coherence score for the given model-row
    Perplexity: The perplexity score for the given model-row
- Creator/Author:
- McCabe, Erin E.
- Submitter:
- Erin E. McCabe
- Date Uploaded:
- 11/04/2022
- Date Modified:
- 11/04/2022
- Date Created:
- 2022
- License:
- Open Data Commons Attribution License (ODC-By)