Dataset
Topic Model Results of Ohio Non-Profit Organizations' Mission Language
This CSV file contains the topic distribution for each Employer Identification Number (EIN), as estimated by six parallel Latent Dirichlet Allocation (LDA) topic models.
Each row gives one topic and its topic score for an Ohio non-profit organization (NPO), identified by its EIN, as generated by a single model run.
Because each model's topic scores for an organization sum to at most 1.0 (100%), the topic scores across all rows for a given EIN will not exceed 6.0 (6 models x 100%).
Topic scores below 0.01 (1%) are not included, so the total for an EIN may fall short of 6.0.
Each topic from the models is further classified as Essential or Non-Essential by a subject matter expert, Dr. Michael Jones, guided by the official IRS definition.
The topic models were fit on unstructured text: the mission statement and activities language taken from the 2019 tax forms of Ohio non-profit organizations.
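As a rough illustration of the row structure described above, the following sketch sums topic scores per EIN and checks the 6.0 upper bound. The column names (`ein`, `model`, `topic`, `score`, `essential`), the EINs, the topic labels, and the scores are all invented for this example; the actual header and contents of topicmodel_results.csv may differ and should be checked before adapting this.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows with HYPOTHETICAL column names; the real file's
# header may be different. One row = one (EIN, model run, topic) score.
SAMPLE = """ein,model,topic,score,essential
000000001,1,education,0.62,Essential
000000001,1,arts,0.37,Non-Essential
000000001,2,youth,0.99,Essential
000000002,1,health,0.80,Essential
000000002,2,housing,0.15,Essential
"""

N_MODELS = 6  # six parallel LDA models, per the dataset description


def total_score_by_ein(rows):
    """Sum topic scores over all rows (all model runs) for each EIN."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["ein"]] += float(row["score"])
    return dict(totals)


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
totals = total_score_by_ein(rows)

# Each model contributes at most 1.0 per EIN, so the total is bounded by 6.0.
# Scores under 0.01 are dropped from the file, so totals are usually lower.
for ein, total in totals.items():
    assert total <= N_MODELS + 1e-9, f"EIN {ein} exceeds the expected bound"
```

To run the same check on the deposited file, replace `io.StringIO(SAMPLE)` with an open file handle and adjust the column names to match the real header.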
- Alternate Title: LDA Topic Scores by EIN/NPO
- Creator
- License
- Subject
- Geographic Subject
- Time Period: 2019
- Submitter
- College
- Department
- Date Created
- Publisher
- Language
- Related URL
- In Collection:
Relationships
Items
Title | Date Uploaded | Visibility |
---|---|---|
topicmodel_results.csv | 2021-09-20 | Open Access |
Permanent link to this page: https://scholar.uc.edu/show/6q182m71w