Dataset

 

Topic Model Results of Ohio Non-Profit Organizations' Mission Language Open Access Deposited

No preview available

Download the file

Date Uploaded: 05/06/2021
Date Modified: 09/20/2021

This CSV file contains the topic distribution of each EIN as uncovered using six parallel Latent Dirichlet Allocation (LDA) Topic Models.
Each row depicts a topic and topic-score associated with an Ohio NPO (identified by Employer Identification Number) generated from one model run.
The sum of topic scores possible for every row associated with an EIN therefore will not exceed 6.0 (6 models x 100%)
Topic scores below .01 (1%) are not included.
Each topic from the models is further identified as Essential/Non-Essential by subject matter expert, Dr. Michael Jones, guided by the official IRS definition.
The topic models are generated on unstructured text language from the mission statement and activities language taken from the 2019 tax forms of Ohio non-profit organizations.

Alternate Title
  • LDA Topic Scores by EIN/NPO
Creator
License
Subject
Geographic Subject
Time Period
  • 2019
Submitter
College
Department
Date Created
Publisher
Language
Related URL

Relationships

In Collection:

Items

Permanent link to this page: https://scholar.uc.edu/show/6q182m71w