Dataset

 

Vocabulary Count Feature Vectors for Medical School Classifier Open Access Deposited

Date Uploaded: 05/14/2021
Date Modified: 05/14/2021

Classifier algorithms use the features (collectively known as Feature Vectors) of each item in a dataset to assess the classification to which that item belongs.

In this classifier approach, each item represents one document containing the application essay combined with unstructured language describing relevant activities of a single applicant. For privacy, the full text of this document is not provided. Instead, each document is represented only by its features. The feature vector for this classifier is based on the term frequency for each of the identified terms. E.G. Doc_A contains 0 occurrences of any terms identified as family medicine vocabulary, and 10 occurrences of terms from the the non-family-medicine vocabulary.

Creator
License
Subject
Geographic Subject
Time Period
  • 2010-2019
Submitter
College
Department
Publisher
Language

Relationships

In Collection:

Items

Permanent link to this page: https://scholar.uc.edu/show/pv63g177p