README_file_metadata_study_IDCC2018.txt Project: Metadata for Datasets Study – 4 Institutional Repositories Date: June 2016-April 2018 Description: The collection of data sets is the raw data underlying the paper entitled "Giving datasets context: a comparison study of institutional repositories that apply varying degrees of curation" presented at the International Digital Curation Conference in Barcelona, Spain (Feb 2018) and published in the International Journal of Digital Curation. All figures and tables in the publication were based on the analysis of this data set. The study examines the metadata and documentation for data sets in four institutional repositories. The participating institutions and repositories are Scholar@UC - University of Cincinnati (Cincinnati), Deep Blue Data - University of Michigan (Michigan), Data Repository for the University of Minnesota (DRUM) - University of Minnesota (Minnesota), and ScholarsArchive@OSU - Oregon State University (Oregon State). Funder: none Contact: Amy Koshoffer – koshofae@ucmail.uc.edu NAMING - Files are named as follows: -- ResearchQuestion_file_content_figure_number_YYYYMMDD All files can be found in the Scholar Collection "Metadata_Repositories_IDCC submission:" (URL https://scholar.uc.edu/collections/9w0323021) with content organized into individual works and Files associated as follows: - Metadata of data sets from four institutional repositories (URL https://scholar.uc.edu/show/pn89d657h) This data set is the raw data underlying the paper entitled "Giving datasets context: a comparison study of institutional repositories that apply varying degrees of curation" presented at the International Digital Curation Conference in Barcelona, Spain (Feb 2018) and published in the International Journal of Digital Curation. -- Metadata_completeness_raw_data_20160624.csv (master raw data file) -- README_file_metadata_study_IDCC2018.txt -- Data_Dictionary_IDCC2018 - Percent Completeness of Data Set Metadata (URL https://scholar.uc.edu/show/2j62s4855) This data set describes the percent completeness of metadata options for data sets in the four participating institutional repositories.   --PercentComplete_graph_Fig1_image_20180111.jpg (image of the figure created from the data) --PercentComplete_graph_Fig1_rawdata_20180111.csv (access file of raw data) --PercentComplete_graph_Fig1_stats_20180111.csv (access file of descriptive statistics for each institution) --PercentCompleteness_graph_Fig1_20180111.xlsx (original file) -Documentation for data sets in four institutional repositories (URL https://scholar.uc.edu/show/fx719m47b) This data set describes the type and number of documentation that accompany and describe data sets in the four participating institutional repositories. --Documentation_graph_Fig2_analyzed_IDCC2018.csv (access file of analyzed data) --Documentation_graph_Fig2_IDCC_2018.jpg (image of the figure created from the data) --Documentation_graph_Fig2_IDCC_2018.xlsx (original file) --Documentation_graph_Fig2_rawdata_IDCC2018.csv (access file of raw data) -Digital_object_identifiers_for_datasets_four_institutional_repositories (URL https://scholar.uc.edu/show/pz50gw09d) This data set measures whether data sets in the four participating institutional repositories have digital object identifiers or not. --DOIs_graph_Fig3_IDCC2018.csv (access file) --DOIs_graph_Fig3_IDCC2018.jpg (image of the figure created from the data) --DOIs_graph_Fig3_IDCC2018.xlsx (original file) -Keywords_data_sets_four_institutional_repositories (URL https://scholar.uc.edu/show/vq27zn431) This data set measures the number of keywords associated with data sets in the four participating institutional repositories. --Keyword_graph_fig4_IDCC2018.jpg (image of the figure created from the data) --Keyword_graph_fig4_IDCC2018.xlsx (original file) --Keyword_graph_rawdata_fig4_IDCC2018.csv (access file) -Significance_analysis_metadata_study_four_insititional_repositories (URL https://scholar.uc.edu/show/474299142) This data set describes the Mann-Whitney U test statistical analysis of the completeness profile for data sets in the four participating institutional repositories. --Mann_Whitney_test_results_2018.csv (original/access file)