UHCL Theses and Dissertations
datasetposted on 12.07.2018 by Clarke Iakovakis
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The dataset tdsFinal.csv includes all master's projects, theses and dissertations submitted to UHCL as of May 2018. Variables include:
bibrecord: The bibliographic record as cataloged by the Neumann Library
itemrecord: The item record as cataloged by the Neumann Library
title: The title of the work
author: The author of the work
callnumber: The call number of the work
note: The bibliographic notes field
subject: subjects in Library of Congress classification as cataloged by the Neumann Library
imprint: publication year
add.author: The additional author field
tot.chkout: The total number of checkouts
type: thesis, graduate project, or dissertation
college: college, updated to current college names
link: the hyperlink to the work
format: print or electronic
This dataset has gone through significant processing, as documented in the thesisCleaning pdf document included in this FigShare fileset. The thesisCleaning.Rmd file is the Markdown file used to generate the PDF.
Advisor and college names have been extracted from other fields. Chair names have been clustered in OpenRefine to create controlled names (e.g. consistent use of middle initials).