Clustering Download Events to Identify Classrooms
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
The Network for Computational Nanotechnology's (NCN)  nanoHUB site uses the HUBzero® platform  to offer a variety of content, simulation tools, and collaboration methods to an international community of students, teachers and professionals. Understanding and identifying educational usage of nanoHUB to form communities around nanotechnology education and improve education content is a long term objective of nanoHUB. While certain users log into nanoHUB, providing us with an identity with which to associate their usage, the majority of activity is from unidentified users who download content and come to the site from outside references such as search engine results. This paper describes a method to detect classroom usage from content download events with no additional information, identifying classroom usage by any user of nanoHUB material and providing insights into content usage.