Table 2.xls (5.5 kB)

An analysis of correlation between tags and keywords was used to decide which terms would be included in the model.

dataset
posted on 2016-03-29, 07:59 authored by John P. Schomberg, Oliver L. Haimson, Gillian R. Hayes, Hoda Anton-Culver

A correlation cutoff of 0.05 was used for inclusion in the model, unless the authors strongly believed a keyword would be useful despite its low correlation (e.g., "high quality," "food poisoning," and "employees," selected for their relation to food quality, foodborne illness, and employee behavior). This liberal cutoff was chosen to include as many predictors as possible. The correlations are specific to the pilot study training data, which excluded all but Chinese restaurants.
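
The selection rule described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the function names (`pearson`, `select_keywords`), the toy data, the use of Pearson correlation, and the comparison on the absolute value of the correlation are all hypothetical choices that the description does not specify.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences.

    Assumption: the dataset description does not say which correlation
    measure was used; Pearson is chosen here only for illustration.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / sqrt(var_x * var_y)


def select_keywords(keyword_columns, tag, cutoff=0.05, forced=()):
    """Keep keywords whose correlation with the tag meets the cutoff.

    Keywords listed in `forced` are kept regardless of correlation,
    mirroring the manual overrides mentioned in the description.
    Assumption: the cutoff is applied to the absolute correlation.
    """
    selected = []
    for keyword, values in keyword_columns.items():
        r = pearson(values, tag)
        if abs(r) >= cutoff or keyword in forced:
            selected.append(keyword)
    return selected


# Hypothetical toy data: one 0/1 indicator per review.
tag = [1, 0, 1, 0]
keyword_columns = {
    "dirty": [1, 0, 1, 0],      # perfectly correlated with the tag
    "parking": [1, 1, 0, 0],    # uncorrelated with the tag
    "employees": [0, 0, 1, 1],  # uncorrelated, but forced in below
}

chosen = select_keywords(keyword_columns, tag, forced={"employees"})
print(chosen)
```

With a cutoff this low, nearly any keyword with a detectable association passes, which is consistent with the stated aim of including as many predictors as possible.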
