1/1
2 files

Questions on Diabetes from Patients and the Public

dataset
posted on 05.09.2018, 22:55 authored by Colleen CrangleColleen Crangle

There are two Excel .csv files containing questions on type 2 diabetes mellitus tagged by the specific topics each question covers.

clinicquestions.csv has questions collected from 100 patients.

crowdsourcedquestions.csv has questions collected from 300 members of the public.


Questions on type 1 diabetes were removed, as were questions on clinic operations. Details on the coding can be found in the citation (below).


clinicquestions.csv

Number of instances: 152

Number of attributes: 25

Attribute characteristics: text; integer IDs, one-hot encodings

Missing data: none


crowdsourcedquestions.csv

Number of instances: 284

Number of attributes: 26

Attribute characteristics: text; categorical (Female/Male); integer IDs, one-hot encodings

Missing data: none


ATTRIBUTES:

gender – categorical (Male/Female)

person_ID – Integer identifier for the person asking the question

qx_ID – Integer identifier for the question

Qx – The text of the question itself, as asked, without corrections or edits

22 topic categories one-hot encoded:

CAUSE ; RISK; PREVENTION; DIAGNOSIS; MANIFESTATIONS; TREATMENT; ANATOMY; CURE; DIET; EXERCISE; WEIGHT; SELF-MANAGEMENT; DISEASE COMPLICATIONS; TREATMENT COMPLICATIONS; PERSON or ORGANIZATION; PROGNOSIS; DISTRIBUTION of a DISEASE in a POPULATION; INHERITANCE PATTERNS; RESEARCH; PSYCHOSOCIAL; Own HEALTH RECORD RELATED; OTHER


CITATION:

Crangle CE, Bradley C, Carlin PF, Esterhay RJ, Harper R, Kearney PM, Lorig K, McCarthy VJC, McTear M, Savage E, Tuttle MS, Wallace JG. (2018, to appear) Exploring Patient Information Needs: A Cross Sectional Study of Questions on Type 2 Diabetes. PLOS ONE

History