<p>There are two Excel .csv files containing
questions on type 2 diabetes mellitus tagged by the specific topics each
question covers. </p><p>clinicquestions.csv has questions collected from 100 patients. </p><p>crowdsourcedquestions.csv has questions collected from 300 members of the
public. </p><p><br></p><p>Questions on type 1 diabetes were removed, as were questions on clinic
operations. Details on the coding can be found in the citation (below).</p><p><br></p><p>clinicquestions.csv</p><p>Number of instances: 152</p><p>Number of attributes: 25</p><p>Attribute characteristics: text; integer IDs,
one-hot encodings</p><p>Missing data: none</p><p><br></p><p>crowdsourcedquestions.csv
</p><p>Number of instances: 284</p><p>Number of attributes: 26</p><p>Attribute characteristics: text; categorical
(Female/Male); integer IDs, one-hot encodings</p><p>Missing data: none</p><p><br></p><p>ATTRIBUTES:</p><p>gender – categorical (Male/Female)</p><p>person_ID – Integer identifier for the person asking the question </p><p>qx_ID – Integer identifier for the question</p><p>Qx – The text of the question itself, as asked, without
corrections or edits</p><p>22 topic categories one-hot encoded:</p><p>CAUSE ; RISK; PREVENTION; DIAGNOSIS; MANIFESTATIONS; TREATMENT; ANATOMY;
CURE; DIET; EXERCISE; WEIGHT; SELF-MANAGEMENT; DISEASE COMPLICATIONS; TREATMENT
COMPLICATIONS; PERSON or ORGANIZATION; PROGNOSIS; DISTRIBUTION of a DISEASE in
a POPULATION; INHERITANCE PATTERNS; RESEARCH; PSYCHOSOCIAL; Own HEALTH RECORD
RELATED; OTHER<br></p><p><br></p><p> </p><p>CITATION:</p><p>
</p><p>Crangle CE, Bradley C, Carlin PF, Esterhay RJ, Harper R, Kearney
PM, Lorig K, McCarthy VJC, McTear M, Savage E, Tuttle MS, Wallace JG. (2018, to
appear) Exploring Patient Information
Needs: A Cross Sectional Study of Questions on Type 2 Diabetes. PLOS ONE</p><p></p>