PLOS One Data - Accompanies "The Broad Reach of Online Extremism: Understanding the ISIS Supporting Community on Twitter"
2017-01-11T17:35:31Z (GMT) by
This Dataset accompanies:
Benigni, Matthew, Joseph, Kenneth, and Carley, Kathleen. n.d. “Online Threat-Group-Supporting Community Detection: Uncovering the ISIS Supporting Community on Twitter.” Under Review Plos One.
and can be analyzed using R source code provided at: https://github.com/mbenigni/OSNThreatGroups
The following files are contained in this dataset:
deIdentified_attributes.csv - contains node attribute information for users associated with the 2 hop snowball sample described in the aformantioned work. The file contains the following fields: anonID,followingCount,followerCount,tweetCount,lastTweet,creation_date,lang,suspended,official. AnonID refers to a unique identifier assigned to each user and corresponds to nodes in the provided edge lists. The suspended field refers to accounts that were suspended by Twitter between NOV14 and MAR15. Some of these suspended accounts were used as positive case labels. A full explanation is provided in the article. The official field refers to a list of human verified media, government, and celebrity accounts used to train the 'official classifier' in our presented work. All other fields correspond to fields provided by the Twitter API.
deIdentified_friend_edges.csv - a directed network edge list of the following or friend ties associated with all nodes listed in deIdentified_attributes.csv.
deIdentified_mention_edges.csv - a directed network edge list of the mention ties associated with all nodes listed in deIdentified_attributes.csv. Additionally epoch time for each edge is provided in the 'date' field.
deIdentified_user_ht_edges.csv - a bipartite network edge list of the user to hash tag ties associated with all nodes listed in deIdentified_attributes.csv. Additionally epoch time for each edge is provided in the 'date' field.