figshare
Browse

sorry, we can't preview this file

OTGSC_data.zip (1.16 GB)

PLOS One Data - Accompanies "The Broad Reach of Online Extremism: Understanding the ISIS Supporting Community on Twitter"

Download (1.16 GB)
Version 2 2017-01-11, 17:35
Version 1 2016-04-08, 11:20
dataset
posted on 2017-01-11, 17:35 authored by Matthew BenigniMatthew Benigni
This Dataset accompanies:

Benigni, Matthew, Joseph, Kenneth, and Carley, Kathleen. n.d. “Online Threat-Group-Supporting Community Detection: Uncovering the ISIS Supporting Community on Twitter.” Under Review Plos One.

and can be analyzed using R source code provided at: https://github.com/mbenigni/OSNThreatGroups

The following files are contained in this dataset:

Files:
deIdentified_attributes.csv - contains node attribute information for users associated with the 2 hop snowball sample described in the aformantioned work.  The file contains the following fields: anonID,followingCount,followerCount,tweetCount,lastTweet,creation_date,lang,suspended,official.  AnonID refers to a unique identifier assigned to each user and corresponds to nodes in the provided edge lists. The suspended field refers to accounts that were suspended by Twitter between NOV14 and MAR15.  Some of these suspended accounts were used as positive case labels.  A full explanation is provided in the article.  The official field refers to a list of human verified media, government, and celebrity accounts used to train the 'official classifier' in our presented work.  All other fields correspond to fields provided by the Twitter API.

deIdentified_friend_edges.csv - a directed network edge list of the following or friend ties associated with all nodes listed in deIdentified_attributes.csv.

deIdentified_mention_edges.csv - a directed network edge list of the mention ties associated with all nodes listed in deIdentified_attributes.csv.  Additionally epoch time for each edge is provided in the 'date' field.

deIdentified_user_ht_edges.csv - a bipartite network edge list of the user to hash tag ties associated with all nodes listed in deIdentified_attributes.csv.  Additionally epoch time for each edge is provided in the 'date' field.



Funding

N000141310835

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC