File(s) stored somewhere else

https://repository.iit.edu/islandora/object/islandora%3A1000013

Data used to develop #Polar scores

Version 3 2021-10-06, 23:31

Version 2 2016-06-02, 19:22

Version 1 2016-06-02, 19:20

dataset

posted on 2021-10-06, 23:31, authored by Culotta, A., Libby Hemphill, Heston, M.

We present a new approach to measuring political polarization, including a novel algorithm and open source Python code, which leverages Twitter content to produce measures of polarization for both users and hashtags. #Polar scores provide advantages over existing measures because they (1) can be calculated throughout the legislative cycle, (2) allow for easy differentiation between users with similar scores, (3) are chamber-agnostic, and (4) are a generic approach that can be applied beyond the U.S. Congress. #Polar scores leverage available information such as party labels, word frequency, and hashtags to create an accessible, straightforward algorithm for estimating polarity using text. (from the paper: Hemphill, L., Culotta, A., and Heston, M. (forthcoming) #Polar Scores: Measuring partisanship using social media content. Journal of Information Technology & Politics.)

The dataset contains one plain-text TSV file with the following fields for each of the 55,244 tweets used to develop #Polar scores: tweet_id, created_at, user_id, screen_name, tag, shortid, sex, party, state, chamber, name. The file contains one row per hashtag, so a tweet that uses several hashtags appears in several rows. The Python code for calculating #Polar scores is available here: http://doi.org/10.5281/zenodo.53888
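Because the file has one row per hashtag, a reader who wants one record per tweet has to group rows by tweet_id. The sketch below shows one way to do that with the Python standard library. It is not taken from the authors' published code. The column order follows the field list above, but whether the file includes a header row is an assumption (controlled here by the hypothetical `has_header` flag), and the sample rows are made-up placeholders, not values from the dataset.

```python
import csv
import io
from collections import defaultdict

# Field names in the order given in the dataset description.
COLUMNS = ["tweet_id", "created_at", "user_id", "screen_name", "tag",
           "shortid", "sex", "party", "state", "chamber", "name"]


def group_tags_by_tweet(handle, has_header=False):
    """Collapse one-row-per-hashtag records into {tweet_id: [tags]}.

    `has_header` is an assumption: set it to True if the first line
    of the TSV file holds column names rather than data.
    """
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    if has_header:
        next(reader, None)  # skip the header line
    tags = defaultdict(list)
    for row in reader:
        record = dict(zip(COLUMNS, row))
        tags[record["tweet_id"]].append(record["tag"])
    return dict(tags)


# Placeholder rows for illustration only (not real data).
SAMPLE = "\n".join("\t".join(fields) for fields in [
    ["1", "2015-01-01", "10", "rep_a", "jobs", "a1", "F", "D", "IL", "House", "Rep A"],
    ["1", "2015-01-01", "10", "rep_a", "budget", "a1", "F", "D", "IL", "House", "Rep A"],
    ["2", "2015-01-02", "20", "sen_b", "jobs", "b2", "M", "R", "TX", "Senate", "Sen B"],
])

tags_by_tweet = group_tags_by_tweet(io.StringIO(SAMPLE))
```

To run this against the downloaded file, pass an open file handle instead of the in-memory sample, for example `open(path, newline="", encoding="utf-8")`, where `path` is wherever the TSV was saved.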

Funding

NSF 1525662; NSF 1526674

Keywords

twitter, congress, partisanship, political polarization, Pattern Recognition and Data Mining

Licence

CC BY 4.0
