figshare
Browse
BlogCatalog3.zip (956.72 kB)

BlogCatalog dataset

Download (956.72 kB)
Version 3 2020-04-20, 06:40
Version 2 2020-04-20, 06:36
Version 1 2020-04-20, 06:27
dataset
posted on 2020-04-20, 06:40 authored by Nitin Agarwal, Xufei Wang

Abstract: BlogCatalog is the social blog directory which manages the bloggers and their blogs.


Number of Nodes:

10,312

Number of Edges:

333,983

Missing Values?

no


Source:

Nitin Agarwal+, Xufei Wang*, Huan Liu*

+ Department of Information Science, University of Arkansas at Little Rock. E-mail:nxagarwal@ualr.edu

* School of Computing, Informatics and Decision Systems Engineering, Arizona State University. E-mail: huan.liu@asu.edu, xufei.wang@asu.edu


Data Set Information:

2 files are included:

1. nodes.csv
-- it's the file of all the users. This file works as a dictionary of all the users in this data set. It's useful for fast reference. It contains all the node ids used in the dataset.

2. edges.csv
-- this is the friendship network among the bloggers. The blogger's friends are represented using edges. Here is an example.

1,2

This means blogger with id "1" is friend with blogger id "2".


Attribute Information:

This is the data set crawled on July, 2009 from BlogCatalog ( http://www.blogcatalog.com ). BlogCatalog is a social blog directory website. This contains the friendship network crawled. For easier understanding, all the contents are organized in CSV file format.

-. Basic statistics

Number of bloggers : 88,784

Number of friendship pairs: 4,186,390


Relevant Papers:


Nitin Agarwal and Huan Liu. ”Modeling and Data Mining in Blogosphere”, Synthesis Lectures on Data Mining and Knowledge Discovery #1, Morgan & Claypool Publishers, Robert Grossman (Editor), August 2009. ISBN: 9781598299083 (paperback) ISBN: 9781598299090 (ebook)

Nitin Agarwal, Magdiel Galan, Huan Liu, and Shankar Subramanya. WisColl: Collective Wisdom based Blog Clustering. Journal of Information Science, 180(1): 39-61, January, 2010.

Nitin Agarwal, Huan Liu, Sudheendra Murthy, Arunabha Sen, and Xufei Wang. A Social Identity Approach to Identify Familiar Strangers in a Social Network. In Proceedings of the Third International AAAI Conference on Weblogs and Social Media (ICWSM09), pp. 2 - 9, May 17-20, 2009. San Jose, California.

Nitin Agarwal, Huan Liu, Sudheendra Murthy, Arunabha Sen, and Xufei Wang. "A Social Identity Approach to Identify Familiar Strangers in a Social Network", 3rd International AAAI Conference on Weblogs and Social Media (ICWSM09), pp. 2 - 9, May 17-20, 2009. San Jose, California.


History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC