figshare
Browse
1/1
2 files

Peacock Chinese Twitter Corpus (PCTC)

dataset
posted on 2020-12-26, 19:09 authored by Xiaowen NieXiaowen Nie, Weiyang Mo
The Peacock Chinese Twitter Corpus (PCTC) contains 4911813 tweets (including original tweets and replies, excluding retweets) made in simplified Chinese from 2007 to 2020. The documents are stored in MongoDB in JSON format.
User Interface: www.peacockpus.com

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC