Evans, Tim Word Frequency Count from Network Review <p>These are files containing the data used in one of the plots shown in Figure 10 in my basic overview of Complex Networks (a review for Contemporary Physics, see below). Please cite the source if you use this data. However you could also do this yourself by using the LaTeX file via arXiv. It was produced by using various UNIX tools to strip the LaTeX commands to produce a list of words (one per line) followed by counting the number of times each line was repeated. I can see that there was no stopping or stemming</p> <p>e.g. "The" and "the" appear separately, "vertex" and "vertices" are counted separately.</p> <p>Files:-</p> <p>netrevcountrawdata.xls = rank and count for each word, along with plots</p> <p>netrevcountTabSeparated.txt = rank and count for each word in simple text format</p> <p>netrevindex.txt = raw data, unsorted (note there are some silly 'words' like "x"</p> <p> </p> <p>Original Text:-</p> <p>T.S.Evans<br>Complex Networks<br>Contemporary Physics 45 (2004) 455-475<br>DOI: 10.1080/00107510412331283531<br>arXiv:cond-mat/0405123 <br>http://arxiv.org/abs/cond-mat/0405123</p> <p> </p> Complex Networks;Frequency Count;Zipfs Law;Probability;Condensed Matter Physics;Science Policy 2012-12-12
    https://figshare.com/articles/dataset/Word_Frequency_Count_from_Network_Review/104409
10.6084/m9.figshare.104409.v1