Word Frequency Count from Network Review

None
Share this:
Embed*
Cite this:

Evans, Tim (2012): Word Frequency Count from Network Review. figshare.

http://dx.doi.org/10.6084/m9.figshare.104409
Retrieved 04:46, Oct 22, 2014 (GMT)

Description

These are files containing the data used in one of the plots shown in Figure 10 in my basic overview of Complex Networks (a review for Contemporary Physics, see below). Please cite the source if you use this data. However you could also do this yourself by using the LaTeX file via arXiv. It was produced by using various UNIX tools to strip the LaTeX commands to produce a list of words (one per line) followed by counting the number of times each line was repeated. I can see that there was no stopping or stemming

e.g. "The" and "the" appear separately, "vertex" and "vertices" are counted separately.

Files:-

netrevcountrawdata.xls = rank and count for each word, along with plots

netrevcountTabSeparated.txt = rank and count for each word in simple text format

netrevindex.txt = raw data, unsorted (note there are some silly 'words' like "x"

 

Original Text:-

T.S.Evans
Complex Networks
Contemporary Physics 45 (2004) 455-475
DOI: 10.1080/00107510412331283531
arXiv:cond-mat/0405123
http://arxiv.org/abs/cond-mat/0405123

 

Links

Comments (0)

You must be logged in to post comments.

Cite "Filename"

Place your mouse over the citation text to select it

Embed "Word Frequency Count from Network Review"

Place your mouse over the embed code to select and copy it

Claim article

You claim request was sent. I will be handled in the next 24 hours.

Close window

Feedback

We appreciate all your comments, questions, suggestions or gratitude.

Login

The username or password entered are wrong.

Reset password

Your password will be sent to your registered e-mail address.

Create account

I agree to the Terms & Conditions *