American College Football Network Files

2012-12-03T13:56:32Z (GMT) by Tim Evans
<p><strong>American College Football network of Girvan and Newman</strong> Mark Newman provides a football.gml file which contains the network of American football games between Division IA colleges during regular season Fall 2000. The file asks you to cite M. Girvan and M. E. J. Newman, Community structure in social and biological networks, Proc. Natl. Acad. Sci. USA <strong>99</strong>, 7821-7826 (2002). There are are two issues with the original GN file. First three teams met twice in one season so the graph is not simple. This is easily dealt with if required. Secondly, the assignments to conferences, the node values, seem to be for the 2001 season and not the 2000 season. The games do appear to be for the 2000 season as stated. For instance the Big West conference existed for football till 2000 while the Sun Belt conference was only started in 2001. Also there were 11 conferences and 5 independents in 2001 but 10 conferences and 8 independents in 2000. I have provided a set of files footballTSE* which define a simple graph with the correct conference assignments in the archive here. There is a read me file included with more details.  Further information about the problems with this data and the solutions are given in <a href="http://arxiv.org/abs/1009.0638" target="_blank">T.S. Evans, “Clique Graphs and Overlapping Communities”, J. Stat. Mech. (2010) P12037</a> [<a href="http://arxiv.org/abs/1009.0638" target="_blank">arXiv:1009.0638</a>] which would be the appropriate source to cite along with the original GN publication.</p><p>Note that Gschwind et al, 2015, Social Network Analysis and Community Detection by Decomposing a Graph into Relaxed Cliques, independently finds similar errors in this data.</p>