Network Structure Analysis

  • Dataset

    • Pajek Dataset
    • Collection of complex network datasets which are obtained through Pajek [(Slovene word for Spider) is a program, for Windows, for analysis and visualization of large networks].

    • Large-scale network data
    • Large Network Dataset Collection of various categories such as the social networks like Facebook, Twitter , communication networks like Enron, citation networks like Arxiv High Energy Physics paper citation network, collaboration networks like collaboration network of Arxiv High Energy Physics Theory, Amazon network, Internet p2p networks and other online communities like Flickr and Reddit.

    • Small-scale network data
    • Small Network Dataset Collection of various networks such as Zachary's karate club, Word adjacencies, American College football, Power grid, High-energy theory collaborations, Coauthorships in network science and etc.
  • Tools

    • Identifying communities
    • Interactive GUI to help you identify communities in a graph.

    • Breaking communities
    • Interactive GUI to help you identify and break selected/all communities in a graph. The selected community and the community structure in the graph before and after breaking the selected community will be available to view.

    • Source Codes
    • The network structure analysis algorithms are presented here. A brief description and link to download the source code are provided for your reference. The algorithms include detection of overlapping communities, adaptively finding overlapping community structure, community vulnerability assessment and approximations and maximizing the modularity.

Information Diffusion and Social Influence

  • Dataset

    • Social Media Dataset
    • Social Computing Data Repository hosts data from a collection of many different social media sites, most of which have blogging capacity. Some of the prominent social media sites included in this repository are BlogCatalog, Twitter, MyBlogLog, Digg, StumbleUpon,, MySpace, LiveJournal, The Unofficial Apple Weblog (TUAW), Reddit, etc. The repository contains various facets of blog data including blog site metadata like, user defined tags, predefined categories, blog site description; blog post level metadata like, user defined tags, date and time of posting; blog posts; blog post mood (which is defined as the blogger's emotions when (s)he wrote the blog post); blogger name; blog post comments; and blogger social network.

    • Interdependent Networks
    • Collection of interdependent network (overlapping users) data among various social networks, such as Foursquare and Twitter and collaboration networks such as Co-author networks in the area of Condensed Matter(CM), High-Energy Theory(Het), and Network Science (NetS).
  • Tools

    • Source codes
    • The Information diffusion and social influence algorithms are presented here. A brief description and link to download the source code are provided for your reference. The algorithms include viral advertising in online social networks (OSN) [Under construction]

Smart Grids

  • Dataset

    • Power Systems Test Case Archive
    • The above is the link to the UW Power System Test Case Archive which provides access to power system data (test cases). To use the archive, select the test case that interests you, then select the appropriate parts of it. If you have selected a text file, its contents will appear on your browser. At this point, use the File Save As option to download it. Other file types will invite you to save them when you select them.

    • Bus test cases for Power Grid
    • The is the link for datasets compiled by Daniel Bienstock and Abhinav Verma, in their paper "The N - k Problem in Power Grids: New Models, Formulations and Numerical Experiments". It includes the 58- and 118-bus test cases with some variations on the capacities; as well as the 49-node \square grid" example and three larger networks created using the replication technique described at the start of Section 2.4 of the above cited paper: a 300-node, 409-arc network, a 600-node, 990-arc network, and a 619-node, 1368-arc network. Additional artificial networks were created to test specific conditions. Please cite the source if you plan to use the dataset.

    • Western States Power Grid
    • An undirected, unweighted network representing the topology of the Western States Power Grid of the United States. Data compiled by D. Watts and S. Strogatz and made available on the web here. Please cite D. J. Watts and S. H. Strogatz, Nature 393, 440-442 (1998).

Other Useful tools