Matrix: SNAP/web-BerkStan

Description: Web graph of Berkeley and Stanford

SNAP/web-BerkStan graph SNAP/web-BerkStan graph
(bipartite graph drawing) (graph drawing of A+A')


SNAP/web-BerkStan
scc of SNAP/web-BerkStan

  • Home page of the UF Sparse Matrix Collection
  • Matrix group: SNAP
  • Click here for a description of the SNAP group.
  • Click here for a list of all matrices
  • Click here for a list of all matrix groups
  • download as a MATLAB mat-file, file size: 9 MB. Use UFget(2300) or UFget('SNAP/web-BerkStan') in MATLAB.
  • download in Matrix Market format, file size: 18 MB.
  • download in Rutherford/Boeing format, file size: 12 MB.

    Matrix properties
    number of rows685,230
    number of columns685,230
    nonzeros7,600,595
    # strongly connected comp.109,406
    explicit zero entries0
    nonzero pattern symmetry 25%
    numeric value symmetry 25%
    typebinary
    structureunsymmetric
    Cholesky candidate?no
    positive definite?no

    authorS. Kamvar
    editorJ. Leskovec
    date2002
    kinddirected graph
    2D/3D problem?no

    Notes:

    Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
    Jure Leskovec http://snap.stanford.edu/data/index.html                         
    email jure at cs.stanford.edu                                                  
                                                                                   
    Berkeley-Stanford web graph                                                    
                                                                                   
    NOTE:  This is an earlier version (2002) of the data obtained from             
    Sep Kamvar, Stanford (2003) (the Kamvar/Stanford_Berkeley graph                
    in the UF collection, matrix ID 980).                                          
                                                                                   
    Dataset information                                                            
                                                                                   
    Nodes represent pages from berkely.edu and stanford.edu domains and directed   
    edges represent hyperlinks between them. The data was collected in 2002.       
                                                                                   
    Dataset statistics                                                             
    Nodes   685230                                                                 
    Edges   7600595                                                                
    Nodes in largest WCC    654782 (0.956)                                         
    Edges in largest WCC    7499425 (0.987)                                        
    Nodes in largest SCC    334857 (0.489)                                         
    Edges in largest SCC    4523232 (0.595)                                        
    Average clustering coefficient  0.6149                                         
    Number of triangles     64690980                                               
    Fraction of closed triangles    0.08769                                        
    Diameter (longest shortest path)    669                                        
    90-percentile effective diameter    10                                         
                                                                                   
    Source (citation)                                                              
                                                                                   
    J. Leskovec, K. Lang, A. Dasgupta, M. Mahoney. Community Structure in Large    
    Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters.
    arXiv.org:0810.1355, 2008.                                                     
                                                                                   
    Files                                                                          
    File    Description                                                            
    web-BerkStan.txt.gz     Berkely-Stanford web graph from 2002                   
                                                                                   
    NOTE: a near duplicate of this problem already appears in the UF Collection:   
                                                                                   
    web-BerkStan        Kamvar/Stanford_Berkeley                                   
                        in SNAP/:       n: 685,230   nz: 7,600,595                 
                        in Kamvar/      n: 683,446   nz: 7,583,376                 
                                                                                   
                        I obtained the Kamvar/Stanford_Berkeley directly           
                        from Sep Kamvar.  It is slightly smaller than the          
                        version in SNAP.  It is thus likely that Sep created       
                        multiple versions of the graph.                            
    

    For a description of the statistics displayed above, click here.

    Maintained by Tim Davis, last updated 12-Mar-2014.
    Matrix pictures by cspy, a MATLAB function in the CSparse package.
    Matrix graphs by Yifan Hu, AT&T Labs Visualization Group.