Matrix: SNAP/email-Enron

Description: Email communication network from Enron

SNAP/email-Enron graph
(undirected graph drawing)


SNAP/email-Enron
scc of SNAP/email-Enron

  • Home page of the UF Sparse Matrix Collection
  • Matrix group: SNAP
  • Click here for a description of the SNAP group.
  • Click here for a list of all matrices
  • Click here for a list of all matrix groups
  • download as a MATLAB mat-file, file size: 684 KB. Use UFget(2290) or UFget('SNAP/email-Enron') in MATLAB.
  • download in Matrix Market format, file size: 505 KB.
  • download in Rutherford/Boeing format, file size: 437 KB.

    Matrix properties
    number of rows36,692
    number of columns36,692
    nonzeros367,662
    # strongly connected comp.1,065
    explicit zero entries0
    nonzero pattern symmetrysymmetric
    numeric value symmetrysymmetric
    typebinary
    structuresymmetric
    Cholesky candidate?no
    positive definite?no

    authorL. Kaelbling, M. Gervasio, W. Cohen
    editorJ. Leskovec
    date2003
    kinddirected graph
    2D/3D problem?no

    Notes:

    Networks from SNAP (Stanford Network Analysis Platform) Network Data Sets,     
    Jure Leskovec http://snap.stanford.edu/data/index.html                         
    email jure at cs.stanford.edu                                                  
                                                                                   
    Enron email network                                                            
                                                                                   
    Dataset information                                                            
                                                                                   
    Enron email communication network covers all the email communication within a  
    dataset of around half million emails. This data was originally made public,   
    and posted to the web, by the Federal Energy Regulatory Commission during its  
    investigation. Nodes of the network are email addresses and if an address i    
    sent at least one email to address j, the graph contains a directed edge from i
    to j. Note that non-Enron email addresses act as sinks and sources in the      
    network as we only observe their communication with the Enron email addresses. 
                                                                                   
    The Enron email data was originally released by William Cohen at CMU.          
                                                                                   
    Dataset statistics                                                             
    Nodes   36692                                                                  
    Edges   367662                                                                 
    Nodes in largest WCC    33696 (0.918)                                          
    Edges in largest WCC    361622 (0.984)                                         
    Nodes in largest SCC    33696 (0.918)                                          
    Edges in largest SCC    361622 (0.984)                                         
    Average clustering coefficient  0.4970                                         
    Number of triangles     727044                                                 
    Fraction of closed triangles    0.08531                                        
    Diameter (longest shortest path)    12                                         
    90-percentile effective diameter    4.8                                        
                                                                                   
    Source (citation)                                                              
                                                                                   
    J. Leskovec, J. Kleinberg and C. Faloutsos. Graphs over Time: Densification    
    Laws, Shrinking Diameters and Possible Explanations. ACM SIGKDD International  
    Conference on Knowledge Discovery and Data Mining (KDD), 2005.                 
                                                                                   
    File    Description                                                            
    email-Enron.txt.gz  Enron email network                                        
    Enron email data    Complete Enron email dataset (includes full email message  
                        text and attachments), at http://www.cs.cmu.edu/~enron/    
                                                                                   
    

    SVD-based statistics:
    norm(A)118.418
    min(svd(A))1.25584e-285
    cond(A)9.42937e+286
    rank(A)24,939
    null space dimension11,753
    full numerical rank?no
    singular value gap2.13047e+06

    singular values (MAT file):click here
    SVD method used:s = svd (full (A))
    status:ok

    SNAP/email-Enron svd

    For a description of the statistics displayed above, click here.

    Maintained by Tim Davis, last updated 12-Mar-2014.
    Matrix pictures by cspy, a MATLAB function in the CSparse package.
    Matrix graphs by Yifan Hu, AT&T Labs Visualization Group.