MBA智库文档行业 IT互联网数据挖掘：网络概念与网络建模.pdf

数据挖掘：网络概念与网络建模.pdf

下载

Liljoe1222

50页 | 1.44MB | 0次下载 |

0.0

(0人评价)

我要评价：

投诉举报

用手机看文档

扫一扫,手机看文档

下载

开通VIP

Introduction to Introduction to NetworksNetworks  Basic Measures of Networks  Centrality Analysis in Networks  Modeling of Network Formation  Primitives of Social Networks Networks and Their RepresentationsNetworks and Their Representations  A network/graph: G = (V, E), where V: vertices/nodes, E: edges/links  E: a subset of V × V, n = |V| (order of G), m = |E| (size of G).  Multi-edge: if more than one edge between the same pair of vertices  loop: if an edge connects vertex to itself (., (v i, vi))  Simple network: if a network has neither self-edges nor multi-edges  Adjacency matrix:  Aij = 1 if there is an edge between vertices i and j; 0 otherwise  Directed graph (digraph): if each edge has a direction (tail head) →  Aij = 1 if there is an edge from j to i; 0 otherwise  Weighted graph: If a weight wij (usually a real number) is associated with each edge vij Basic Basic NetworkNetwork Structures and Properties Structures and Properties  Subgraph: A subset of the nodes and edges in a graph/network  Given a subset of vertices V’ V, the induced subgraph G’ = (V’, E’) consists exactly of all the edges present in G between vertices in V’  Clique(complete graph): Every node is connected to every other  Singleton vs. dyad (two nodes and their relationship) vs. triad: B C D E F A  Ego-centric network: A network pull out by selecting a node and all of its connections  The 1-degree egocentric network of A  The -degree egocentric network of A  The 2-degree egocentric network of A Vertex Degree for Undirected & Directed NetworksVertex Degree for Undirected & Directed Networks  Let a network G = (V, E)  Undirected Network  Degree (or degree centrality) of a vertex: d(vi)  # of edges connected to it, ., d(A) = 4, d(H) = 2  Directed network  In-degree of a vertex din(vi):  # of edges pointing to vi  ., din(A) = 3, din(B) = 2  Out-degree of a vertex dout(vi):  # of edges from vi  ., dout(A) = 1, dout(B) = 2 Degree Distribution and PathDegree Distribution and Path  Degree sequence of a graph: The list of degrees of the nodes sorted in non-increasing order  ., in graph G1, degree sequence: (4, 3, 2, 2, 1)  Degree frequency distribution of a graph: Let Nk denote the # of vertices with degree k  (N0, N1, …, Nt), t is max degree for a node in G  ., in graph G1, degree freq. distrib.: (0, 1, 2, 1, 1)  Degree distribution of a graph: Probability mass function f for random variable X  (f(0), f(1), …, f(t), where f(k) = P(X = k) = Nk/n  ., in graph G1, degree distrib.: (0, , , , )  Walk in a graph G between nodes X and Y: ordered sequence of vertices, starting at X and ending at Y, . there is an edge between every pair of consecutive vertices  Hops: the length of the walk  Path: a walk with distinct vertices  Distance: the length of the shortest path Graph G1 Radius and Diameter of a NetworkRadius and Diameter of a Network  Eccentricity: The eccentricity of a node vi is the maximum distance from vi to any other nodes in the graph  e(vi) = maxj {d(vi, vj)}  ., e(A) = 1, e(F) = e(B) = e(D) = e(H) = 2  Radius of a connected graph G: the min eccentricity of any node in G  r(G) = mini {e(vi)} = mini {maxj {d(vi, vj)}}  ., r(G1) = 1  Diameter of a connected graph G: the max eccentricity of any node in G  d(G) = maxi {e(vi)} = maxi, j {d(vi, vj)}  ., d(G1) = 2  Diameter is sensitive to outliers. Effective diameter: min # of hops for which a large fraction, typically 90%, of all connected pairs of nodes can reach each other. Graph G1 PathsPaths  Geodesic path: shortest path  Geodesic paths are not necessarily unique: It is quite possible to have more than one path of equal length between a given pair of vertices  Diameter of a graph: the length of the longest geodesic path between any pair of vertices in the network for which a path actually exists  Eulerian path: a path that traverses each edge in a network exactly once The Königsberg bridge problem  Hamilton path: a path that visits each vertex in a network exactly once Other PathsOther Paths Components in Directed & Undirected NetworkComponents in Directed & Undirected Network The graph contains 2 weakly connected and 5 strongly connected components Weakly connected graph: there is a node in the graph such that this node can be reached but this node cannot reach any other nodes except for itself Out-component of a node n: the subgraph including any nodes that can be reached by n In-component of n: the subgraph including any nodes that can reach n Independent Paths, Connectivity, and Cut SetsIndependent Paths, Connectivity, and Cut Sets  Two path connecting a pair of vertices (A, B) are edge-independent if they share no edges  Two path are vertex-independent if they share no vertices other than the starting and ending vertices  A vertex cut set is a set of vertices whose removal will disconnected a specified pair of vertices  An edge cut set is a set of edges whose removal will disconnected a specified pair of vertices  A minimum cut set: the smallest cut set that will disconnect a specified pair of vertices  Menger’s theorem => maxflow/min-cut theorem: For a pair of vertices, size of min-cut set = vertex connectivity = maximum flow  This works also for weighted networks multiple size-2 edge/vertex cut set Clustering CoefficientClustering Coefficient  The clustering coefficient of a node vi is a measure of the density of edges in the neighborhood of vi  Let Gi = (Vi, Ei) be the subgraph induced by the neighbors of vertex vi, |Vi| = ni (# of neighbors of vi), and |Ei| = mi (# of edges among the neighbors of vi)  Clustering coefficient of vi for undirected network is  For directed network,  Clustering coefficient of a graph G: Averaging the local clustering coefficient of all the vertices (Watts & Strogatz): Co-citation and Bibliographic CouplingCo-citation and Bibliographic Coupling  Co-citation of vertices i and j: # of vertices having outgoing edges pointing to both i and j  Co-citation of i and j:  Co-citation matrix: It is a symmetric matrix  Diagonal matrix (Cii): total # papers citing i  Bibliographic coupling of vertices i and j: # of other vertices to which both point  Bibliographic coupling of i and j:  Co-citation matrix:  Diagonal matrix (Bii): total # papers cited by i Vertices i and j are co-cited by 3 papers Vertices i and j cite 3 same papers Cocitation & Bibliographic Coupling: ComparisonCocitation & Bibliographic Coupling: Comparison  Two measures are affected by the number of incoming and outgoing edges that vertices have  For strong co-citation: must have a lot of incoming edges  Must be well-cited (influential) papers, surveys, or books  Takes time to accumulate citations  Strong bib-coupling if two papers have similar citations  A more uniform indicator of similarity between papers  Can be computed as soon as a paper is published  Not change over time  Recent analysis algorithms  HITS explores both co-citation and bibliographic coupling Bipartite NetworksBipartite Networks  Bipartite Network: 1. two kinds of vertices 2. edges linking only vertices of unlike types  Incidence matrix:  Bij = 1 if vertex j links to group i 0 otherwise  One can create a one-mode project from the two-mode partite form (but with info loss) Introduction to Introduction to NetworksNetworks  Basic Measures of Networks  Centrality Analysis in Networks  Modeling of Network Formation  Primitives of Social Networks Centrality: Basic Measure in a NetworkCentrality: Basic Measure in a Network  Centrality: How “central” a node is in the network  Degree centrality: degree of a node (the higher degree, more important the node)  Eccentricity centrality: the less eccentric, the more central  c(vi) = 1/e(vi)  Central node: e(vi) = r(G) (if it equals the radius of G)  Periphery node: e(vi) = d(G) (if it equals the diameter of G)  Often used in facility location, ., emergency center  Closeness centrality: the average of the shortest path length from the node to every other node in the network, indicating how close a node is to all other nodes in the network  c(vi) = 1/∑j d(vi, vj)  median node vm if vm has the smallest total distance ∑j d(vm, vj)  Facility location, ., shopping center, minimize total distance Centrality Measures (II)Centrality Measures (II)  Betweenness centrality for a node v: # of shortest paths from all vertices to all others that pass through v  ηjk: # of shortest paths between vertices vj and vk  ηjk(vi): # of such paths that contain vi  Betweenness centrality of a vertex vi:  Indicating a central “monitoring role played by vi for various pairs of nodes  Eigenvector centrality: Measure the influence of a node in a network, ., connections to high-scoring nodes contribute more to the score of the node in question than equal connections to low-scoring nodes Centrality Measures on the Web (I): Centrality Measures on the Web (I): Eigenvector CentralityEigenvector Centrality  Web: a directed graph, PageRank and HITS are typical algorithms  Eigenvector centrality, or prestige, importance, or rank of a node v  The more nodes point to v, the higher v’s prestige  The more prestige of a node pointing to v, the higher v’s prestige  Let p(u) be prestige score for node u. Then  Written in vector form:  At k-th iteration, we have  Vetor pk converges to the dominant eigenvector of AT with increasing k Centrality Measures on the Web (II): Centrality Measures on the Web (II): PageRankPageRank  Random surfing assumption: A web surfer randomly chooses one of the outgoing links from the current page or with some very small probability randomly jumps to any other page in the web graph  Pagerank of a page v : the probability of a random web surfer landing at v  Normalized prestige:  The prob. of visiting a page pointed by v is 1/dout(v), dout is outdegree of v  Compute updated pagerank vector for v, where N(u, v) is the normalized adjacency matrix of the graph, and N(u, v) = 1/dout(u) if (u, v) in E or 0 .  Random Jumps: a small prob. jumping to any other node (viewing web as a fully connected graph, ., adjacency matrix Ar = 1nXn)  Pagerank score computation: PageRank: Capturing Page PopularityPageRank: Capturing Page Popularity (Brin & Page’98)(Brin & Page’98)  Intuitions  Links are like citations in literature  A page that is cited often can be expected to be more useful in general  PageRank is essentially “citation counting”, but improves over simple counting  Consider “indirect citations” (being cited by a highly cited paper counts a lot…)  Smoothing of citations (every page is assumed to have a non-zero citation count)  PageRank can also be interpreted as a random surfing model (thus capturing popularity)  At any page, With prob. , randomly jumping to a page With prob. (1 – ), randomly picking a link to follow Centrality Measures on the Web (III): Centrality Measures on the Web (III): HITS (HITS (Computing Computing Hub & Authority ScoresHub & Authority Scores))  For a specific query, a page of high Pagerank score may not be that relevant  HITS (Hyperlink Induced Topic Search) computes two values for a page  Authority score: analogous to pagerank/prestige scores  Hub score: based on how many “good” pages it points to  How is HITS query-based?  first uses standard search engines to retrieve the set of relevant pages  then expands the set to include any page that point to or is pointed to by) some pages in the set  Any pages originating from the same host are eliminated  HITS is only applied on this expanded query-specific graph G  Computation:  In matrix computation (essentially two eigenvector computation): HITS: Capturing Authorities & Hubs HITS: Capturing Authorities & Hubs (Kleinberg’98)(Kleinberg’98)  Intuitions of HITS (Hyperlink Induced Topic Search)  Pages that are widely cited are good authorities  Pages that cite many other pages are good hubs  The key idea of HITS  Good authorities are cited by good hubs  Good hubs point to good authorities  Iterative reinforcement …  AAT is the co-citation matrix and ATA is the bibliographic coupling matrix. Authority centrality is eigenvector centrality for the co-citation network Metrics (Measures) in Social Network Analysis (I)Metrics (Measures) in Social Network Analysis (I)  Betweenness: The extent to which a node lies between other nodes in the network. This measure takes into account the connectivity of the node's neighbors, giving a higher value for nodes which bridge clusters. The measure reflects the number of people who a person is connecting indirectly through their direct links  Bridge: An edge is a bridge if deleting it would cause its endpoints to lie in different components of a graph.  Centrality: This measure gives a rough indication of the social power of a node based on how well they "connect" the network. "Betweenness", "Closeness", and "Degree" are all measures of centrality.  Centralization: The difference between the number of links for each node divided by maximum possible sum of differences. A centralized network will have many of its links dispersed around one or a few nodes, while a decentralized network is one in which there is little variation between the number of links each node possesses. Metrics (Measures) in Social Network Analysis (II)Metrics (Measures) in Social Network Analysis (II)  Closeness: The degree an individual is near all other individuals in a network (directly or indirectly). It reflects the ability to access information through the "grapevine" of network members. Thus, closeness is the inverse of the sum of the shortest distances between each individual and every other person in the network  Clustering coefficient: A measure of the likelihood that two associates of a node are associates themselves. A higher clustering coefficient indicates a greater 'cliquishness'.  Cohesion: The degree to which actors are connected directly to each other by cohesive bonds. Groups are identified as ‘cliques’ if every individual is directly tied to every other individual, ‘social circles’ if there is less stringency of direct contact, which is imprecise, or as structurally cohesive blocks if precision is wanted.  Degree (or geodesic distance): The count of the number of ties to other actors in the network. Metrics (Measures) in Social Network Analysis (III)Metrics (Measures) in Social Network Analysis (III)  (Individual-level) Density: The degree a respondent's ties know one another/ proportion of ties among an individual's nominees. Network or global-level density is the proportion of ties in a network relative to the total number possible (sparse versus dense networks).  Flow betweenness centrality: The degree that a node contributes to sum of maximum flow between all pairs of nodes (not that node).  Eigenvector centrality: A measure of the importance of a node in a network. It assigns relative scores to all nodes in the network based on the principle that connections to nodes having a high score contribute more to the score of the node in question.  Local Bridge: An edge is a local bridge if its endpoints share no common neighbors. Unlike a bridge, a local bridge is contained in a cycle.  Path Length: The distances between pairs of nodes in the network. Average path-length is the average of these distances between all pairs of nodes. Metrics (Measures) in Social Network Analysis (IV)Metrics (Measures) in Social Network Analysis (IV)  Prestige: In a directed graph prestige is the term used to describe a node's centrality. "Degree Prestige", "Proximity Prestige", and "Status Prestige" are all measures of Prestige.  Radiality Degree: an individual’s network reaches out into the network and provides novel information and influence.  Reach: The degree any member of a network can reach other members of the network.  Structural cohesion: The minimum number of members who, if removed from a group, would disconnect the group  Structural equivalence: Refers to the extent to which nodes have a common set of linkages to other nodes in the system. The nodes don’t need to have any ties to each other to be structurally equivalent.  Structural hole: Static holes that can be strategically filled by connecting one or more links to link together other points. Linked to ideas of social capital: if you link to two people who are not linked you can control their communication Introduction to Introduction to NetworksNetworks  Basic Measures of Networks  Centrality Analysis in Networks  Modeling of Network Formation  Primitives of Social Networks Why Network Modeling?Why Network Modeling?  Many real-world networks exhibit certain common characteristics, even though they come from very different domains, ., communication, social, and biological networks  A typical network has the following common properties:A typical network has the following common properties:  Few connected components:  often only 1 or a small number, independent of network size  Small diameter:  often a constant independent of network size (like 6)  growing only logarithmically with network size or even shrink?  typically exclude infinite distances  A high degree of clustering:  considerably more so than for a random network  A heavy-tailed degree distribution:  a small but reliable number of high-degree vertices  often of power law form Common Properties of Real NetworksCommon Properties of Real Networks  Real network: large, sparse (# of edges |E| = O(n), n: # of nodes)  Small-world property: Avg. path length µL scales logarithmically with n (# of nodes in the graph):  Ultra-small-world property:  Scale-free property (power law distribution): most nodes have very small degree, but a few hub nodes have high degrees  The probability that a node has degree k:  log-log plot shows a straight line:  Clustering effect: Two nodes are more likely to be connected if they share a common neighbor  Clustering effect: a high clustering coefficient for graph G  C(k): avg clustering coefficient for nodes with degree k  Power law relationship between C(k) and k: Probabilistic Models of NetworksProbabilistic Models of Networks  All of the network generation models we will study are probabilistic or statistical in nature  They can generate networks of any size  They often have various parameters that can be set:  size of network generated  average degree of a vertex  fraction of long-distance connections  The models generate a distribution over networks  Statements are always statistical in nature:  with high probability, diameter is small  on average, degree distribution has heavy tail  Thus, we’re going to need some basic statistics and probability theory Power Law (or Pareto) DistributionsPower Law (or Pareto) Distributions  Pareto distribution (heavy-tailed, or power law):  Pareto(x|k,m) = k mk x–(k+1) I(x ≥ m)  x must be greater than some constant m but not too much greater  Distributions: mode = m  mean = km/(k-1) if k > 1  Variance = m2k/((k-1)2(k-2)) if k > 2  For variables assuming integer values > 0, probability of value x ~ 1/xα  This is why it is called power law distribution, also referred to as scale- free  If we plot the distribution on a log-log scale, it forms a straight line  Typically 0 < α < 2; smaller α gives heavier tail  Why long tails or heavy tails? For binomial, normal, and Poisson distributions, the tail probabilities approach 0 exponentially fast  What kind of phenomena does this distribution model?  Word frequency vs their rank (the, of, …); wealth distribution; … Distinguishing Distributions in DataDistinguishing Distributions in Data  All these distributions are idealized models  In practice, we do not see distributions, but data  Typical procedure to distinguish between Poisson, power law, …  might restrict our attention to a range of values of interest  accumulate counts of observed data into equal-sized bins  look at counts on a log-log plot  Power law:  log(P(X = x)) = log(1/xα) = – α log(x)  linear, slope –α  Normal:  log(P(X = x)) = log(α exp(–x2/b)) = log(α) – x2/b  non-linear, concave near mean  Poisson:  log(P(X = x)) = log(exp(–l) lx/x!)  also non-linear Zipf’s LawZipf’s Law  Pareto distribution vs. Zipf’s Law  Pareto distributions are continuous probability distributions  Zipf's law: a discrete counterpart of the Pareto distribution  Zipf's law:  Given some corpus of natural language utterances, the frequency of any word is inversely proportional to its rank in the frequency table  Thus the most frequent word will occur approximately twice as often as the second most frequent word, which occurs twice as often as the fourth most frequent word, etc.  General theme:  rank events by their frequency of occurrence  resulting distribution often is a power law!  Other examples:  North America city sizes  personal income  file sizes  genus sizes (number of species) Linear scales on both axes Logarithmic scales on both axes The same data plotted on linear and logarithmic scales. Both plots show a Zipf distribution with 300 data points Zipf DistributionZipf Distribution Some Models of Network GenerationSome Models of Network Generation  Erdös-Rényi Random graph model:  Gives few components and small diameter  does not give high clustering and heavy-tailed degree distributions  is the mathematically most well-studied and understood model  Watts-Strogatz small world graph model:  gives few components, small diameter and high clustering  does not give heavy-tailed degree distributions  Barabási-Albert Scale-free model:  gives few components, small diameter and heavy-tailed distribution  does not give high clustering  Hierarchical network:  few components, small diameter, high clustering, heavy-tailed  Affiliation network:  models group-actor formation Erdös-Rényi (ER) Random Graph ModelErdös-Rényi (ER) Random Graph Model  A random graph is obtained by starting with a set of N vertices and adding edges between them at random  Different random graph models produce different probability distributions on graphs  Most commonly studied is the Erdős–Rényi model, denoted G(N, p), in which every possible edge occurs independently with probability p  Random graphs were first defined by Paul Erdős and Alfréd Rényi in their 1959 paper "On Random Graphs”  The usual regime of interest is when p ~ 1/N, N is large  ., p = 1/2N, p = 1/N, p = 2/N, p = 10/N, p = log(N)/N, etc.  in expectation, each vertex will have a “small” number of neighbors  will then examine what happens when N --> infinity  can thus study properties of large networks with bounded degree  Sharply concentrated; not heavy-tailed The Watts and Strogatz ModelThe Watts and Strogatz Model  Proposed by Duncan J. Watts and Steven Strogatz in their joint 1998 Nature paper  A random graph generation model that produces graphs with small- world properties, including short average path lengths and high clustering  The model also became known as the (Watts) beta model after Watts used β to formulate it in his popular science book Six Degrees  The ER graphs fail to explain two important properties observed in real- world networks:  By assuming a constant and independent probability of two nodes being connected, they do not account for local clustering, ., having a low clustering coefficient  Do not account for the formation of hubs. Formally, the degree distribution of ER graphs converges to a Poisson distribution, rather than a power law observed in most real-world, scale-free networks Barabási–AlbertBarabási–Albert Scale-Free Model Scale-Free Model  Major limitation of the Watts-Strogatz model  It produces graphs that are homogeneous in degree  Real networks are often inhomogeneous in degree, having hubs and a scale-free degree distribution (scale-free networks)  Scale-free networks are better described by the preferential attachment family of models, ., the Barabási–Albert (BA) model  Edges from the new vertex are more likely to link to nodes with higher degrees  The rich-get-richer approach  This leads to the proposal of a new model: scale-free network, a network whose degree distribution follows a power law, at least asymptotically What Does that Mean?What Does that Mean? Poisson distribution Exponential Network Power-law distribution Scale-free Network Scale-Free Networks: Major IdeasScale-Free Networks: Major Ideas  The number of nodes (N) is not fixed  Networks continuously expand by additional new nodes  WWW: addition of new nodes  Citation: publication of new papers  The attachment is not uniform  A node is linked with higher probability to a node that already has a large number of links  WWW: new documents link to well known sites (CNN, Yahoo, Google)  Citation: Well cited papers are more likely to be cited again Generation of Scale-Free NetworkGeneration of Scale-Free Network  Start with (say) two vertices connected by an edge  For i = 3 to N:  for each 1 <= j < i, d(j) = degree of vertex j so far  let Z = ∑ d(j) (sum of all degrees so far)  add new vertex i with k edges back to {1, …, i – 1}:  i is connected back to j with probability d(j)/Z  Vertices j with high degree are likely to get more links! —“Rich get richer”  Natural model for many processes:  hyperlinks on the web  new business and social contacts  transportation networks  Generates a power law distribution of degrees  exponent depends on value of k  Preferential attachment explains  heavy-tailed degree distributions  small diameter (~log(N), via “hubs”)  Will not generate high clustering coefficient  no bias towards local connectivity, but towards hubs 03/01/15 Robustness of Robustness of Random vs. Scale-Free NetworksRandom vs. Scale-Free Networks  The accidental failure of a number of nodes in a random network can fracture the system into non- communicating islands.  Scale-free networks are more robust in the face of such failures  Scale-free networks are highly vulnerable to a coordinated attack against their hubs Case1: Internet BackboneCase1: Internet Backbone (Faloutsos, Faloutsos and Faloutsos, 1999) Nodes: computers, routers Links: physical lines 03/01/15 45 Internet-MapInternet-Map Introduction to Introduction to NetworksNetworks  Basic Measures of Networks  Centrality Analysis in Networks  Modeling of Network Formation  Primitives of Social Networks Social NetworksSocial Networks  Social network: A social structure made of nodes (individuals or organizations) that are related to each other by various interdependencies like friendship, kinship, like, ...  Graphical representation  Nodes = members  Edges = relationships  Examples of typical social networks on the Web  Social bookmarking ()  Friendship networks (Facebook, Myspace, LinkedIn)  Blogosphere  Media Sharing (Flickr, Youtube)  Folksonomies Communication NetworksCommunication Networks The Earth is developing an electronic nervous system, a network with diverse nodes and links are -computers -routers -satellites -phone lines -TV cables -EM waves Communication networks: Many non-identical components with diverse connections between them ““Natural” Networks and UniversalityNatural” Networks and Universality  Consider many kinds of networks:  social, technological, business, economic, content,…  These networks tend to share certain informal properties:  large scale; continual growth  distributed, organic growth: vertices “decide” who to link to  interaction restricted to links  mixture of local and long-distance connections  abstract notions of distance: geographical, content, social, …  Social network theory and link analysis  Do natural networks share more quantitative universals?  What would these “universals” be?  How can we make them precise and measure them?  How can we explain their universality? Ref: Introduction to NetworksRef: Introduction to Networks  S. Brin and L. Page, The anatomy of a large scale hypertextual Web search engine. WWW7.  S. Chakrabarti, B. Dom, D. Gibson, J. Kleinberg, . Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Mining the link structure of the World Wide Web. IEEE Computer’99  D. Cai, X. He, J. Wen, and W. Ma, Block-level Link Analysis. SIGIR'2004.  P. Domingos, Mining Social Networks for Viral Marketing. IEEE Intelligent Systems, 20(1), 80-82, 2005.  D. Easley and J. Kleinberg, Networks, Crowds, and Markets: Reasoning About a Highly Connected World, Cambridge Univ. Press, 2010.  L. Getoor: Lecture notes from Lise Getoor’s website:  D. Kempe, J. Kleinberg, and E. Tardos, Maximizing the Spread of Influence through a Social Network. KDD’03.  J. M. Kleinberg, Authoritative Sources in a Hyperlinked Environment, J. ACM, 1999  D. Liben-Nowell and J. Kleinberg. The Link Prediction Problem for Social Networks. CIKM’03  M. Newman, Networks: An Introduction, Oxford Univ. Press, 2010. Introduction to Networks Networks and Their Representations Basic Network Structures and Properties Vertex Degree for Undirected & Directed Networks Degree Distribution and Path Radius and Diameter of a Network Paths Other Paths Components in Directed & Undirected Network Slide 10 Independent Paths, Connectivity, and Cut Sets Clustering Coefficient Co-citation and Bibliographic Coupling Cocitation & Bibliographic Coupling: Comparison Bipartite Networks Slide 16 Centrality: Basic Measure in a Network Centrality Measures (II) Centrality Measures on the Web (I): Eigenvector Centrality Centrality Measures on the Web (II): PageRank PageRank: Capturing Page Popularity (Brin & Page’98) Centrality Measures on the Web (III): HITS (Computing Hub & Authority Scores) HITS: Capturing Authorities & Hubs (Kleinberg’98) Metrics (Measures) in Social Network Analysis (I) Metrics (Measures) in Social Network Analysis (II) Metrics (Measures) in Social Network Analysis (III) Metrics (Measures) in Social Network Analysis (IV) Slide 28 Why Network Modeling? Common Properties of Real Networks Probabilistic Models of Networks Power Law (or Pareto) Distributions Distinguishing Distributions in Data Zipf’s Law Zipf Distribution Some Models of Network Generation Erdös-Rényi (ER) Random Graph Model The Watts and Strogatz Model Barabási–Albert Scale-Free Model Slide 40 Scale-Free Networks: Major Ideas Generation of Scale-Free Network Robustness of Random vs. Scale-Free Networks Slide 44 Internet-Map Slide 46 Social Networks Slide 48 “Natural” Networks and Universality Ref: Introduction to Networks

联系我们

智库文档公众号

客服微信

数据挖掘：网络概念与网络建模.pdf

下载

标签

相关专题更多

联系我们

意见反馈

标签

相关专题 更多

联系我们

意见反馈

相关专题更多