- panther_similarity(G, source, k=5, path_length=5, c=0.5, delta=0.1, eps=None)[source]#
Returns the Panther similarity of nodes in the graph
Panther is a similarity metric that says “two objects are considered to be similar if they frequently appear on the same paths.” .
- GNetworkX graph
A NetworkX graph
Source node for which to find the top
ksimilar other nodes
- kint (default = 5)
The number of most similar nodes to return
- path_lengthint (default = 5)
How long the randomly generated paths should be (
- cfloat (default = 0.5)
A universal positive constant used to scale the number of sample random paths to generate.
- deltafloat (default = 0.1)
The probability that the similarity \(S\) is not an epsilon-approximation to (R, phi), where \(R\) is the number of random paths and \(\phi\) is the probability that an element sampled from a set \(A \subseteq D\), where \(D\) is the domain.
- epsfloat or None (default = None)
The error bound. Per , a good value is
sqrt(1/|E|). Therefore, if no value is provided, the recommended computed value will be used.
Dictionary of nodes to similarity scores (as floats). Note: the self-similarity (i.e.,
v) will not be included in the returned dictionary.
Zhang, J., Tang, J., Ma, C., Tong, H., Jing, Y., & Li, J. Panther: Fast top-k similarity search on large networks. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Vol. 2015-August, pp. 1445–1454). Association for Computing Machinery. https://doi.org/10.1145/2783258.2783267.
>>> G = nx.star_graph(10) >>> sim = nx.panther_similarity(G, 0)