Graph Embedding Methods vs Link-based Similarity Measures in Task of Similarity Computation of Nodes in Graphs

This repository provides:

Python implementations of the following similarity measures:

SimRank (2002, ACM SIGKDD, https://doi.org/10.1145/775047.775126)
SimRank* (2013, VLDB Endowment, https://doi.org/10.14778/2732219.2732221)
JacSim (2017, Information Sciences, https://doi.org/10.1016/j.ins.2017.06.005)
JPRank (2019, ACM SAC, https://doi.org/10.1145/3297280.3297331)
Cosine

Datasets:

BlogCatalog
Cora
Wikipedia

The following packages are required:

Python       >= 3.8
networkx     =2.6.*
numpy        =1.21.*
scikit-learn =1.0.*

Notes

All the codes are implemented in Python 3.7 by Eclipse PyDev.
The codes can be easily migrated to other Python IDs and it is also possible to use them via command line by applying small changes.
The implementations of link-based similarity measures are based on their matrix forms, which are significantly faster than their component forms.
The provided codes for link-based similarity measures can be applied to both directed and undirected graphs.
The Cosine implementation is based on a matrix/vector multiplication technique, which is significantly faster than its conventional implementation.

Datasets and Graph Structure:

Each dataset has a “ground_truth” folder containing a text file per each label where each line indicates a node id.
A graph is represented as a text file under the edge list format in which, each line corresponds to an edge in the graph, tab is used as the separator, and the node index is started from 0.

Citing:

If you find the provided source codes and datasets useful for your research, please consider citing the following paper:

Hamedani, M.R.; Kim, S-W. On Investigating Both Effectiveness and Efficiency of Embedding Methods in Task of Similarity Computation of Nodes in Graphs. Applied Sciences. 2021, 11, 162. DOI: https://dx.doi.org/10.3390/app11010162

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
codes		codes
datasets		datasets
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Graph Embedding Methods vs Link-based Similarity Measures in Task of Similarity Computation of Nodes in Graphs

Notes

Datasets and Graph Structure:

Citing:

About

Releases

Packages

Languages

License

mrhhyu/Graph-Embedding_vs_Linkbased-Measures

Folders and files

Latest commit

History

Repository files navigation

Graph Embedding Methods vs Link-based Similarity Measures in Task of Similarity Computation of Nodes in Graphs

Notes

Datasets and Graph Structure:

Citing:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages