Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to obtain the data set in experimental format #7

Open
buthi opened this issue Nov 9, 2022 · 5 comments
Open

How to obtain the data set in experimental format #7

buthi opened this issue Nov 9, 2022 · 5 comments
Labels
help wanted Extra attention is needed

Comments

@buthi
Copy link

buthi commented Nov 9, 2022

Hello, I am also doing research on graph network now. Your article is very enlightening. I would like to ask, when I was looking for data sets, I found that data sets like CITEand DBLP are all in text or json format, and I could not find dataset with adjacency matrix and attribute data. I want to know whether your experimental data is processed by yourself and then experimented with these original data sets, or where you can find the data in graph format?

@yueliu1999
Copy link
Owner

Thanks. The other datasets can be found at this link: https://github.com/yueliu1999/Awesome-Deep-Graph-Clustering#benchmark-datasets

@yueliu1999 yueliu1999 added the help wanted Extra attention is needed label Nov 9, 2022
@buthi
Copy link
Author

buthi commented Nov 9, 2022

谢谢。其他数据集可在此链接中找到:https ://github.com/yueliu1999/Awesome-Deep-Graph-Clustering#benchmark-datasets
Thank you for your advice! I have already star this library of yours before. I would like to know whether these datasets are processed by yourself or just existing in official datasets

@yueliu1999
Copy link
Owner

Yes, these datasets are processed by myself. I just uniformly store the official data with the npy format. The reason is that the different formats of datasets are hard to process during coding.

@buthi
Copy link
Author

buthi commented Nov 9, 2022

是的,这些数据集是我自己处理的。我只是统一用npy格式存储官方数据。原因是在编码过程中难以处理不同格式的数据集。
Thank you. May I ask how you handled the data? In addition, whether to do graph model research, the data set is their own processing into the form of graph

@yueliu1999
Copy link
Owner

I just read the original data and transfer them to numpy.ndarray,and then save them with npy format. Some data is the nature graph and others is the processed graph.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants