MMmultilayer

Mixed membership EM algorithms for multi-layered data from the manuscript Tensorial and bipartite block models for link prediction in layered networks and temporal networks (Tarrés-Deulofeu, Godoy-Lorite, Guimerà and Sales-Pardo)

The code for the tensorial model applied to multi-layered data with R types of interactions is:

TensorialMMSBM.py

Usage: pypy TensorialMMSBM.py traindata testdata #nodes #layers #groupsnodes #groupslayers #labels #initializations output

The code for the bipartite model applied to multi-layered data with R types of interactions is:

BipartiteMMSBM.py

Usage: pypy BipartiteMMSBM.py traindata testdata #nodes #layers #groupsnodes #groupslayers #labels #initializations output

traindata - file with training data (see XXXtrainXXX.dat in the repository for an example and explanations below) testdata - file with test data (see XXXtestXXX.dat in the repository for an example and explanations below) #nodes - number of nodes in dataset #layers - number of layers in dataset #groupsnodes - number of groups of nodes K considered in the MMSBM #groupslayers - number of groups of layers L considered in the dataset #labels - number of possible interaction types, R #initializations - number of maximization runs. The final output is the average over results for each run. #output - if 1, the code prints the model parameters for each one of the runs; if 0, it does not print any model parameters #outputfolder - folder path to put all the output files

Train/Test datafile specifications:

Nodes(links) and layers should be numbered consecutively [0,...,#nodes] , [0,...,#layers]

The format of the files should be a 4 column file:

layer node1 node2 interaction_type

Columns must be separated with simple spaces. The files drugstrain0.dat and drigstest0.dat provide an example of input file.

Code output:

scores file: Probability of a link being of type 0,1,2...R for each link in the training dataset. For each link, the output probability is the average probability over the #initializations runs.

params files: first line - maximum-liklyhood value of the iteration next <n_nodes> lines - membership vector of a node to be in each community out of the K next <n_layers> lines - membership vector of a layer to be in each community out of the L the rest of the lines in the file - for each pair of communities and layer group, the probabilitie(?) of the something to be in a certain edge type

in order to run the test on the given data:

pypy TensorialMMSBM.py drugstrain0.dat drugstest0.dat 69 85 5 5 3 10 1 output #nodes #layers #groupsnodes #groupslayers #labels #initializations #output #outputfolder

Note: you can also use python to run the script (instead of pypy).

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Tool		Tool
input		input
output/1_K6L3		output/1_K6L3
BipartiteMMSBM.py		BipartiteMMSBM.py
README.md		README.md
TMMSBMparamsK5L5_0.dat		TMMSBMparamsK5L5_0.dat
TMMSBMparamsK5L5_1.dat		TMMSBMparamsK5L5_1.dat
TMMSBMparamsK5L5_2.dat		TMMSBMparamsK5L5_2.dat
TMMSBMparamsK5L5_3.dat		TMMSBMparamsK5L5_3.dat
TMMSBMparamsK5L5_4.dat		TMMSBMparamsK5L5_4.dat
TMMSBMparamsK5L5_5.dat		TMMSBMparamsK5L5_5.dat
TMMSBMparamsK5L5_6.dat		TMMSBMparamsK5L5_6.dat
TMMSBMparamsK5L5_7.dat		TMMSBMparamsK5L5_7.dat
TMMSBMparamsK5L5_8.dat		TMMSBMparamsK5L5_8.dat
TMMSBMparamsK5L5_9.dat		TMMSBMparamsK5L5_9.dat
TMMSBMscoresK5L5.dat		TMMSBMscoresK5L5.dat
TensorialMMSBM.py		TensorialMMSBM.py
drugstest0.dat		drugstest0.dat
drugstrain0.dat		drugstrain0.dat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MMmultilayer

Train/Test datafile specifications:

Code output:

in order to run the test on the given data:

About

Releases

Packages

Languages

Geutg/MMmultilayer

Folders and files

Latest commit

History

Repository files navigation

MMmultilayer

Train/Test datafile specifications:

Code output:

in order to run the test on the given data:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages