Markov Logic Networks for the Discovery of Links.
- Java 1.8+
- PostgreSQL 9.4.x
- Gurobi solver
- Maven
- Wget, Unzip
- Download and decompress Mandolin v0.4.0-alpha
- Run
bash install.sh
The following command will discover new links of any predicate (--aim
) on the WordNet dataset (--input
) with mining threshold 0.8 (--mining
) and 1 million Gibbs sampling iterations (--sampling
).
java -Xmx1g -jar target/Mandolin-0.4.0-jar-with-dependencies.jar plain --input data/benchmark/wn18/wordnet-mlj12-train.nt,data/benchmark/wn18/wordnet-mlj12-valid.nt --output eval/wn18 --mining 0.8 --sampling 1000000 --aim "*"
Discovered links can be found in the --output
folder at ./eval/wn18/discovered_X.nt
, where X
is the output threshold, meaning that a file contains all links whose confidence is greater or equal than X
.
An excerpt of the discovered rules and weights:
0.990517419 wn18:_part_of(b, a) => wn18:_has_part(a, b)
0.862068966 wn18:_instance_hypernym(a, c) AND wn18:_synset_domain_topic_of(f, b) => wn18:_synset_domain_topic_of(a, b)
An excerpt of the discovered links with confidence > 0.9:
wn18:08131530 wn18:_has_part wn18:08132046 .
wn18:09189411 wn18:_has_part wn18:08707917 .
wn18:10484858 wn18:_synset_domain_topic_of wn18:08441203 .
wn18:01941987 wn18:_synset_domain_topic_of wn18:00300441 .
Mandolin can be launched as follows.
java -Xmx1g -jar target/Mandolin-0.4.0-jar-with-dependencies.jar <GOAL> <PARAMETERS>
Goal | Description |
---|---|
plain |
Launch a plain Mandolin execution. |
eval |
Evaluate MRR and hits@k. |
Parameters for plain
goal:
Parameter | Description | Example value |
---|---|---|
--input |
Comma-separated N-Triple files. | data1.nt,data2.nt |
--output |
Workspace and output folder. | eval/experiment1 |
--aim |
Aim predicate. For all predicates use wildcard * . |
http://www.w3.org/2002/07/owl#sameAs |
--mining |
Rule mining threshold. | 0.9 (default: 0.0 support) |
--sampling |
Gibbs sampling iterations. | 1000000 (default: 100 x evidence size) |
--rules |
Maximum number of rules. | 1500 (default: none) |
--sim |
Enable similarity among literals as min,step,max . |
0.8,0.1,0.9 (default: none) |
--onto |
Enable ontology import. | true (default: false ) |
--fwc |
Enable forward-chain. | true (default: false ) |
The eval
goal takes two parameters: the N-Triples file of the test set and Mandolin's output directory.
Example run:
java -Xmx1g -jar target/Mandolin-0.4.0-jar-with-dependencies.jar eval data/benchmark/wn18/wordnet-mlj12-test.nt eval/wn18
- Clone project:
git clone https://github.com/mommi84/Mandolin.git
cd Mandolin
- Get PostgreSQL 9.4.x - Ubuntu/Debian binaries
- Launch
bash install.sh -c
- Insert PostgreSQL setting parameters into a file
./mandolin.properties
. Example:
# GENERAL CONFIGURATION FOR MANDOLIN
pgsql_home=/usr/local/Cellar/postgresql/9.4.1
pgsql_username=tom
pgsql_password=
pgsql_url=localhost
-
Download data
-
Compile project:
export MAVEN_OPTS=-Xss4m
mvn clean compile assembly:single
After using Mandolin, stop the DB instance with:
sh pgsql-stop.sh
The instance can be restarted with:
sh pgsql-start.sh
Mandolin is licensed under GNU General Public License v2.0. AMIE is licensed under Creative Commons Attribution-NonComercial license v3.0. ProbKB is licensed under the BSD license. RockIt is licensed under the MIT License. Gurobi can be activated using a free academic license.