hdt_tutorial

Initial Setup

Install HDT C++

./autogen.sh
./configure
make -j2
make install

Troubleshooting:

** copy serd/serd.h into src/libhdt folder

** configure path to serd library:

export LD_LIBRARY_PATH=/my_folder/serd-0/build
make check SERD_LIBS="-L/ivi/ilps/personal/svakule/serd-0/build/ -lserd-0" SERD_CFLAGS="-I/ivi/ilps/personal/svakule/serd-0/serd"

Install pyHDT:

git clone https://github.com/webdata/pyHDT.git
cd pyHDT/
./install.sh

Download and uncompress the HDT file and its index (if available) from http://www.rdfhdt.org/datasets/ e.g.

wget http://gaia.infor.uva.es/hdt/wikidata/wikidata20200309.hdt.gz
wget http://gaia.infor.uva.es/hdt/wikidata/wikidata20200309.hdt.index.v1-1.gz
or
wget http://gaia.infor.uva.es/hdt/freebase-rdf-2013-12-01-00-00.hdt.gz

Setup an Entity Catalog

Dump entity (predicate) labels into a separate file:

  -o Dump also objects

  -u Dump only URIs

  -p <exportPredicateFile> exportPredicateFile (outPred.txt by default)

  -t <exportTermsFile> portTermsFile (outTerms.txt by default)

e.g.

hdt-cpp/libhdt/tests/dumpDictionary wikidata20200309.hdt -o -u -t wikidata20200309Entities.txt
hdt-cpp/libhdt/tests/dumpDictionary wikidata20200309.hdt -o -t wikidata20200309Terms.txt
hdt-cpp/libhdt/tests/dumpDictionary wikidata20200309.hdt -p wikidata20200309Pred.txt

Setup Elastic Search

Point to the JDK in Elastic Search

export JAVA_HOME=YOUR_PATH/elasticsearch-7.6.1/jdk

Run Elastic Search

./bin/elasticsearch

Make sure ES is running with

curl -XGET 'http://localhost:9200'

Define index mapping in ES (see mapping.json)
Index entity labels into Elastic Search to create an entity catalog:

python index_entities.py

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
README.md		README.md
index_entities.py		index_entities.py
mapping.json		mapping.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hdt_tutorial

Initial Setup

Setup an Entity Catalog

About

Releases

Packages

Languages

svakulenk0/hdt_tutorial

Folders and files

Latest commit

History

Repository files navigation

hdt_tutorial

Initial Setup

Setup an Entity Catalog

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages