Scalable Hashtag Recommender System

A hashtag recommender system based on k-means, mini-batch fast k-means and a deep learning feature extraction phase.[1] Usage: hashtag-recommender-system [OPTIONS] Recommends appropriate hashtags for a given image.

Check OPTIONS.md for the list of the possible parameters.

Deploy on Amazon aws via Flintrock

Run deploy.sh inside the "deploy" folder. The script is configured to upload another script on the master and on each slave machine. It copies the jar file on each machine. Finally it runs the previously uploaded script "nodesetup" on each machine in parallel. This script downloads and installs all the dependencies (e.g. python, python libraries ecc.) N.B in order to let flintrock access aws machine you need to export the aws keys and id as system keys via: export AWS_ACCESS_KEY_ID="yourawsid" export AWS_SECRET_ACCESS_KEY="yourawskey"
Conf.yaml is used by flintrock to configure the cluster properties. One can choose the OS, number of slaves ecc.
Once installed all the dependencies, in order to run the jar one needs to login via ssh to the master slave. Then run a script like this:

spark-submit --master spark://ip- --driver-memory 25G --executors-memory 28G --executor-core 8 --class "Main" --deploy-mode client shrs.jar
-f file_path.csv -t tag_listc.txt --image-path "https://image.ibb.co/kYdbKT/IMG_20180725_194058_490.jpg" -c minibatch -i 20 -b 10000 -m spark://ip-:7077

driver-memory, executors-memory and executor core are parameters of master and slaves machines. Spark default uses only 1 GB on each machine.

[1]=https://www.eecs.tufts.edu/~dsculley/papers/fastkmeans.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.idea		.idea
dataset		dataset
deploy		deploy
project		project
python		python
src/main		src/main
.gitignore		.gitignore
LICENCE.md		LICENCE.md
OPTIONS.md		OPTIONS.md
README.md		README.md
build.sbt		build.sbt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scalable Hashtag Recommender System

Deploy on Amazon aws via Flintrock

About

Releases

Packages

Contributors 3

Languages

License

Rhuax/Scalable-Hashtag-Recommender-System

Folders and files

Latest commit

History

Repository files navigation

Scalable Hashtag Recommender System

Deploy on Amazon aws via Flintrock

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages