Information retrieval project in BGU course
How to run:
- Create a cluster and upload in home/dataproc corpus_dat.py, 205557564_gcp.ipynb, inverted_index_gcp.py.
- Create all the indexes to the bucket u want
- Create an Instance using command "bash start1.sh" (change necessary variables)
- Using "bash start2.sh" move necessary files to the instance and then it logs u to the instance
- After logging to instance run "bash get_indexes.sh"
- Run "python3 search_frontend.py"