CMPT732 Yelp Analysis Project
- Make
- The scripts need to run on our gateway to access Spark/Cassandra cluster
- Step 1: Create Cassandra tables: make create_schema -f Makefile.production
- Step 2: Copy data from AWS S3 to Hadoop HDFS: make prepare_data -f Makefile.production
- Step 3: Load Yelp data from Hadoop HDFS to Cassandra tables: make load_data -f Makefile.production
- Step 4: Do all analysis: make run_analyze -f Makefile.production
- Step 5: Store results to Postgres DB: make store_data -f Makefile.production