Twitter map search engine

About

Allows users to search tweets and get ranked results and plots them on a world map with the approximate geo tagged location

Build an end-to-end search engine to scrape tweets from twitter and index them using Lucene and map reduce.

Built frontend using javascript and HTML and is connected to the backend(made with flask) with REST API queries.

Made with a team colaboration for Web Internet retrival graduate course

More info in CS242 IR Project Part B Report.pdf

Getting Started

Instructions to Deploy the System

A. Instructions to build Lucene Index

➢ Since simpleJson package is missing in the existing JRE, use the following code

before running the executable files:

export JSON_JAVA=/opt/home/cs242-w22/ #(path to simple json)

export CLASSPATH=$CLASSPATH:$JSON_JAVA/json-simple-1.1.1.jar:.

➢ We have created two separate .sh files:

Indexer.sh - Executable file for indexer, takes index directory and data file as command line arguments.
Searcher.sh - Executable file for searcher, takes query term as a command line argument.

B. Instructions to build Hadoop Index

Run MRJob.py on the scraper data to get Indexed File
Run Searcher_MR.py with the Indexed file and data File as parameters3) Enter Search query on command Line

C. Instructions to Run flask server

To run the backend flask server use the following command: flask run -h localhost -p 5002

To run the frontend run the html file ‘Search_Engine.html’ in any browser.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
pics		pics
Bipin.java		Bipin.java
CS242 IR Project Part B Report.pdf		CS242 IR Project Part B Report.pdf
Crawler.py		Crawler.py
Crawler.sh		Crawler.sh
Indexer.class		Indexer.class
Indexer.java		Indexer.java
Indexer.sh		Indexer.sh
Inversed_Index_Output		Inversed_Index_Output
MRJob.py		MRJob.py
MongoSearcher.py		MongoSearcher.py
README.md		README.md
Search_Engine.html		Search_Engine.html
Searcher.class		Searcher.class
Searcher.java		Searcher.java
Searcher.sh		Searcher.sh
Searcher_MR.py		Searcher_MR.py
app.py		app.py
data_line.json		data_line.json
data_procressing.ipynb		data_procressing.ipynb
data_tweets.json		data_tweets.json
data_tweets2.json		data_tweets2.json
data_tweets3.json		data_tweets3.json
list_cities.json		list_cities.json
sample_mongo_output.json		sample_mongo_output.json
scraper.py		scraper.py
test_data.json		test_data.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter map search engine

About

Getting Started

Output

About

Releases

Packages

Languages

bipindr123/twitter-map-search-engine

Folders and files

Latest commit

History

Repository files navigation

Twitter map search engine

About

Getting Started

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages