Skip to content

This web app takes query as an input, and return text from top 5 websites. I had used Google Custom Search API and Scrapping Bee to build it.

Notifications You must be signed in to change notification settings

theanantchoubey/scrapperWebApp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to Scrapper Web App!

This web app takes query as an input and in turn returns data from top 5 websites. This a web scrapping project.

Technologies Used: Node.JS, Express.JS, etc.
APIs Used: Google Custom Search API, Scrapping Bee API

How to Set up the project Locally!

STEP 1: clone the repository

STEP 2: Install all the npm packages

npm install

STEP 3: Change Enviornment Variables with yours
a. Create a .env file
b. Store the following variables

CUSTOM_SEARCH_API_KEY
Above API Key you can generate on - console.cloud.google.com

SEARCH_ENGINE_ID
You can get on - console.cloud.google.com

SCRAPPING_BEE_API_KEY
Above API Key you can generate on - ScrappingBee

STEP 4: Start the server

nodemon app.js

STEP 5: Congratulations! Now go to localhost:3000 and checkout!


Problems I faced while creating the webapp

  1. Removing unwanted Javascript from the fetched data
  2. Extracting only the required text from the response so generated

Future Ideas & Optimizations for Implementation in the project

  1. Error Handling - Currently it does not render data from PDF files
  2. Time Optimization - It takes good amount of time to get all the data together
  3. UI - Building an understandable and representable UI

Please let me know If you have any feedback for me on LinkedIn - theanantchoubey

About

This web app takes query as an input, and return text from top 5 websites. I had used Google Custom Search API and Scrapping Bee to build it.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published