Web Query Summarizer

A Web application which generates an extractive summary of single domain multi documents (scraped web pages) based on user web query using web scraping, NLP

To run the project, clone the repository or download and extract the zip file and execute the following command to install packages and dependencies

pip install -r requirements.txt

The Django project can be run by executing the command

python manage.py runserver

Ensure you have the latest chromedriver.exe file in your project folder

Flow of the system

Allow users to enter queries on a web application
Get the results from searching the query on google
From the multiple results, extract all the links on the first page as they are highly relevant to user query
Scrape and clean the data from all these links and store it in a text file
Send the data to NLP based models to generate a summary

User can enter any query based on single domain

Lemmatization, Stop words removal and punctuation are used for preprocessing the text and cosine similarity and the TextRank algorithm is used for the extractive summary generation

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.idea		.idea
scraping		scraping
screenshots		screenshots
search		search
static		static
summarization		summarization
templates		templates
ui		ui
venv		venv
webQuerySummarizer		webQuerySummarizer
README.md		README.md
chromedriver.exe		chromedriver.exe
db.sqlite3		db.sqlite3
debug.log		debug.log
manage.py		manage.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Query Summarizer

About

Releases

Packages

Contributors 3

Languages

vinalbagaria/WebQuerySummarizer

Folders and files

Latest commit

History

Repository files navigation

Web Query Summarizer

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages