WikiBot: Summarizing Wikipedia Articles

Overview

As you know there are a lot of articles on Wikipedia and they are lengthy, which in turn takes a long time to read. To make it easy for the user to read the relevant context of these articles, we introduce WikiBot. WikiBot leverages Wikipedia's vast information to deliver relevant summaries of the articles quickly. This project uses sophisticated web scraping and natural language processing (NLP) techniques for data extraction and summarization, enhancing how users interact with information. We also made a content-based recommendation mechanism that suggests related articles.

Features

Data Collection

Scraped and cleaned over 50,000 Wikipedia articles to build a comprehensive dataset for model training.

Summarization Models

Implemented various summarization models, including TextRank, LDA, NMF, T5, and BART.
Achieved a high ROUGE score of 0.88 with our LDA model, demonstrating its effectiveness in distilling essential information.

Recommending Articles

We have enhanced WikiBot with a sophisticated recommendation engine that utilizes Latent Dirichlet Allocation (LDA) for topic modeling. This feature identifies key topics within user inquiries and suggests related articles based on the top words from those topics.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Article_Analysis.ipynb		Article_Analysis.ipynb
Cleaning.ipynb		Cleaning.ipynb
Group30_Final_Presentation.pdf		Group30_Final_Presentation.pdf
Group30_Final_Presentation.pptx		Group30_Final_Presentation.pptx
Group30_Project_Proposal.docx		Group30_Project_Proposal.docx
Group30_Project_Proposal.pdf		Group30_Project_Proposal.pdf
Group30_Project_Report.docx		Group30_Project_Report.docx
Group30_Project_Report.pdf		Group30_Project_Report.pdf
LDA-Summarizer.ipynb		LDA-Summarizer.ipynb
Multi-Class.ipynb		Multi-Class.ipynb
NLP_BART.ipynb		NLP_BART.ipynb
NLP_Recommendation_System.ipynb		NLP_Recommendation_System.ipynb
Preprocessing.ipynb		Preprocessing.ipynb
README.md		README.md
Recommendation.ipynb		Recommendation.ipynb
Scrape-Level5.ipynb		Scrape-Level5.ipynb
Scrape.ipynb		Scrape.ipynb
Text_Classification.ipynb		Text_Classification.ipynb
Wiki_Recommender.ipynb		Wiki_Recommender.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WikiBot: Summarizing Wikipedia Articles

Overview

Features

Data Collection

Summarization Models

Recommending Articles

About

Releases

Packages

Languages

Ayush-Patel-10/WikiBot-NLP-Project

Folders and files

Latest commit

History

Repository files navigation

WikiBot: Summarizing Wikipedia Articles

Overview

Features

Data Collection

Summarization Models

Recommending Articles

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages