GraphInsight: A Graph-based Approach to Summarize multi media content

The project is a Python-based tool that uses various natural language processing (NLP) and computer vision techniques to extract and summarize textual content from various sources such as images, PDFs, and websites. It makes use of several libraries such as NLTK, docx, bs4, cv2, pytesseract, and tika to preprocess the input data and generate a concise and relevant summary.

Installation

Clone the repository git clone https://github.com/4bdul4ziz/GraphInsight.git
Install the required packages using pip -nltk -docx -bs4 -cv2 -pytesseract -tika

Note: pytesseract requires Tesseract OCR to be installed in the system. Please follow the installation instructions for your specific operating system.

Usage

Run the main.py file python main.py
Enter the path to the input file, make sure to have the files in the same directory as the main.py file.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
data		data
inputs		inputs
output_screenshots		output_screenshots
summarizer		summarizer
.gitignore		.gitignore
main.py		main.py
readme.md		readme.md
stopwords.py		stopwords.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphInsight: A Graph-based Approach to Summarize multi media content

Installation

Usage

About

Releases

Packages

Contributors 2

Languages

4bdul4ziz/GraphInsight

Folders and files

Latest commit

History

Repository files navigation

GraphInsight: A Graph-based Approach to Summarize multi media content

Installation

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages