PDFWhisperer

The latest version of the application has been deployed on streamlit cloud, and can be accessed through this link.

Description

PDFWhisperer is a chatbot designed to extract and process information from PDF files, and to enable interactive chat sessions with the content of the PDFs. This project includes components for:

PDF parsing
Managing chat sessions
Implementing a processing chain to facilitate user interactions with PDF content

Purpose: This project is a hands-on learning experience for exploring and mastering Generative AI tools and technologies. This project gives a hands-on experience on building Gen AI applications using retrieval augmented generation (RAG).

Run on local

Install the dependencies using the following command:

pip install -r requirements.txt

Fill the .env file with the required credentials. Sample file:

OPENROUTER_API_KEY=""
COHERE_API_KEY=""
GROQ_API_KEY=""
COOKIES_PASSWORD=""

Run the application using the following command:

streamlit run main.py

Tools and Technologies

PyMuPDF: Used for efficient PDF parsing.
Langchain: Utilized for implementing the chatbot functionality.
Streamlit: Powers the frontend interface.
Groq cloud, Openrouter, Cohere: for accessing LLMs

Future Scope

There are various ways to expand the project, such as:

giving the llm access to internet using function calling
converting the llm chain into an agent, allowing it to make its own decisions
adding the ability to chat with multiple pdfs at once

Thanks for checking out the repository, feel free to connect with me on LinkedIn.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
exploration		exploration
.gitignore		.gitignore
README.md		README.md
chain.py		chain.py
main.py		main.py
model.py		model.py
pdf_parser.py		pdf_parser.py
requirements.txt		requirements.txt
session_manager.py		session_manager.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDFWhisperer

Description

Run on local

Tools and Technologies

Future Scope

About

Languages

chaitanya-basava/PDFWhisperer

Folders and files

Latest commit

History

Repository files navigation

PDFWhisperer

Description

Run on local

Tools and Technologies

Future Scope

About

Topics

Resources

Stars

Watchers

Forks

Languages