A Langchain app developed to chat with Multiple PDFs
This is a Streamlit application that allows users to upload multiple PDF documents and interactively chat about the content of those documents using a conversational AI model. The chatbot is designed to process PDF files, extract text from them, and generate embeddings for the text chunks using Hugging Face models. Users can ask questions about the uploaded documents, and the chatbot will provide responses based on the document content.
-
Clone the repository:
git clone https://github.com/Rajendran2201/CogniChat
-
Navigate to the project directory:
cd CogniChat
-
Install dependencies:
pip install -r requirements.txt
To run the Streamlit application, execute the following command:
streamlit run app.py
This will start the Streamlit server locally, and you can access the application in your web browser by navigating to the provided URL.
- Upload multiple PDF documents.
- Process the uploaded documents to extract text.
- Generate embeddings for the text chunks using Hugging Face models.
- Interact with a conversational AI model to ask questions about the document content.
- View chat history and responses in real-time.
- Streamlit: For building interactive web applications.
- PyPDF2: For reading PDF files and extracting text.
- langchain: For text processing and embedding generation.
- Hugging Face Transformers: For accessing pre-trained language models.
- dotenv: For loading environment variables from a .env file.
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Make your changes and commit them to your branch.
- Push your changes to your fork.
- Submit a pull request with a description of your changes.
This project is licensed under the MIT License - see the LICENSE file for details.
- Inspiration: Alejandro AO