This program summarizes any text you can screenshot with Microsoft's Snipping Tool, with the help of Google's Gemini. I made this because I was bored, so don't mind the long name; I'm bad at naming things. In hindsight, I probably should've asked an AI chatbot for a name.
- Windows OS (well, duh)
- Microsoft's Snipping Tool (should come preinstalled)
- Python 3.12.1 (the version I'm using)
- UB Mannheim's Tesseract
- Google Gemini API Key
- Clone this project onto your local machine
- Open PowerShell/Command Prompt inside the newly cloned folder
- Create a virtual environment by running `python -m venv venv`
- Activate the virtual environment by running `venv\Scripts\activate`
- Install all of the requirements by running `pip install -r requirements.txt`
- Get UB Mannheim's Tesseract installer
- Install it (preferably in `C:\Program Files\`)
- Add Tesseract to your PATH in Environment Variables (search for a guide on editing Windows environment variables if you don't know how to do it)
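- Set the GEMINI_API_KEY environment variable to your Gemini API key. In PowerShell, run `$Env:GEMINI_API_KEY = "(YOUR GEMINI API KEY GOES HERE)"`, e.g., `$Env:GEMINI_API_KEY = "qwertyuiop-asdf_fghjkl"`. This syntax only works in PowerShell; in Command Prompt, run `set GEMINI_API_KEY=(YOUR GEMINI API KEY GOES HERE)` instead. Either way, the variable only lasts for the current terminal session.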
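
If you want to double-check the setup before running anything, here is a minimal sketch of a preflight check. It is not part of this project; the function name `find_setup_problems` is made up for illustration. It only looks for a `tesseract` executable on your PATH and a non-empty `GEMINI_API_KEY` in the current session.

```python
import os
import shutil


def find_setup_problems(env=None, which=shutil.which):
    """Return a list of setup problems; an empty list means both checks passed.

    `env` and `which` are parameters only so the checks can be tried out
    without touching the real environment (hypothetical helper, not project code).
    """
    env = os.environ if env is None else env
    problems = []

    # shutil.which returns None when the executable is not on PATH.
    if which("tesseract") is None:
        problems.append("tesseract was not found on PATH")

    # The key must exist and must not be blank.
    if not env.get("GEMINI_API_KEY", "").strip():
        problems.append("GEMINI_API_KEY is not set in this session")

    return problems


if __name__ == "__main__":
    found = find_setup_problems()
    for problem in found:
        print("Problem:", problem)
    if not found:
        print("Setup looks fine.")
```

Run it from the same terminal where you set the API key, since the variable does not carry over to other sessions.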
- Press Win+Shift+S to take a screenshot using Snipping Tool, then run the program
- The output is then printed to the console
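
For the curious, the flow above can be sketched roughly like this. This is an illustration under assumptions, not the actual code in this repo: it assumes the screenshot is read from the clipboard with Pillow's `ImageGrab.grabclipboard()`, that OCR goes through the `pytesseract` package, and that Gemini is called via the `google-generativeai` package. The function names, the prompt wording, and the model name `gemini-1.5-flash` are all placeholders.

```python
import os


def grab_clipboard_image():
    """Return the image on the clipboard (assumes Pillow is installed)."""
    from PIL import ImageGrab  # third-party: Pillow

    image = ImageGrab.grabclipboard()
    # grabclipboard() gives None when there is no image,
    # or a list of file paths when files were copied instead.
    if image is None or isinstance(image, list):
        raise RuntimeError("No screenshot found on the clipboard")
    return image


def extract_text(image):
    """OCR the image with Tesseract (assumes pytesseract is installed)."""
    import pytesseract  # third-party wrapper around the tesseract executable

    return pytesseract.image_to_string(image).strip()


def ask_gemini(text):
    """Ask Gemini for a summary (assumes google-generativeai is installed)."""
    import google.generativeai as genai

    genai.configure(api_key=os.environ["GEMINI_API_KEY"])
    model = genai.GenerativeModel("gemini-1.5-flash")  # model name is an assumption
    response = model.generate_content("Summarize the following text:\n\n" + text)
    return response.text


def summarize_screenshot(grab=grab_clipboard_image, ocr=extract_text, summarize=ask_gemini):
    """Screenshot -> OCR text -> summary. Each step can be swapped out."""
    text = ocr(grab())
    if not text:
        raise ValueError("OCR found no text in the screenshot")
    return summarize(text)
```

Calling `print(summarize_screenshot())` right after taking a snip would run the three steps with the real clipboard, Tesseract, and Gemini. The steps are passed in as parameters so each one can be replaced with a stand-in when trying things out offline.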