This project is a Streamlit application that renames images based on the objects identified within them using the Ollama Llama 3.2 Vision model. The application allows users to specify the number of objects to identify and renames the images accordingly.
- Identify and name objects in images using Ollama Llama 3.2 Vision.
- Rename images based on identified objects.
- User-friendly interface with Streamlit.
- Customizable input and output directories.
- Adjustable number of objects to identify.
- Clone the repository:
git clone <repository-url> cd image-dataset-builder
- Install the required dependencies:
pip install -r requirements.txt
- Download the Ollama installer from the official website.
- Run the installer and follow the on-screen instructions.
- Download the Ollama installer from the official website.
- Open the downloaded
.dmg
file and drag the Ollama app to your Applications folder.
Install with one command: curl -fsSL https://ollama.com/install.sh | sh
-
Run the Streamlit application:
streamlit run app1.py
-
Open your web browser and go to
http://localhost:8501
. -
Enter the input directory containing the images you want to process.
-
Enter the output directory where the renamed images will be saved.
-
Use the slider to select the number of objects to identify in each image.
-
Click the "Elabora Immagini" button to start processing the images.
- Input Directory:
./input_images
- Output Directory:
./output_images
- Number of objects to identify:
3
The application will process each image in the input directory, identify the specified number of objects, and save the renamed images in the output directory.
This project is licensed under the MIT License. See the LICENSE file for details.