Fix arg length error, add multimodal examples, update video prompt #634

vishal-dharm · 2024-11-16T01:36:55Z

Description of the change

This PR fixes the "argument list too long" error in the text_generation.sh file when using large base64 encoded images by using temporary files to store the encoded data and the JSON payload. This resolves issues encountered when running the text_gen_multimodal_one_image_prompt and text_gen_multimodal_one_image_prompt_streaming examples.

This PR also makes the following additions and updates:

Adds two new multimodal vision examples to text_generation.sh:
- text_gen_multimodal_two_image_prompt: Demonstrates using two images in a single prompt.
- text_gen_multimodal_one_image_bounding_box_prompt: Shows how to generate bounding boxes for objects.
Updates the text_gen_multimodal_video_prompt example: The prompt is now more comprehensive and does a better job demonstrating Gemini 1.5's multimodal capabilities.

Motivation

These examples are being added so they can be included in the revamped vision documentation.

Type of change

Feature request

Checklist

I have performed a self-review of my code.
I have added detailed comments to my code where applicable.
I have verified that my change does not break existing code.
My PR is based on the latest changes of the main branch (if unsure, please run git pull --rebase upstream main).
I am familiar with the Google Style Guide for the language I have coded in.
I have read through the Contributing Guide and signed the Contributor License Agreement.

MarkDaoust · 2024-11-16T14:05:33Z

Thanks!

…ogle-gemini#634)

* fix: Pass along model_version in GenerateContentResponse. * Revert autogenerated doc files from 94eb16e. * Fix 'argument list too long' error and add couple vision examples (#634) * Update google-ai-generativelanguage version in requirements. * Format updated generation_types and test using black. --------- Co-authored-by: Vishal Dharmadhikari <61256217+vishal-dharm@users.noreply.github.com>

Fix 'argument list too long' error and add couple vision examples

4771b84

vishal-dharm added the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024

vishal-dharm requested a review from MarkDaoust November 16, 2024 01:36

github-actions bot added the component:python sdk Issue/PR related to Python SDK label Nov 16, 2024

MarkDaoust approved these changes Nov 16, 2024

View reviewed changes

MarkDaoust merged commit a04fcd1 into main Nov 16, 2024
12 checks passed

github-actions bot removed the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024

vishal-dharm deleted the rest-vision-examples branch November 16, 2024 18:52

Annhiluc pushed a commit to Annhiluc/generative-ai-python that referenced this pull request Nov 22, 2024

Fix 'argument list too long' error and add couple vision examples (go…

ee8bf1d

…ogle-gemini#634)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix arg length error, add multimodal examples, update video prompt #634

Fix arg length error, add multimodal examples, update video prompt #634

vishal-dharm commented Nov 16, 2024

MarkDaoust commented Nov 16, 2024

Fix arg length error, add multimodal examples, update video prompt #634

Fix arg length error, add multimodal examples, update video prompt #634

Conversation

vishal-dharm commented Nov 16, 2024

Description of the change

Motivation

Type of change

Checklist

MarkDaoust commented Nov 16, 2024