Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix arg length error, add multimodal examples, update video prompt #634

Merged
merged 1 commit into from
Nov 16, 2024

Conversation

vishal-dharm
Copy link
Collaborator

Description of the change

This PR fixes the "argument list too long" error in the text_generation.sh file when using large base64 encoded images by using temporary files to store the encoded data and the JSON payload. This resolves issues encountered when running the text_gen_multimodal_one_image_prompt and text_gen_multimodal_one_image_prompt_streaming examples.

This PR also makes the following additions and updates:

  • Adds two new multimodal vision examples to text_generation.sh:

    • text_gen_multimodal_two_image_prompt: Demonstrates using two images in a single prompt.
    • text_gen_multimodal_one_image_bounding_box_prompt: Shows how to generate bounding boxes for objects.
  • Updates the text_gen_multimodal_video_prompt example: The prompt is now more comprehensive and does a better job demonstrating Gemini 1.5's multimodal capabilities.

Motivation

These examples are being added so they can be included in the revamped vision documentation.

Type of change

Feature request

Checklist

  • I have performed a self-review of my code.
  • I have added detailed comments to my code where applicable.
  • I have verified that my change does not break existing code.
  • My PR is based on the latest changes of the main branch (if unsure, please run git pull --rebase upstream main).
  • I am familiar with the Google Style Guide for the language I have coded in.
  • I have read through the Contributing Guide and signed the Contributor License Agreement.

@vishal-dharm vishal-dharm added the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024
@github-actions github-actions bot added the component:python sdk Issue/PR related to Python SDK label Nov 16, 2024
@MarkDaoust
Copy link
Collaborator

Thanks!

@MarkDaoust MarkDaoust merged commit a04fcd1 into main Nov 16, 2024
12 checks passed
@github-actions github-actions bot removed the status:awaiting review PR awaiting review from a maintainer label Nov 16, 2024
@vishal-dharm vishal-dharm deleted the rest-vision-examples branch November 16, 2024 18:52
Annhiluc pushed a commit to Annhiluc/generative-ai-python that referenced this pull request Nov 22, 2024
MarkDaoust pushed a commit that referenced this pull request Nov 26, 2024
* fix: Pass along model_version in GenerateContentResponse.

* Revert autogenerated doc files from 94eb16e.

* Fix 'argument list too long' error and add couple vision examples (#634)

* Update google-ai-generativelanguage version in requirements.

* Format updated generation_types and test using black.

---------

Co-authored-by: Vishal Dharmadhikari <61256217+vishal-dharm@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:python sdk Issue/PR related to Python SDK
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants