feat(google-genai): Support Gemini system instructions #7235

Merged

Conversation

chrisnathan20
Contributor

@chrisnathan20 chrisnathan20 commented Nov 20, 2024

Fixes #5069

Currently for Google Gen AI, a system message (which may only be the first element in the list of messages) is prepended to the following human message in order to steer the model's behavior. This handling is applied to every model, even though some newer Gen AI models support passing system messages directly via the systemInstruction field, per https://ai.google.dev/gemini-api/docs/system-instructions?lang=node

This PR adds support for models that accept direct system instructions, handling system messages via the systemInstruction field when the model supports it (unless overridden via convertSystemMessageToHumanContent). The changes in this PR are modeled after the google-common changes that added system instruction support (#5089).

High Level Outline of Changes and Additions:

  1. Added an optional convertSystemMessageToHumanContent flag that overrides the model-based decision on whether to use the new system message logic. This also matches the Generative AI package on Python.
  2. Added a computeUseSystemInstruction method to determine whether the model supports system instructions. The lists of supported and unsupported models are taken from google-common, with deprecated (PaLM) models removed.
  3. Added a new optional boolean parameter to convertBaseMessagesToContent, defaulting to false. Setting it to true triggers the new handling of system messages, in which a system message is treated as an independent message rather than prepended to the following user message, while the existing restrictions are maintained.
  4. Prior to sending the prompt for response generation, if the prompt's first message is an independent system message, it is removed from the prompt and set as the model's system instruction. Note that an independent system message can only be the first message in the prompt, and can only exist when the new handling in convertBaseMessagesToContent is triggered, which happens only for models that support system instructions.
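
The handling described above can be sketched roughly as follows. This is a hypothetical, simplified illustration, not the actual @langchain/google-genai implementation: the Message and GeminiRequest shapes and the convertMessages helper are invented here, standing in for convertBaseMessagesToContent and its new boolean parameter.

```typescript
// Illustrative sketch only -- names and shapes are NOT the real
// @langchain/google-genai API.

type Role = "system" | "user" | "model";

interface Message {
  role: Role;
  content: string;
}

interface GeminiRequest {
  // Populated only when the model supports direct system instructions.
  systemInstruction?: string;
  contents: { role: "user" | "model"; text: string }[];
}

function convertMessages(
  messages: Message[],
  useSystemInstruction: boolean
): GeminiRequest {
  // Existing restriction: a system message may only appear first.
  messages.forEach((m, i) => {
    if (m.role === "system" && i !== 0) {
      throw new Error("System message should be the first one");
    }
  });

  const [first, ...rest] = messages;
  if (first?.role !== "system") {
    return {
      contents: messages.map((m) => ({
        role: m.role as "user" | "model",
        text: m.content,
      })),
    };
  }

  if (useSystemInstruction) {
    // New handling: strip the system message from the prompt and carry
    // it separately as the model's system instruction.
    return {
      systemInstruction: first.content,
      contents: rest.map((m) => ({
        role: m.role as "user" | "model",
        text: m.content,
      })),
    };
  }

  // Legacy handling: prepend the system text to the following human
  // message (this sketch assumes a user message follows, as the PR
  // description requires).
  const [next, ...tail] = rest;
  return {
    contents: [
      { role: "user", text: `${first.content}\n${next.content}` },
      ...tail.map((m) => ({
        role: m.role as "user" | "model",
        text: m.content,
      })),
    ],
  };
}
```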

Test Cases created:

  1. Given: Input has single system message followed by one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: the system message is removed from the prompt and passed via the systemInstruction field instead; actualPrompt contains only one message, the user message, under the role of user

  2. Given: Input has single system message followed by one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: the system message is not removed from the prompt; actualPrompt contains only the system message prepended to the user message, under the role of user

  3. Given: Input has a system message that is not the first message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  4. Given: Input has a system message that is not the first message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  5. Given: Input has multiple system messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  6. Given: Input has multiple system messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: convertBaseMessagesToContent should raise the error - "System message should be the first one"

  7. Given: Input has no system message and one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: actualPrompt would only have the user message under the role of user

  8. Given: Input has no system message and one user message
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: actualPrompt would only have the user message under the role of user

  9. Given: Input has no system message and multiple user messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to true
    Then: actualPrompt contains all of the user messages as separate messages, in their original order

  10. Given: Input has no system message and multiple user messages
    When: convertBaseMessagesToContent is invoked with 3rd parameter set to false
    Then: actualPrompt contains all of the user messages as separate messages, in their original order
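
The capability check from item 2 of the outline (computeUseSystemInstruction) together with the convertSystemMessageToHumanContent override might look roughly like the following. This is an illustrative sketch: the function signature and the model-name pattern are assumptions for this example, while the real supported/unsupported lists come from google-common.

```typescript
// Hypothetical sketch of the model-capability check; the signature and
// the gemini-1.5 pattern are assumptions, not the real implementation.
function computeUseSystemInstruction(
  model: string,
  convertSystemMessageToHumanContent?: boolean
): boolean {
  // An explicit user setting overrides model detection: requesting
  // conversion to human content means NOT using systemInstruction.
  if (convertSystemMessageToHumanContent !== undefined) {
    return !convertSystemMessageToHumanContent;
  }
  // Illustrative model matching, e.g. gemini-1.5-* models accept the
  // systemInstruction field.
  return /^gemini-1\.5/.test(model);
}
```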

Credits:
Issue selection - @shannon-ab
Implementation - @chrisnathan20 @garychen2002
Testing - @shannon-ab @martinl498


vercel bot commented Nov 20, 2024

The latest updates on your projects (Vercel for Git):

Name                  Status                           Updated (UTC)
langchainjs-docs      ✅ Ready                         Dec 3, 2024 8:24pm
langchainjs-api-refs  ⬜️ Ignored (skipped deployment)  Dec 3, 2024 8:24pm

@chrisnathan20 chrisnathan20 marked this pull request as ready for review November 21, 2024 18:56
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. auto:improvement Medium size change to existing code to handle new use-cases labels Nov 21, 2024
@chrisnathan20
Contributor Author

Hi @jacoblee93 and @afirstenberg - this is our PR to support system instructions for capable Generative AI models based on the implementation plan shared under the issue. If there are any points of concern or improvements, please do not hesitate to let us know. Thank you and have a great rest of your day!

Collaborator

@jacoblee93 jacoblee93 left a comment

Looks good, apologies for the delay in review!

@dosubot dosubot bot added the lgtm PRs that are ready to be merged as-is label Dec 3, 2024
@jacoblee93 jacoblee93 changed the title google-genai [minor]: Support Gemini system messages feat(google-genai): Support Gemini system instructions Dec 3, 2024
@chrisnathan20
Contributor Author

chrisnathan20 commented Dec 3, 2024

Thanks for the review @jacoblee93! I also want to let you know that I pushed a new commit to fix formatting issues: I ran yarn lint but missed yarn format, which was my mistake. Should I re-request a review from you?

@jacoblee93
Collaborator

Nope all good, about to pull and try it out now

@jacoblee93 jacoblee93 merged commit 8eadded into langchain-ai:main Dec 3, 2024
25 checks passed
@jacoblee93
Collaborator

Thank you!

@chrisnathan20
Contributor Author

@jacoblee93 Thanks for the review!

syntaxsec pushed a commit to aks-456/langchainjs that referenced this pull request Dec 13, 2024
…7235)

Co-authored-by: Gary Chen <thegary.chen@mail.utoronto.ca>
Co-authored-by: martinl498 <martinloo498@gmail.com>
Co-authored-by: Shannon Budiman <shannon.budiman@mail.utoronto.ca>
Co-authored-by: Jacob Lee <jacoblee93@gmail.com>