Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Split NVCF generator into completion chat #696

Merged
merged 7 commits into from
May 27, 2024
Merged

Conversation

leondz
Copy link
Collaborator

@leondz leondz commented May 24, 2024

NVCF models support at least two interfaces - Completion and Chat styles. They also sometimes require custom headers. This PR splits the NVCF class into two, one for each interface, providing separate methods for building the payload and parsing the output under each interface style. It also adds code to allow external configuration of the payload fields that the NVCF endpoint is queried with.

@leondz leondz added the generators Interfaces with LLMs label May 24, 2024
@leondz leondz requested a review from jmartin-tech May 24, 2024 13:44
@leondz
Copy link
Collaborator Author

leondz commented May 24, 2024

ps. i know, i know, sorry about the _config indexing jeffrey

Copy link
Collaborator

@jmartin-tech jmartin-tech left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor tweak ideas offered.

Stray thought as I looked at this, are these services really just OpenAICompatible like NIM? ( This is likely a futures effort thought. )

garak/generators/nvcf.py Outdated Show resolved Hide resolved
garak/generators/nvcf.py Outdated Show resolved Hide resolved
@leondz
Copy link
Collaborator Author

leondz commented May 24, 2024

Stray thought as I looked at this, are these services really just OpenAICompatible like NIM? ( This is likely a futures effort thought. )

One would hope so! I like where this is going.

leondz and others added 3 commits May 24, 2024 19:58
Co-authored-by: Jeffrey Martin <jemartin@nvidia.com>
Signed-off-by: Leon Derczynski <leonderczynski@gmail.com>
@leondz leondz merged commit 59e4150 into main May 27, 2024
6 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators May 27, 2024
@leondz leondz deleted the feature/nvcf_completion_chat branch May 29, 2024 07:14
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
generators Interfaces with LLMs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants