Split NVCF generator into completion chat #696

leondz · 2024-05-24T13:38:38Z

NVCF models support at least two interfaces - Completion and Chat styles. They also sometimes require custom headers. This PR splits the NVCF class into two, one for each interface, providing separate methods for building the payload and parsing the output under each interface style. It also adds code to allow external configuration of the payload fields that the NVCF endpoint is queried with.

…o take extra params from config

leondz · 2024-05-24T13:45:18Z

ps. i know, i know, sorry about the _config indexing jeffrey

jmartin-tech

Minor tweak ideas offered.

Stray thought as I looked at this, are these services really just OpenAICompatible like NIM? ( This is likely a futures effort thought. )

garak/generators/nvcf.py

leondz · 2024-05-24T17:56:57Z

Stray thought as I looked at this, are these services really just OpenAICompatible like NIM? ( This is likely a futures effort thought. )

One would hope so! I like where this is going.

Co-authored-by: Jeffrey Martin <jemartin@nvidia.com> Signed-off-by: Leon Derczynski <leonderczynski@gmail.com>

leondz added 2 commits May 24, 2024 15:27

allow nvcf to work with both chat and completion models; allow nvcf t…

fedd430

…o take extra params from config

also log nvcf payload in case of failed request

35e7b60

leondz added the generators Interfaces with LLMs label May 24, 2024

leondz requested a review from jmartin-tech May 24, 2024 13:44

leondz added 2 commits May 24, 2024 15:50

Merge branch 'main' into feature/nvcf_completion_chat

0c3d40d

NVCF banner should differentiate between classes

e80e057

jmartin-tech reviewed May 24, 2024

View reviewed changes

garak/generators/nvcf.py Outdated Show resolved Hide resolved

garak/generators/nvcf.py Outdated Show resolved Hide resolved

jmartin-tech reviewed May 24, 2024

View reviewed changes

garak/generators/nvcf.py Outdated Show resolved Hide resolved

leondz and others added 3 commits May 24, 2024 19:58

fix env var descr

db64329

include generator name in banner

dbd9d5c

Co-authored-by: Jeffrey Martin <jemartin@nvidia.com> Signed-off-by: Leon Derczynski <leonderczynski@gmail.com>

bubble list production up to _extract_text_output

9dd48b8

leondz merged commit 59e4150 into main May 27, 2024
6 checks passed

github-actions bot locked and limited conversation to collaborators May 27, 2024

leondz deleted the feature/nvcf_completion_chat branch May 29, 2024 07:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split NVCF generator into completion chat #696

Split NVCF generator into completion chat #696

leondz commented May 24, 2024

leondz commented May 24, 2024

jmartin-tech left a comment

leondz commented May 24, 2024

Split NVCF generator into completion chat #696

Split NVCF generator into completion chat #696

Conversation

leondz commented May 24, 2024

leondz commented May 24, 2024

jmartin-tech left a comment

Choose a reason for hiding this comment

leondz commented May 24, 2024