-
Notifications
You must be signed in to change notification settings - Fork 349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Split NVCF generator into completion chat #696
Conversation
…o take extra params from config
ps. i know, i know, sorry about the _config indexing jeffrey |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor tweak ideas offered.
Stray thought as I looked at this, are these services really just OpenAICompatible
like NIM
? ( This is likely a futures effort thought. )
One would hope so! I like where this is going. |
Co-authored-by: Jeffrey Martin <jemartin@nvidia.com> Signed-off-by: Leon Derczynski <leonderczynski@gmail.com>
NVCF models support at least two interfaces - Completion and Chat styles. They also sometimes require custom headers. This PR splits the NVCF class into two, one for each interface, providing separate methods for building the payload and parsing the output under each interface style. It also adds code to allow external configuration of the payload fields that the NVCF endpoint is queried with.