
feat: Model Discovery #75

Merged: 30 commits from feat/model-discovery into main on Aug 23, 2024
Conversation

@MatKuhr (Member) commented on Aug 18, 2024

Context

Closes AI/gen-ai-hub-sdk-js-backlog#57

API Design

Variant 1:

client.chatCompletion(OpenAiModels.GPT_4o, prompt);
client.chatCompletion({ name: 'gpt-4o', type: 'chat', version: '0613'}, prompt);

Autocompletion is available for name and type.

Variant 2:

client.chatCompletion(OpenAiModels.GPT_4o(), prompt);
client.chatCompletion(OpenAiModels.GPT_4o('0613'), prompt);

Autocompletion is available for the version number.

Variant 3:

client.chatCompletion('gpt-4o', prompt);
client.chatCompletion({ name: 'gpt-4o', version: '0613' }, prompt);

Autocompletion is available for the name.

Decision: We choose variant 3, as it is the most JS-like style. We can still offer 1 or 2 in addition in the future, if we see a need for it.
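
For illustration, a minimal TypeScript sketch of how the variant 3 signature could keep autocompletion for known model names while still accepting arbitrary strings. All names here (KnownModelName, ChatModel, the model list) are hypothetical, not the merged API:

type KnownModelName = 'gpt-4o' | 'gpt-35-turbo';

// `string & {}` keeps arbitrary names assignable while preserving
// IDE autocompletion for the known literals.
type ModelName = KnownModelName | (string & {});

interface ChatModel {
  name: ModelName;
  version?: string;
}

function chatCompletion(model: ModelName | ChatModel, prompt: string): void {
  // Normalize the shorthand string form, defaulting to the latest version.
  const llm =
    typeof model === 'string' ? { name: model, version: 'latest' } : model;
  console.log(`Calling ${llm.name} (${llm.version ?? 'latest'}): ${prompt}`);
}

chatCompletion('gpt-4o', 'Hello');                            // name autocompletes
chatCompletion({ name: 'gpt-4o', version: '0613' }, 'Hello'); // explicit version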

@MatKuhr marked this pull request as ready for review on August 19, 2024, 13:18
@tomfrenken (Member) commented on Aug 20, 2024

When we talked about the model_params in the refinement, I had an idea regarding your static variant.

What if instead of

client.chatCompletion(OpenAiModels.GPT_4o(), prompt);
client.chatCompletion(OpenAiModels.GPT_4o('0613'), prompt);

You instantiate the models first, and then use them to create a config with proper autocomplete, e.g.:

const gptModel = OpenAiModels.GPT_4o('0613')
const config = gptModel.createConfig(<model_params_with_autocomplete>)
client.chatCompletion(config, prompt);

or, depending on the implementation:

client.chatCompletion(
  OpenAiModels
    .GPT_4o('0613')
    .createConfig(<model_params_with_autocomplete>),
  prompt);
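
For context, a rough TypeScript sketch of what this builder idea could look like; OpenAiModels, GPT_4o, and createConfig are sketched here as assumptions, not the actual implementation:

interface ModelConfig {
  name: string;
  version: string;
  params: Record<string, unknown>;
}

const OpenAiModels = {
  // Each model factory fixes the name and captures the version...
  GPT_4o: (version = 'latest') => ({
    // ...and createConfig attaches the (autocompletable) model params.
    createConfig: (params: Record<string, unknown> = {}): ModelConfig => ({
      name: 'gpt-4o',
      version,
      params
    })
  })
};

const config = OpenAiModels.GPT_4o('0613').createConfig({ temperature: 0.7 });
// client.chatCompletion(config, prompt);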

@tomfrenken (Member) left a comment:

Overall this looks like a great improvement. Generally, I would like to stick to async/await instead of .then().catch(), and to keep function calls short by destructuring APIs. Instead of:

WayTooLongApiCallFromSomeApi.theOneFunctionIActuallyNeed() <--- I get Java PTSD flashbacks
const { theOneFunctionIActuallyNeed } = WayTooLongApiCallFromSomeApi <--- the destructure variant

Additionally, if there is a way to avoid the many any types, we should try to narrow them as much as we can.

Other than that just some general suggestions for the open TODOs 👍
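
As a small illustration of both style preferences above (hypothetical names, not code from this PR):

const WayTooLongApiCallFromSomeApi = {
  theOneFunctionIActuallyNeed: async (): Promise<number> => 42
};

// Destructure once, then use the short name everywhere.
const { theOneFunctionIActuallyNeed } = WayTooLongApiCallFromSomeApi;

async function run(): Promise<void> {
  try {
    // async/await instead of .then().catch()
    const result = await theOneFunctionIActuallyNeed();
    console.log(result);
  } catch (error) {
    console.error(error);
  }
}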

@@ -28,8 +24,7 @@ describe('GenAiHubClient', () => {
  });

  it('calls chatCompletion with minimum configuration', async () => {
-   const request: GenAiHubCompletionParameters = {
-     deploymentConfiguration,
+   const request = {
@MatKuhr (Member Author) commented:

Follow-up for making the constants usable with orchestration: https://github.tools.sap/AI/gen-ai-hub-sdk-js-backlog/issues/91
Follow-up for adding convenience for model_params: https://github.tools.sap/AI/gen-ai-hub-sdk-js-backlog/issues/89#issuecomment-7440288

data: OpenAiChatCompletionParameters,
deploymentResolver?: DeploymentResolver,
@MatKuhr (Member Author) commented:

Follow-up BLI to improve the deploymentResolver: https://github.tools.sap/AI/gen-ai-hub-sdk-js-backlog/issues/92

}
const llm =
typeof model === 'string' ? { name: model, version: 'latest' } : model;
const deployment = await resolveDeployment({
@MatKuhr (Member Author) commented:

If the resolver is a function, it is currently ignored. See https://github.tools.sap/AI/gen-ai-hub-sdk-js-backlog/issues/92
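
A hypothetical sketch of how a function resolver could be honored; the Deployment and DeploymentResolver shapes below are assumptions for illustration, not the SDK's actual types:

interface Deployment {
  id: string;
  url: string;
}

type DeploymentResolver = Deployment | (() => Promise<Deployment>);

async function toDeployment(resolver: DeploymentResolver): Promise<Deployment> {
  // Invoke the resolver when it is a function; otherwise use the value as-is.
  return typeof resolver === 'function' ? resolver() : resolver;
}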

@@ -1,17 +1,10 @@
export * from './client/index.js';
@MatKuhr (Member Author) commented:

Is this fine?

@marikaner (Contributor) replied:

As long as no checks fail, yes. I think the API check is not in place yet; once it is, this might cause problems, but I'm not sure. I'd suggest fixing it then, if needed.

marikaner previously approved these changes on Aug 23, 2024

@marikaner (Contributor) left a comment:

LGTM, I left a few minor comments and answers to the questions you asked.

tomfrenken previously approved these changes on Aug 23, 2024

@tomfrenken (Member) left a comment:

Unintentionally dismissed Marika's LGTM when I resolved one of her comments 😬.

LGTM

    apiVersion
  },
  data,
- requestConfig
+ mergeRequestConfig(requestConfig)
@marikaner (Contributor) commented:
[q] Aren't we calling merge twice? Once here and then again in executeRequest?

    'content-type': 'application/json'
  },
  params: { 'api-version': apiVersion },
  ...requestConfig
@marikaner (Contributor) commented:

Here the requestConfig will entirely replace the headers and params keys.

@MatKuhr (Member Author) replied:

Yes, this is intended. My thinking was that the OpenAI client should only mix in whatever is specific to the OpenAI client.
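
To make the trade-off concrete, a small example of the shallow-merge semantics of object spread (values are illustrative):

const base = {
  headers: { 'content-type': 'application/json' },
  params: { 'api-version': '2024-02-01' }
};
const requestConfig = { headers: { authorization: 'Bearer <token>' } };

const merged = { ...base, ...requestConfig };
// merged.headers is now only { authorization: 'Bearer <token>' }:
// the 'content-type' entry is gone, while merged.params is untouched.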

@tomfrenken (Member) left a comment:

LGTM

@MatKuhr merged commit 6d4b700 into main on Aug 23, 2024. 10 checks passed.
@MatKuhr deleted the feat/model-discovery branch on August 23, 2024, 09:19.