
Allow alternative OpenAI-compatible endpoints #494

Merged: 10 commits merged into Niek:main on Oct 4, 2024
Conversation

betschki (Contributor) commented Sep 12, 2024

Context

This PR introduces functionality that allows users to specify a custom OpenAI-compatible API endpoint in addition to the default OpenAI API. The main goal is to enable the use of alternative endpoints such as self-hosted or third-party services. At stadt.werk, we use this feature to interact with our own self-hosted OpenAI-compatible server.

The enhancement extends the flexibility of the existing interface by letting users switch between OpenAI's official endpoint (the default behaviour) and their own custom endpoint through the UI. Additionally, the PR refines model fetching, which is essential for any non-OpenAI endpoints.
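
As a rough sketch of the resolution order described above (illustrative only; the setting and function names below are hypothetical, not the identifiers used in this PR):

```typescript
// Hypothetical sketch of the endpoint-resolution order described above.
const OPENAI_API_BASE = 'https://api.openai.com'

function getApiBase (settings: { customApiEndpoint?: string }): string {
  // 1. A custom endpoint entered through the UI takes precedence.
  if (settings.customApiEndpoint) return settings.customApiEndpoint
  // 2. Otherwise fall back to the build-time VITE_API_BASE, if set.
  if (import.meta.env.VITE_API_BASE) return import.meta.env.VITE_API_BASE
  // 3. Otherwise use OpenAI's official endpoint (the previous default).
  return OPENAI_API_BASE
}
```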

Compatibility

  • This change is fully backward-compatible with existing OpenAI API usage. If no custom endpoint is specified, the application will default to using OpenAI’s API as before.
  • The enhancements have been tested with both OpenAI’s API and OpenAI-compatible llama.cpp infrastructure.
  • Users who do not require a custom endpoint will not be affected by these changes.

We'd love to see this enhancement merged into chatgpt-web, since we believe it can be a great feature; it has also been requested a few times (here and here, for example).

Looking forward to feedback.

Niek (Owner) commented Sep 12, 2024

Great work! I have a few questions:

  • How does this play together with VITE_API_BASE? I see you still support it; I suppose it's good to keep that in as a way to default to some value other than OpenAI?
  • I tried with the Anthropic API, but I get CORS errors. We might need to add some exceptions for specific popular APIs.
  • I think it makes sense to "test" a newly entered API endpoint by calling /models before storing it.
  • Maybe it makes sense to remove Petals support and recommend https://docs.litellm.ai/docs/simple_proxy instead.

betschki (Contributor, Author) commented:

Hey @Niek!

Thank you for your feedback, appreciate it.

To answer your questions:

  • Yes, we did not want to impact current functionality, so if no custom endpoint is set, VITE_API_BASE will still work. That was important from our end to ensure nothing breaks during a potential update, in case somebody has set the variable and expects it to keep working.
  • We have run into the same issue on our llama.cpp server. From a client perspective, there is not much we can do in an agnostic way, in my eyes. Anthropic has recently added CORS support, though this requires setting an additional header. I'm not quite sure how this can be handled effectively. Hard-coding configurations for popular APIs is an option, though this could quickly become a maintenance nightmare (and would still not guarantee that all APIs work browser-based).
  • Great idea with sending a request to /models before storing an endpoint. I will implement that today and update the PR.
  • We haven't touched Petals, since none of us uses it or has experience with it. Always a fan of simplifying, though, and this could be a great opportunity for that, indeed. I would consider that out of scope for this specific PR, however.

So, I'll implement that call to /models and will wait for your feedback on CORS.
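
For what it's worth, a minimal sketch of that check (hypothetical names; it assumes the endpoint uses OpenAI-style Bearer authentication and that the base URL already includes any /v1 prefix) could look like:

```typescript
// Hypothetical sketch: probe /models before persisting a newly
// entered API base URL. Illustrative only, not the PR's actual code.
async function validateEndpoint (base: string, apiKey: string): Promise<boolean> {
  try {
    const resp = await fetch(`${base.replace(/\/+$/, '')}/models`, {
      headers: { Authorization: `Bearer ${apiKey}` }
    })
    return resp.ok
  } catch {
    // A network or CORS failure also means we should not store the endpoint.
    return false
  }
}

// Usage: only persist the endpoint if the probe succeeds.
// if (await validateEndpoint(candidateUrl, key)) { saveEndpoint(candidateUrl) }
```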

betschki (Contributor, Author) commented Oct 4, 2024

@Niek I have added the check for the /models endpoint before saving the endpoint in the settings.

Looking forward to your feedback regarding CORS.

Niek merged commit ceca482 into Niek:main on Oct 4, 2024
Niek (Owner) commented Oct 4, 2024

Thanks a lot, merged now @betschki!

I tried to get it working with Anthropic, but no luck. The changes needed are:

  • Authentication is done with x-api-key instead of the Authorization header.
  • anthropic-dangerous-direct-browser-access: true needs to be set on all API calls (a small change).
  • The /models endpoint does not exist, which is quite ridiculous if you ask me (see Model List? anthropics/anthropic-sdk-typescript#458).

So for now, this is only available for 100% OpenAI-compatible endpoints like llama.cpp.
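
For reference, a direct browser call to Anthropic with those changes would look roughly like this (a hedged sketch of the differences listed above, not code from this PR; the model name is just an example):

```typescript
// Sketch of the Anthropic-specific differences listed above: x-api-key
// instead of a Bearer token, plus the explicit browser-access opt-in.
async function anthropicChat (apiKey: string, prompt: string) {
  const resp = await fetch('https://api.anthropic.com/v1/messages', {
    method: 'POST',
    headers: {
      'x-api-key': apiKey, // instead of `Authorization: Bearer ...`
      'anthropic-version': '2023-06-01', // Anthropic requires a version header
      'anthropic-dangerous-direct-browser-access': 'true', // CORS opt-in
      'content-type': 'application/json'
    },
    body: JSON.stringify({
      model: 'claude-3-5-sonnet-20240620', // example model id
      max_tokens: 256,
      messages: [{ role: 'user', content: prompt }]
    })
  })
  // Note: there is no /models endpoint to probe first, as mentioned above.
  return resp.json()
}
```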

Niek (Owner) commented Oct 4, 2024

@all-contributors please add @betschki for code, ideas

allcontributors[bot] (Contributor) commented:

@Niek
@betschki already contributed before to code, ideas

akarelas commented Oct 4, 2024

> I tried to get it working with Anthropic, but no luck.

It sounds like an important enough change to implement, though, because Anthropic is better in many use cases; for example, it's better for tech and coding than OpenAI.

Niek (Owner) commented Oct 4, 2024

> I tried to get it working with Anthropic, but no luck.

> It sounds like an important enough change to implement, though, because Anthropic is better in many use cases; for example, it's better for tech and coding than OpenAI.

Agreed, it would be nice if the code could be restructured a bit so all API requests are done through requests.svelte. There we can add some logic depending on the hostname.
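
To illustrate the idea (hypothetical names; requests.svelte does not currently contain this), the hostname-dependent part could be as small as:

```typescript
// Illustrative sketch of per-hostname request shaping in one central
// helper, as suggested above. Names and structure are hypothetical.
function buildHeaders (base: string, apiKey: string): Record<string, string> {
  const host = new URL(base).hostname
  if (host === 'api.anthropic.com') {
    return {
      'x-api-key': apiKey,
      'anthropic-version': '2023-06-01',
      'anthropic-dangerous-direct-browser-access': 'true'
    }
  }
  // Default: OpenAI-compatible endpoints expect a Bearer token.
  return { Authorization: `Bearer ${apiKey}` }
}
```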
