
[Request] LLM 适配 (LLM Adaptation) #151

Closed
5 of 9 tasks
arvinxx opened this issue Sep 6, 2023 · 35 comments · Fixed by #1173 or #1293
Assignees
Labels
🌠 Feature Request New feature or request | 特性与建议 Model Provider 模型服务商 released

Comments

@arvinxx
Contributor

arvinxx commented Sep 6, 2023

A standalone LLM settings tab, providing a rich set of large-model options to the assistant roles built on top of it.

Checklist:

Full RFC at: #737

@arvinxx arvinxx converted this from a draft issue Sep 6, 2023
@arvinxx arvinxx self-assigned this Sep 6, 2023
@canisminor1990 canisminor1990 changed the title from LLM适配 to [Request] LLM适配 Sep 6, 2023
@canisminor1990 canisminor1990 added the 🌠 Feature Request New feature or request | 特性与建议 label Sep 6, 2023
@arvinxx arvinxx moved this from Todo - Chat to In Progress in Lobe Chat Routine Sep 9, 2023
@arvinxx arvinxx changed the title from [Request] LLM适配 to [Request] LLM 适配 Sep 9, 2023
@arvinxx arvinxx removed the status in Lobe Chat Routine Oct 6, 2023
@arvinxx arvinxx moved this to Roadmap - Chat 1.0 in Lobe Chat Routine Oct 6, 2023
@ddwinhzy
I'd like to see support for calling OpenAI's fine-tuned models.

@arvinxx
Contributor Author

arvinxx commented Oct 11, 2023

I'd like to see support for calling OpenAI's fine-tuned models.

Yes, via a custom model name.
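For reference, a fine-tuned OpenAI model is invoked through the standard chat completions endpoint by passing its full `ft:` identifier as the model name. A minimal sketch with the `openai` SDK (the model ID below is a hypothetical placeholder, not a real deployment):

```ts
import OpenAI from "openai";

const client = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

// Fine-tuned models are addressed by their full ID, in the form
// `ft:<base-model>:<org>::<job-id>`. This one is a placeholder.
const completion = await client.chat.completions.create({
  model: "ft:gpt-3.5-turbo-0613:my-org::abc123",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);
```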

@lobehubbot lobehubbot added the Inactive No response in 30 days | 超过 30 天未活跃 label Nov 11, 2023
@arvinxx arvinxx moved this from Roadmap - Chat 1.x to In Progress in Lobe Chat Routine Nov 27, 2023
@gavinliu
Is there a concrete plan for supporting local LLMs?


@jyboy

jyboy commented Dec 13, 2023

Google's Gemini Pro API is also available now.
Please see the official tweet.

@arvinxx
Contributor Author

arvinxx commented Dec 14, 2023

Is there a concrete plan for supporting local LLMs?

We plan to start on it in January.


@hazelnutcloud
Any plans to add support for Mistral?

@arvinxx
Contributor Author

arvinxx commented Dec 18, 2023

yes

@WayneShao
I called Azure's gpt-4-vision model with the 2023-12-01-preview API version; I can get a reply successfully, but the reply is cut off right after one sentence.
[screenshot: truncated reply]
Console output:
[screenshot: console output]


@arvinxx
Contributor Author

arvinxx commented Jan 3, 2024

I called Azure's gpt-4-vision model with the 2023-12-01-preview API version; I can get a reply successfully, but the reply is cut off right after one sentence.

[screenshot: truncated reply]

Console output:

[screenshot: console output]

This is most likely because gpt-4-vision's built-in default for max_tokens is too small.
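The usual workaround is to pass `max_tokens` explicitly on each request. A minimal sketch, assuming the `openai` SDK's `AzureOpenAI` client (the resource endpoint and deployment name are placeholders):

```ts
import { AzureOpenAI } from "openai";

const client = new AzureOpenAI({
  endpoint: "https://my-resource.openai.azure.com", // placeholder resource
  apiKey: process.env.AZURE_OPENAI_API_KEY,
  apiVersion: "2023-12-01-preview",
  deployment: "gpt-4-vision", // placeholder deployment name
});

const completion = await client.chat.completions.create({
  model: "gpt-4-vision",
  // Without an explicit value, gpt-4-vision falls back to a very small
  // max_tokens, which is what truncates replies after a sentence or so.
  max_tokens: 2048,
  messages: [
    {
      role: "user",
      content: [
        { type: "text", text: "Describe this image." },
        { type: "image_url", image_url: { url: "https://example.com/img.png" } },
      ],
    },
  ],
});
```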


@Met-Du

Met-Du commented Jan 27, 2024

Are there any plans for Gemini Pro? Thank you

@cocobeach

A standalone LLM settings tab, providing a rich set of large-model options to the assistant roles built on top of it.

Checklist:

Full RFC at: #737

Hi Team!
There is a very simple way of running models locally with ollama (https://github.com/ollama/ollama); wrap it with https://github.com/BerriAI/litellm and use https://github.com/BerriAI/liteLLM-proxy, and it shouldn't be too difficult to make it work with a lot of open-source LLMs.
Best Regards
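To make that suggestion concrete: because the LiteLLM proxy exposes an OpenAI-compatible endpoint in front of a local Ollama model, any OpenAI client should work once its base URL is swapped. A rough sketch (the port, dummy key, and model tag are assumptions about a default local setup):

```ts
// Assumes the proxy was started with something like:
//   litellm --model ollama/mistral
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "http://localhost:4000", // LiteLLM proxy; port is an assumption
  apiKey: "local-dummy-key",        // a local proxy may not check the key
});

const reply = await client.chat.completions.create({
  model: "ollama/mistral", // the route configured when starting the proxy
  messages: [{ role: "user", content: "Why is the sky blue?" }],
});

console.log(reply.choices[0].message.content);
```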

@cocobeach

Are there any plans to adapt to ollama?

Ollama wrapped in LiteLLM should be a piece of cake; I am going to try it.

@arvinxx
Contributor Author

arvinxx commented Jan 30, 2024

@cocobeach Thanks for your introduction! We will add ollama support also~ 😁

@j0ole

j0ole commented Feb 1, 2024

Are there any plans to add a key pool, with per-key weights for dynamically choosing which key to call?

@arvinxx
Copy link
Contributor Author

arvinxx commented Feb 2, 2024

Are there any plans to add a key pool, with per-key weights for dynamically choosing which key to call?

No, there is no plan for key pooling.

@arvinxx arvinxx added Model Provider 模型服务商 and removed Inactive No response in 30 days | 超过 30 天未活跃 labels Feb 2, 2024
@arvinxx arvinxx linked a pull request Feb 2, 2024 that will close this issue
@github-project-automation github-project-automation bot moved this from In Progress to Done in Lobe Chat Routine Feb 5, 2024
@lobehub lobehub deleted a comment from lobehubbot Feb 5, 2024
@arvinxx arvinxx reopened this Feb 5, 2024
@zhangheli
Would it be possible to directly plug into the chat model support in langchain?


@arvinxx
Contributor Author

arvinxx commented Feb 5, 2024

@zhangheli Tried it; it doesn't work: #737 (reply in thread)


@cocobeach

@cocobeach Thanks for your introduction! We will add ollama support also~ 😁

That's great; it would let Mac users run the interface with local models easily.

Also, for the best inference (though that is NVIDIA-only, and therefore mostly Windows workstations), vLLM is quite unique in the way it manages GPU memory. For instance, it is the only setup that lets me actually take advantage of my dual RTX A4000s, and I can run Mixtral on it quite smoothly and with excellent results.
Now that GPT-4 got "lazy", I think there is a good opening in the market for open-source models, if they are managed properly.

I really like your platform, but I don't understand why the total number of tokens allowed for a whole chat thread is limited to the model's context window. Meaning, if I am using gpt-3.5-turbo-16k, I can chat up to a 16k thread size, and then it gives me an error?
Can I configure that differently?
Best Regards

@arvinxx
Contributor Author

arvinxx commented Feb 5, 2024

@cocobeach You can set a limit on the current topic's context; we don't set a limit by default.
[screenshot: context limit setting]
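In other words, the thread itself can grow indefinitely; the setting only caps how many recent messages are attached to each request. A rough illustration of the idea (not Lobe Chat's actual code):

```ts
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Keep only the most recent `limit` messages; `undefined` means no cap,
// which mirrors the default behaviour described above.
const attachHistory = (history: ChatMessage[], limit?: number): ChatMessage[] =>
  limit === undefined ? history : history.slice(-limit);

// Example: attach at most the 8 most recent messages to the next request.
const thread: ChatMessage[] = [
  { role: "user", content: "earlier question" },
  { role: "assistant", content: "earlier answer" },
  { role: "user", content: "latest question" },
];
const payload = attachHistory(thread, 8);
```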

@cocobeach
Thanks a lot!!

@ifsheldon

Please add Moonshot support! See https://platform.moonshot.cn
The APIs are a compatible subset of OpenAI's.
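Since the API is OpenAI-compatible, a provider integration can reuse the stock SDK with only the base URL swapped. A minimal sketch (the base URL and model name are taken from Moonshot's public docs and should be double-checked):

```ts
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.moonshot.cn/v1",
  apiKey: process.env.MOONSHOT_API_KEY,
});

const reply = await client.chat.completions.create({
  model: "moonshot-v1-8k",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(reply.choices[0].message.content);
```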

@sjy
Contributor

sjy commented Feb 7, 2024

@cocobeach Thanks for your introduction! We will add ollama support also~ 😁

Here is a WIP PR for the Ollama provider: #1265 @arvinxx

@lobehubbot
Member

@arvinxx

This issue is closed. If you have any questions, you can comment and reply.

@lobehubbot
Member

🎉 This issue has been resolved in version 0.127.0 🎉

The release is available on:

Your semantic-release bot 📦🚀

skylinemusiccds referenced this issue in skylinemusiccds/BrainWeaver-development May 12, 2024