
Local embedding model cannot be used #12464

Closed
opsvr202 opened this issue Jan 8, 2025 · 6 comments


opsvr202 commented Jan 8, 2025

Self Checks

  • This is only for bug reports; if you would like to ask a question, please head to Discussions.
  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
  • [FOR CHINESE USERS] Please be sure to submit issues in English, otherwise they will be closed. Thank you! :)
  • Please do not modify this template :) and fill in all the required fields.

Dify version

0.15.0

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

Purely intranet environment, the steps are as follows:

1. Open the Knowledge section;
2. Create a knowledge base;
3. Choose either general mode or parent-child mode;
4. Upload documents;
5. An error is reported during vector processing.

The error content is as follows:

HTTPSConnectionPool(host='openaipublic.blob.core.windows.net', port=443): Max retries exceeded with url: /gpt-2/encodings/main/vocab.bpe (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7fa5469b7500>: Failed to resolve 'openaipublic.blob.core.windows.net' ([Errno -2] Name or service not known)"))
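The URL in the traceback is the GPT-2 BPE vocabulary that the tiktoken tokenizer library downloads on first use; assuming that is the code path Dify hits during indexing, the same failure can be reproduced on any offline machine with a one-liner:

    $ python -c "import tiktoken; tiktoken.get_encoding('gpt2')"
    # fails with the same NameResolutionError, because tiktoken fetches
    # https://openaipublic.blob.core.windows.net/gpt-2/encodings/main/vocab.bpe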

✔️ Expected Behavior

In an intranet environment without Internet access, a locally deployed model can be used to embed the knowledge base.

❌ Actual Behavior

During knowledge base embedding in the intranet environment, the service tries to reach the public Internet, which makes a fully privatized deployment unusable.

dosubot bot added 🐞 bug Something isn't working 👻 feat:rag Embedding related issue, like qdrant, weaviate, milvus, vector database. labels Jan 8, 2025
Copy link

dosubot bot commented Jan 8, 2025

I found a similar unsolved discussion regarding a NameResolutionError when trying to resolve 'openaipublic.blob.core.windows.net' in a self-hosted Dify environment [1]. This issue seems related to network configuration and accessing external resources in an intranet-only setup.

To address the problem of using local embedding models in a self-hosted Dify environment without internet access, you can configure a local embedding model using either Xinference or LocalAI. Here are the steps for LocalAI:

  1. Clone the LocalAI repository:

    $ git clone https://github.com/go-skynet/LocalAI
    $ cd LocalAI/examples/langchain-chroma
  2. Download example LLM and Embedding models:

    $ wget https://huggingface.co/skeskinen/ggml/resolve/main/all-MiniLM-L6-v2/ggml-model-q4_0.bin -O models/bert
    $ wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j
  3. Configure the .env file:

    $ mv .env.example .env
  4. Start LocalAI:

    $ docker-compose up -d --build
    $ docker logs -f langchain-chroma-api-1
  5. Integrate the models into Dify:

    • Go to Settings > Model Providers > LocalAI and fill in the details for each model:
      • Model 1: ggml-gpt4all-j
        • Model Type: Text Generation
        • Model Name: gpt-3.5-turbo
        • Server URL: http://127.0.0.1:8080
      • Model 2: all-MiniLM-L6-v2
        • Model Type: Embeddings
        • Model Name: text-embedding-ada-002
        • Server URL: http://127.0.0.1:8080

Ensure that the Dify container can access the LocalAI endpoint by using the appropriate host IP address. This setup should allow you to use local models for embedding without needing internet access [2].
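Since LocalAI exposes an OpenAI-compatible API, a quick way to confirm the endpoint works before wiring it into Dify is to hit the embeddings route directly (a sketch; it assumes LocalAI is listening on port 8080 and uses the model name mapping configured above):

    $ curl http://127.0.0.1:8080/v1/embeddings \
        -H "Content-Type: application/json" \
        -d '{"model": "text-embedding-ada-002", "input": "connectivity test"}'
    # a JSON response containing an "embedding" vector confirms the local model
    # works; repeat the request from inside the Dify api container (using the
    # host IP instead of 127.0.0.1) to rule out container networking issues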

opsvr202 commented Jan 8, 2025

The model provider is connected via Xinference, which includes embedding models. A similar problem occurred when Dify first shipped support for calling locally deployed models, and it was subsequently resolved. I wonder whether this occurrence is related to the newly released parent-child mode support in the knowledge base.

@crazywoola
Member

Duplicate; I have seen this error message many times.

HTTPSConnectionPool(host='openaipublic.blob.core.windows.net', port=443): Max retries exceeded with url: /gpt-2/encodings/main/vocab.bpe (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7fa5469b7500>: Failed to resolve 'openaipublic.blob.core.windows.net' ([Errno -2] Name or service not known)"))

This is a general network issue: you seem to be calling the OpenAI service, which you might not intend to. Please check the system model provider settings; Q&A and parent-child (P-C) modes use the system models for reasoning.
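For a fully offline deployment there is also a stopgap on the tokenizer side: tiktoken honors the TIKTOKEN_CACHE_DIR environment variable, so the vocabulary files can be fetched once on an Internet-connected machine and copied to the intranet host (a workaround sketch under that assumption; the cache path is arbitrary):

    # on a machine WITH Internet access
    $ export TIKTOKEN_CACHE_DIR=/tmp/tiktoken_cache
    $ python -c "import tiktoken; tiktoken.get_encoding('gpt2'); tiktoken.get_encoding('cl100k_base')"
    # copy /tmp/tiktoken_cache to the offline host, mount it into the Dify api
    # and worker containers, and set TIKTOKEN_CACHE_DIR to the mounted path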


opsvr202 commented Jan 8, 2025

[screenshot: model provider settings]
The correct model provider is selected. v0.14.2 did not have this problem and worked normally; it appeared after upgrading to v0.15.0.

crazywoola reopened this Jan 8, 2025
crazywoola removed the 👻 feat:rag Embedding related issue, like qdrant, weaviate, milvus, vector database. label Jan 8, 2025
crazywoola assigned laipz8200 and unassigned JohnJyong Jan 8, 2025
@crazywoola
Member

Seems related (#12416) @laipz8200

@crazywoola
Member

#12471
