
Add custom models #725

Closed
FerLuisxd opened this issue May 26, 2023 · 9 comments

Comments

@FerLuisxd

Feature request

Since new LLMs are released practically every day, it would be good to be able to search for models directly from Hugging Face, or to let us manually download and set up new models.

Motivation

It would allow for more experimentation and comparison between models.

Your contribution

Testing

@manyoso
Collaborator

manyoso commented May 26, 2023

You can easily download and manually install models right now. All you need to do is place the model in the models download directory and make sure the model name begins with 'ggml-' and ends with '.bin'. And, of course, the model has to be compatible with our version of llama.cpp.
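The naming convention described above can be sketched as a small check (a minimal illustration; the function name and sample filenames are ours, not part of GPT4All):

```python
def is_valid_model_name(filename: str) -> bool:
    # Per the comment above: the file must begin with 'ggml-' and
    # end with '.bin' to be picked up from the models directory.
    return filename.startswith("ggml-") and filename.endswith(".bin")

# Example checks (filenames are illustrative):
print(is_valid_model_name("ggml-gpt4all-j-v1.3-groovy.bin"))  # matches the convention
print(is_valid_model_name("my-model.bin"))                    # missing the 'ggml-' prefix
```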

@manyoso manyoso closed this as completed May 26, 2023
@twilsonco

I know this is closed, but it sounds like the suggestion had as much to do with the easy finding and acquisition of models as with the technical task of running them in GPT4All. I, too, think that would be a great feature.

@cosmic-snow
Collaborator

> I know this is closed, but it sounds like the suggestion had as much to do with the easy finding and acquisition of models as with the technical task of running them in GPT4All. I, too, think that would be a great feature.

Searching for/finding compatible models isn't so simple that it could be automated. But maybe have a look at this newer issue: #1241

@tailkinker

This no longer works.

@cebtenzzre
Member

cebtenzzre commented Oct 25, 2023

> This no longer works.

As long as you are downloading .gguf files from HF, it should work fine. That's the file format used by GPT4All v2.5.0+.

@tailkinker

Figured that out eventually, but now it's spelled out. The latest update invalidated every model I had...

@IzzyHibbert

> > This no longer works.
>
> As long as you are downloading .gguf files from HF, it should work fine. That's the file format used by GPT4All v2.5.0+.

No, it doesn't :-(
You can try this one, for instance:

galatolo/cerbero-7b-gguf

With the gguf placed in the model folder, it fails on 2.5.1 on both Windows and Mac M1, even with the smallest Q4 version.

[screenshot of the error]

@manyoso
Collaborator

manyoso commented Oct 27, 2023

> > > This no longer works.
> >
> > As long as you are downloading .gguf files from HF, it should work fine. That's the file format used by GPT4All v2.5.0+.
>
> No, it doesn't :-(
> You can try this one, for instance:
>
> galatolo/cerbero-7b-gguf
>
> With the gguf placed in the model folder, it fails on 2.5.1 on both Windows and Mac M1, even with the smallest Q4 version.
>
> [screenshot of the error]

Which one? You posted a link that has 3 different files in 3 different quantizations... can you specify which one you are talking about?

@cebtenzzre
Member

> No, it doesn't :-(
> You can try this one, for instance:
>
> galatolo/cerbero-7b-gguf
>
> With the gguf placed in the model folder, it fails on 2.5.1 on both Windows and Mac M1, even with the smallest Q4 version.

This is GGUF version 3, which was recently released to support big-endian systems (ggerganov/llama.cpp#3552). We do not support that version yet. Please open a new issue.
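For background: a GGUF file's format version can be read directly from its header, which starts with the magic bytes `GGUF` followed by a little-endian 32-bit version number. A minimal sketch (the function name is ours) for checking which version a downloaded file uses, e.g. to tell whether it is the version-3 format mentioned above:

```python
import struct

def gguf_version(path: str) -> int:
    """Return the GGUF format version of a file, or raise if it isn't GGUF."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError("not a GGUF file")
        # The version is a little-endian unsigned 32-bit integer
        # immediately following the 4-byte magic.
        (version,) = struct.unpack("<I", f.read(4))
    return version
```

Running this on a model file would tell you whether it is a v3 file that the GPT4All release discussed here cannot load.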


7 participants