
Support for the local models? #34

Open
yukiarimo opened this issue Apr 5, 2024 · 2 comments

Comments

@yukiarimo

I want to use this locally with Ollama and SD (or anything else). Is it possible?

@sqrt10pi

sqrt10pi commented Apr 7, 2024

Something like this could work for Ollama. I don't have SD running and I'm less familiar with its API, so I commented that out along with the epub stuff, but here's a start:

https://gist.github.com/sqrt10pi/dfca547ee263328cafb55947458a0838

The main annoying part is that Ollama doesn't support max_tokens, so instead I time out the request after 5 minutes and try again (that's what my try_until_successful function does). Depending on the model you're running in Ollama, you may be able to use the num_predict model option, but I only have dolphin-mixtral installed, which doesn't support it.
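For reference, here's a minimal sketch of that timeout-and-retry idea (not the actual gist code; the Ollama URL is the default local port, and the model name, token cap, and retry count are placeholders):

```python
import requests

OLLAMA_URL = "http://localhost:11434/api/generate"  # default Ollama endpoint

def try_until_successful(prompt: str, model: str = "dolphin-mixtral",
                         timeout_s: int = 300, max_retries: int = 3) -> str:
    """Call Ollama, aborting and retrying if a response takes too long."""
    payload = {
        "model": model,
        "prompt": prompt,
        "stream": False,
        # num_predict caps generated tokens on models that honor it;
        # it is ignored otherwise, so passing it is harmless.
        "options": {"num_predict": 512},
    }
    for attempt in range(max_retries):
        try:
            resp = requests.post(OLLAMA_URL, json=payload, timeout=timeout_s)
            resp.raise_for_status()
            return resp.json()["response"]
        except requests.exceptions.Timeout:
            print(f"Timed out after {timeout_s}s, retrying "
                  f"({attempt + 1}/{max_retries})")
    raise RuntimeError("Ollama request kept timing out")
```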

Here's an example output using dolphin-mixtral: https://gist.github.com/sqrt10pi/20de2db367a78b63cd7b3719ad19937a (definitely not perfect)

@yukiarimo
Author

> ollama doesn't support max_tokens

Then I think the KoboldCPP API is a good alternative to Ollama.
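For anyone trying that route, here's a rough sketch against KoboldCPP's KoboldAI-compatible generate endpoint, which does take a hard output-length cap (the port is KoboldCPP's default; the context size and sampling values are assumptions to adjust for your model):

```python
import requests

KOBOLD_URL = "http://localhost:5001/api/v1/generate"  # KoboldCPP default port

def generate(prompt: str, max_tokens: int = 512) -> str:
    """Generate text with a hard cap on output length via max_length."""
    payload = {
        "prompt": prompt,
        "max_length": max_tokens,    # the max_tokens equivalent
        "max_context_length": 4096,  # assumption: match your model's context
        "temperature": 0.7,          # assumption: tune to taste
    }
    resp = requests.post(KOBOLD_URL, json=payload, timeout=300)
    resp.raise_for_status()
    # Response shape: {"results": [{"text": "..."}]}
    return resp.json()["results"][0]["text"]
```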
