Support for connection to llamacpp server #14
Conversation
This PR will allow connecting to a llama.cpp server, and fixes #13. This way llama does not have to be started for each user prompt. It includes the completions patch from the issue mentioned above, provided by gsuuon.

I'm new to this, so feel free to fix any issues in this PR.
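For reference, the llama.cpp server exposes an HTTP completion endpoint. Here is a minimal sketch of building a request body in Lua (field names are taken from the example payload that appears commented out in the diff below; the host, port, and endpoint path are assumptions based on llama.cpp's server example):

```lua
-- Minimal sketch of a llama.cpp server completion request body.
-- `vim.json.encode` is Neovim's built-in JSON encoder.
local body = vim.json.encode({
  prompt = "Building a website can be done in 10 simple steps:",
  n_predict = 128,
})
-- POST `body` to e.g. http://localhost:8080/completion on the running server.
```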
Almost there, just need to clean up some of the stuff pulled from the huggingface example. We should replace the llamacpp provider with this (so accept the changes and rename llamacpp_server.lua to llamacpp.lua)
Do you want to add some basic setup steps to the README for this? No worries if not!
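If setup steps do land in the README, a sketch could look like the following (the server invocation follows llama.cpp's server example; the require path and prompt shape are guesses based on the `llamacpp.lua` rename above and the `options`/`params` tables in this diff, not a verified plugin API):

```lua
-- 1. Start the llama.cpp server in a shell (flags per llama.cpp's server example):
--      ./server -m path/to/model --port 8080
-- 2. Reference the provider from a prompt. The require path below is a
--    guess based on the llamacpp_server.lua -> llamacpp.lua rename above.
local llamacpp = require('llm.providers.llamacpp')

local prompt = {
  provider = llamacpp,
  options = {}, -- provider options, e.g. where the server is listening
  params = {},  -- request params forwarded to the completion endpoint
}
```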
```lua
headers = {
  -- Authorization = 'Bearer ' .. util.env_memo('HUGGINGFACE_API_KEY'),
  ["Content-Type"] = "application/json",
  -- ['data'] = '{"prompt": "Building a website can be done in 10 simple steps:","n_predict": 128}',
},
```
Headers aren't necessary here
Suggested change:
```diff
- headers = {
-   -- Authorization = 'Bearer ' .. util.env_memo('HUGGINGFACE_API_KEY'),
-   ["Content-Type"] = "application/json",
-   -- ['data'] = '{"prompt": "Building a website can be done in 10 simple steps:","n_predict": 128}',
- },
```
ok done
```lua
---@param params? any Additional params for request
---@param options? { model?: string }
function M.request_completion(handlers, params, options)
  local model = (options or {}).model or "bigscience/bloom"
```
We can get rid of all the huggingface stuff here
```lua
local model = (options or {}).model or "bigscience/bloom"
```
done
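For context, a minimal sketch of what the cleaned-up function head might look like once the huggingface default is dropped (an assumed form, not the committed code; handler names other than `on_finish` are also assumptions):

```lua
local M = {}

-- Assumed cleaned-up form: the llama.cpp server serves whatever model it
-- was started with, so no per-request model name or "bigscience/bloom"
-- default is needed.
---@param handlers table Stream handlers (on_finish appears in this diff)
---@param params? any Additional params for the request body
function M.request_completion(handlers, params)
  -- build the request body from params and stream it to the server
end

return M
```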
```lua
-- TODO handle non-streaming calls
return curl.stream({
  -- url = 'https://api-inference.huggingface.co/models/', --.. model,
```
```lua
-- url = 'https://api-inference.huggingface.co/models/', --.. model,
```
done
```lua
end

if data.generation_settings ~= nil then -- last message
  handlers.on_finish('', "stop")
```
Can just be a naked call now that it's handled in provider.lua
Suggested change:
```diff
- handlers.on_finish('', "stop")
+ handlers.on_finish()
```
done
```lua
options = {
  -- model = 'bigscience/bloom'
},
params = {
  return_full_text = false,
},
```
These are huggingface options / params, not needed here
Suggested change:
```diff
- options = {
-   -- model = 'bigscience/bloom'
- },
- params = {
-   return_full_text = false,
- },
```
done
By the llama docs we should use: `<instr><sys>You are an assistant, answer all</sys> What is the age of the universe?</instr>`. But `<sys>You are an assistant</sys><instr>What is the age of the universe?</instr>` somehow gives better results. Leave it to the user.
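For clarity, the two layouts side by side as plain strings (tags copied from the comment above; they approximate, rather than reproduce, llama2's official `[INST]`/`<<SYS>>` chat markers):

```lua
-- Format suggested by the llama docs (per the comment above):
local docs_format = "<instr><sys>You are an assistant, answer all</sys> What is the age of the universe?</instr>"
-- Alternative layout that reportedly gave better results:
local alt_format = "<sys>You are an assistant</sys><instr>What is the age of the universe?</instr>"
```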
Ok, I also pushed some changes to the default query. By default the user can select text and ask a question about it. The best default query format is not clear; in the end, from my tests, I got the best results by using the second format above.
Demo of how this works currently (using my custom assome.mp4)
Ok, let's get this merged. I'll replace the llamacpp provider and clean up. Thanks for contributing!