
feat: openAI explicit value for maxToken and temperature #659

Merged
merged 2 commits into k8sgpt-ai:main on Sep 18, 2023

Conversation

panpan0000
Contributor

@panpan0000 panpan0000 commented Sep 15, 2023

Because when k8sgpt talks with vLLM, the default MaxToken is 16, which is very small.
Given that most models support 2048 tokens (like Llama 1, etc.), this sets a safer explicit value.

Closes # NA

📑 Description

✅ Checks

  • My pull request adheres to the code style of this project
  • My code requires changes to the documentation
  • I have updated the documentation as required
  • All the tests have passed

ℹ Additional Information

When k8sgpt talks with a local (on-premise) LLM, for example a model served by vLLM (see the blog https://k8sgpt.ai/blog/post-6/), a problem occurs because the request from k8sgpt carries NO explicit MaxToken value.

From the vLLM log (screenshot omitted) you will see temperature=0.7 and max_tokens=16.

Such a small max_tokens causes the following issue: the answers from the LLM are truncated (screenshot omitted).

So this fix makes the values explicit, just like what was done in https://github.com/k8sgpt-ai/k8sgpt/blob/main/pkg/ai/cohere.go#L66.
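For illustration, here is a minimal sketch of what setting the values explicitly might look like, assuming the go-openai client (github.com/sashabaranov/go-openai) pointed at an OpenAI-compatible vLLM endpoint; the helper name, endpoint URL, and numbers are assumptions, not the PR's exact code.

```go
package main

import (
	"context"
	"fmt"

	openai "github.com/sashabaranov/go-openai"
)

// getCompletion sends a chat request with MaxTokens and Temperature set
// explicitly, so an OpenAI-compatible backend (e.g. vLLM) does not fall back
// to its own defaults such as max_tokens=16.
func getCompletion(ctx context.Context, client *openai.Client, prompt string) (string, error) {
	resp, err := client.CreateChatCompletion(ctx, openai.ChatCompletionRequest{
		Model: openai.GPT3Dot5Turbo,
		Messages: []openai.ChatCompletionMessage{
			{Role: openai.ChatMessageRoleUser, Content: prompt},
		},
		MaxTokens:   2048, // assumed safe value for ~2k-context models
		Temperature: 0.7,  // matches the OpenAI-side default discussed above
	})
	if err != nil {
		return "", err
	}
	return resp.Choices[0].Message.Content, nil
}

func main() {
	cfg := openai.DefaultConfig("not-needed-for-local-vllm") // hypothetical token
	cfg.BaseURL = "http://localhost:8000/v1"                 // hypothetical vLLM endpoint
	client := openai.NewClientWithConfig(cfg)

	answer, err := getCompletion(context.Background(), client, "Why is my Pod in CrashLoopBackOff?")
	fmt.Println(answer, err)
}
```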

With this fix, the new log from vLLM looks better (screenshots omitted).

Regression Test

When switching back to the online OpenAI service, the result is still good (screenshot omitted).

@panpan0000 panpan0000 requested review from a team as code owners September 15, 2023 05:46
Member

@AlexsJones AlexsJones left a comment


  • Please can you add those as constants at the top of the file?
  • PR name needs to be semantic e.g. "feat: OpenAI: explicit value for MaxToken and Temp "

Nice suggestion!

@panpan0000 panpan0000 changed the title OpenAI: explicit value for MaxToken and Temp feat: OpenAI- explicit value for MaxToken and Temp Sep 15, 2023
@panpan0000
Contributor Author

  • Please can you add those as constants at the top of the file?
  • PR name needs to be semantic e.g. "feat: OpenAI: explicit value for MaxToken and Temp "

Nice suggestion!

Done. Thanks!
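For reference, a rough sketch of the refactor the review asked for, with the literals lifted into named constants at the top of the file; the identifier names and values here are assumptions modeled on the existing cohere backend, not necessarily the merged code.

```go
package ai

import openai "github.com/sashabaranov/go-openai"

// Explicit defaults, named once at the top of the file instead of being
// scattered as literals in each request (identifier names are assumptions).
const (
	maxToken    = 2048
	temperature = 0.7
)

// newChatRequest builds a chat request that always carries the explicit values.
func newChatRequest(prompt string) openai.ChatCompletionRequest {
	return openai.ChatCompletionRequest{
		Model: openai.GPT3Dot5Turbo,
		Messages: []openai.ChatCompletionMessage{
			{Role: openai.ChatMessageRoleUser, Content: prompt},
		},
		MaxTokens:   maxToken,
		Temperature: temperature,
	}
}
```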

@panpan0000 panpan0000 changed the title feat: OpenAI- explicit value for MaxToken and Temp feat: OpenAI: explicit value for MaxToken and Temp Sep 15, 2023
@arbreezy
Member

Why do we need to specify the Temperature value as a constant?

Is this worth doing across all AI backends?

@panpan0000 panpan0000 changed the title feat: OpenAI: explicit value for MaxToken and Temp feat: OpenAI explicit value for MaxToken and Temp Sep 15, 2023
@panpan0000 panpan0000 changed the title feat: OpenAI explicit value for MaxToken and Temp feat: openAI explicit value for maxToken and temperature Sep 15, 2023
@panpan0000
Contributor Author

panpan0000 commented Sep 15, 2023

Why do we need to specify the Temperature value as a constant?

Is this worth doing across all AI backends?

Hi @arbreezy:
(1) I learned from the previous code for the cohere backend: https://github.com/k8sgpt-ai/k8sgpt/blob/main/pkg/ai/cohere.go#L66
(2) there is no default temperature value in the go-openai library
(3) so the actual values depend on the OpenAI API server; OpenAI makes it 0.7 by default, but if we build an on-premise LLM server, it may vary.
(4) to be honest, I think k8sgpt is a more serious use case than normal chat, so the temperature should not be too high or it may lead to hallucination, but I don't want to change the 0.7 value for now.

@arbreezy
Member

@panpan0000 makes sense.
Two additional things to consider:
a) is it worth making the temperature value configurable by users?
b) I think it makes sense to have temperature explicitly defined in all our AI backends, e.g. Azure OpenAI

Because when k8sgpt talks with vLLM, the default MaxToken is 16,
which is very small.
Given that most models support 2048 tokens (like Llama 1, etc.),
this sets a safer explicit value.

Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
@panpan0000
Contributor Author

@panpan0000 makes sense. Two additional things to consider: a) is it worth making the temperature value configurable by users? b) I think it makes sense to have temperature explicitly defined in all our AI backends, e.g. Azure OpenAI

Sure, @arbreezy. Already updated with a second commit: (1) added it for all backends, (2) added it as a command-line flag.

Can you please review again? Thanks a lot!
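As a rough illustration of the second point, a temperature flag could be wired up with cobra along these lines; the flag name, command, and viper key are assumptions for the sketch, not necessarily what the commit actually does.

```go
package cmd

import (
	"fmt"

	"github.com/spf13/cobra"
	"github.com/spf13/viper"
)

var temperature float32

// authCmd is a hypothetical command that stores the chosen temperature so
// every AI backend can read the same setting later.
var authCmd = &cobra.Command{
	Use:   "auth",
	Short: "Configure the AI backend",
	Run: func(cmd *cobra.Command, args []string) {
		viper.Set("temperature", temperature)
		fmt.Printf("temperature set to %.2f\n", temperature)
	},
}

func init() {
	// 0.7 kept as the default so behaviour is unchanged unless overridden.
	authCmd.Flags().Float32VarP(&temperature, "temperature", "t", 0.7,
		"sampling temperature passed to the AI backend")
}
```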

@AlexsJones AlexsJones merged commit f55946d into k8sgpt-ai:main Sep 18, 2023
7 checks passed
@arbreezy
Member

@panpan0000 @AlexsJones ah, I think we also need to set it in server mode, right?

@AlexsJones
Member

@panpan0000 @AlexsJones ah, I think we also need to set it in server mode, right?

Feels like an oversight that needs fixing, yes.
