Commit

Revert "Request less max_new_tokens on tools and reduce truncate on CMD R+ (#1292)"

This reverts commit c1fc4f9.
nsarrazin committed Jun 18, 2024
1 parent c1fc4f9 commit a547bbf
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion chart/env/prod.yaml
@@ -47,7 +47,7 @@ envVars:
"tools": true,
"parameters": {
"stop": ["<|END_OF_TURN_TOKEN|>"],
- "truncate" : 12288,
+ "truncate" : 28672,
"max_new_tokens" : 2048,
"temperature" : 0.3
},
2 changes: 1 addition & 1 deletion src/lib/server/textGeneration/tools.ts
@@ -131,7 +131,7 @@ export async function* runTools(
for await (const output of await endpoint({
messages: messagesWithFilesPrompt,
preprompt,
- generateSettings: { ...assistant?.generateSettings, max_new_tokens: 512 },
+ generateSettings: assistant?.generateSettings,
tools,
})) {
// model natively supports tool calls
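
To illustrate what the `tools.ts` change does, here is a minimal sketch of the two ways `generateSettings` is built. The `GenerateSettings` interface, the helper function names, and the sample `assistantSettings` values are hypothetical and exist only for this example; the real types live in the repository and may differ.

```typescript
// Hypothetical shape of the settings object, for illustration only.
interface GenerateSettings {
  truncate?: number;
  max_new_tokens?: number;
  temperature?: number;
}

// Reverted behavior: spread the assistant's settings, then force
// max_new_tokens to 512. The later key wins, so any assistant value is lost.
function settingsBeforeRevert(s?: GenerateSettings): GenerateSettings {
  return { ...s, max_new_tokens: 512 };
}

// Behavior after this commit: pass the assistant's settings through unchanged
// (possibly undefined, leaving defaults to whatever the endpoint applies).
function settingsAfterRevert(s?: GenerateSettings): GenerateSettings | undefined {
  return s;
}

// Sample assistant settings (assumed values, not taken from the repository).
const assistantSettings: GenerateSettings = { max_new_tokens: 1024, temperature: 0.3 };

console.log(settingsBeforeRevert(assistantSettings).max_new_tokens); // 512
console.log(settingsAfterRevert(assistantSettings)?.max_new_tokens); // 1024
```

Spreading `undefined` into an object literal is a no-op, so before the revert an assistant with no settings still received `{ max_new_tokens: 512 }`; after the revert it receives `undefined`.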