Reset context instead of quitting in interactive mode #145
Comments
@ggerganov Please, thanks!
It's currently on
I think this relates to #23, but it's not exactly the same.
I'm working on an attempt to fix this:

diff --git a/main.cpp b/main.cpp
index ca0fca8..126e53f 100644
--- a/main.cpp
+++ b/main.cpp
@@ -991,6 +991,14 @@ int main(int argc, char ** argv) {
             fflush(stdout);
         }

+        // check for [end of text]
+        if (params.interactive && embd.back() == 2) {
+            fprintf(stderr, " [end of text]\n");
+            // insert the antiprompt to continue the conversation.
+            // however, after this it seems like everything was lost.
+            embd_inp.insert(embd_inp.end(), antiprompt_inp.begin(), antiprompt_inp.end());
+        }
+
         // in interactive mode, and not currently processing queued inputs;
         // check if we should prompt the user for more
         if (params.interactive && embd_inp.size() <= input_consumed) {
@@ -1037,7 +1045,7 @@ int main(int argc, char ** argv) {
         }

         // end of text token
-        if (embd.back() == 2) {
+        if (!params.interactive && embd.back() == 2) {
             fprintf(stderr, " [end of text]\n");
             break;
         }

However, it looks like the context is broken when resuming the conversation after the [end of text].
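One possible explanation, offered here as an assumption rather than something verified against the rest of main.cpp, is that the sampled end-of-text token is still sitting in embd and gets evaluated into the model's context on the next loop iteration, after which the model behaves as if the conversation were over. A minimal variation of the added block above would drop the EOS token before it is ever fed back to the model and hand control back to the user; is_interacting is assumed here to be the flag that interactive mode uses to trigger the next user prompt:

```cpp
// sketch only, as a replacement for the first added block in the patch above;
// assumes token id 2 is EOS and that is_interacting gates the user-input prompt
if (params.interactive && !embd.empty() && embd.back() == 2) {
    fprintf(stderr, " [end of text]\n");
    // drop the EOS token so it is never evaluated into the model's context
    embd.pop_back();
    // queue the reverse prompt so the conversation resumes at the user's turn
    embd_inp.insert(embd_inp.end(), antiprompt_inp.begin(), antiprompt_inp.end());
    // ask for user input once the queued tokens have been consumed
    is_interacting = true;
}
```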
Thank you for using llama.cpp, and thank you for sharing your feature request. While you've provided valuable feedback on UX improvements, it overlaps a lot with what is being discussed in #23, and right now my top priority is to fix the underlying technical issue described in #91. Please use those other issues for further discussion, as they will reach a broader audience of people already following them.
It's really annoying that I have to restart the program every time it quits on [end of text] or after exceeding the context limit, because restarting means reloading the model, which is inefficient.
Is there any way to add an option that, instead of quitting, simply resets the context to the initial prompt?
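For reference, here is a rough sketch of what such a reset could look like inside the generation loop, under the assumption that the KV cache in main.cpp is addressed by n_past, so rewinding n_past to the end of the initial prompt keeps the prompt's cached activations valid while discarding everything generated after it. The names n_prompt, prompt_done, n_ctx, and remaining_tokens are illustrative; only embd, embd_inp, input_consumed, and n_past are taken from the patch above.

```cpp
// conceptual sketch of "reset to the initial prompt instead of quitting";
// not actual llama.cpp code
size_t n_prompt    = 0;      // value of n_past right after the prompt was evaluated
bool   prompt_done = false;

// ... inside the generation loop, once the initial prompt has been consumed:
if (!prompt_done && input_consumed >= embd_inp.size()) {
    n_prompt    = n_past;    // remember where the cached prompt ends
    prompt_done = true;
}

// on [end of text], or when the context is full, rewind instead of breaking out:
if (embd.back() == 2 /* EOS */ || n_past >= n_ctx) {
    fprintf(stderr, " [context reset]\n");
    n_past = n_prompt;                  // keep the prompt, forget the rest
    embd.clear();                       // drop the pending EOS token
    remaining_tokens = params.n_predict;
    // nothing is re-evaluated and the weights stay in memory,
    // so this avoids the cost of restarting the program
}
```

The idea is that the cache slots for the first n_prompt tokens are never overwritten until n_past grows past them again, so only the generated continuation is discarded and a full model reload is not required.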