how to get unbuffered responses? #4

Closed

alph4b3th opened this issue Apr 12, 2023 · 5 comments

Comments

@alph4b3th

I noticed that .predict returns a complete string containing the model's response. However, I want to give the user a "typing" experience, where the model streams what it is generating to the user's client in real time. Since .predict only returns once the response is finished, how can I receive the answer token by token, as the model predicts it?
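
A minimal sketch of the blocking behavior being described, assuming l is a loaded model instance (the method name is taken from the question; the exact signature may differ in the bindings):

// Predict blocks until generation is complete and returns the whole string;
// nothing reaches the caller while tokens are still being generated.
response, err := l.Predict("Hello, how are you?")
if err != nil {
    log.Fatal(err)
}
fmt.Println(response) // the full response arrives only at the end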

@mudler changed the title from "how to get buffered responses?" to "how to get unbuffered responses?" on Apr 12, 2023
@mudler
Member

mudler commented Apr 12, 2023

This is not implemented yet - however it's something I'm interested in supporting.

@mudler
Member

mudler commented Apr 12, 2023

One way would be for the binding to send data back directly to a Go channel, for instance: https://github.com/matiasinsaurralde/cgo-channels/tree/master. However, I still see that this could incur a huge penalty, as context switches between Go and C in a loop have a high computational cost.

I think we could offer low-level functionality to address this specific case, scoped to just a few exposed functions, but I wouldn't suggest using it when performance is a requirement.
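
As a rough, self-contained sketch of the channel idea (tokenHook and fakePredict are hypothetical stand-ins for whatever per-token hook the binding would expose; the point is that every token costs one C-to-Go boundary crossing):

package main

import "fmt"

// tokenHook stands in for whatever per-token callback the binding would
// expose; in the real binding it would be invoked from C for each token.
var tokenHook func(token string) bool

// fakePredict simulates a blocking predict call that emits tokens one by one.
func fakePredict(prompt string) {
    for _, tok := range []string{"Hello", ",", " world", "!"} {
        if tokenHook != nil && !tokenHook(tok) {
            return // the callback asked us to stop early
        }
    }
}

func main() {
    tokens := make(chan string, 64) // buffered to soften per-token latency
    go func() {
        defer close(tokens)
        tokenHook = func(token string) bool {
            tokens <- token // in the real binding: one C-to-Go crossing per token
            return true
        }
        fakePredict("Hello") // blocks until generation finishes
    }()
    for tok := range tokens { // the consumer relays tokens as they arrive
        fmt.Print(tok)
    }
    fmt.Println()
}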

@alph4b3th
Author

> One way would be for the binding to send data back directly to a Go channel, for instance: https://github.com/matiasinsaurralde/cgo-channels/tree/master. However, I still see that this could incur a huge penalty, as context switches between Go and C in a loop have a high computational cost.
>
> I think we could offer low-level functionality to address this specific case, scoped to just a few exposed functions, but I wouldn't suggest using it when performance is a requirement.

Can you create this functionality?

@noxer

noxer commented Apr 27, 2023

To answer the original question:

// "llama" here is a previously loaded model instance.
llama.SetTokenCallback(func(token string) bool {
    fmt.Print(token) // print each token as soon as it is generated
    return true      // returning true lets the predictor continue
})
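
For context, roughly how that callback fits around a full prediction call, assuming go-llama.cpp's New and Predict entry points (the import path, model path, and options below are assumptions and may differ between versions of the bindings):

package main

import (
    "fmt"
    "log"

    llama "github.com/go-skynet/go-llama.cpp"
)

func main() {
    // Model path is a placeholder; options are elided for brevity.
    l, err := llama.New("./model.bin")
    if err != nil {
        log.Fatal(err)
    }

    // Stream each token to the user as soon as the model produces it.
    l.SetTokenCallback(func(token string) bool {
        fmt.Print(token)
        return true // keep predicting
    })

    // Predict still blocks until generation finishes, but the callback
    // above has already delivered every token along the way.
    if _, err := l.Predict("Hello, how are you?"); err != nil {
        log.Fatal(err)
    }
    fmt.Println()
}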

@mudler
Member

mudler commented Apr 28, 2023

I think we can close this now, thanks @noxer ❤️!

@mudler closed this as completed on Apr 28, 2023