-
-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature]: Return hidden states (in progress?) #6165
Comments
Thanks for the question! We currently use |
Hi, I also have the same need. I hope to store the |
same need, hope we can get this as an option to return embedding. |
same need! |
Hi, |
same need |
2 similar comments
same need |
same need |
🚀 The feature, motivation and pitch
I know this feature request sort of already exists: #5950
(and older, semi related requests) #3594 #1857
This is a similar pitch but I am creating a new issue as I noticed newer developments in the codebase. The pitch is to support returning hidden states when generating sequences. This enables many potential behaviors such as output classification, guardrails, etc. Whereas #5950 suggested a different step for embedding, I would suggest building it in as an option to EngineArgs or as an option that can be passed in with each generation request.
I see that in
v0.5.1
there is already some new code inModelDriverBase
to supportreturn_hidden_states
. However, I don't see that supported yet in the LLM engine yet (not an input toEngineArgs
). Basically, it seems like this feature is under development. I am mainly wondering what the timeline is for that? And what is the approach being taken so that I and the community can develop accordingly?Alternatives
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: