[Feature]: Return hidden states (in progress?) #6165

Open
Elanmarkowitz opened this issue Jul 6, 2024 · 9 comments

@Elanmarkowitz commented Jul 6, 2024

🚀 The feature, motivation and pitch

I know this feature request sort of already exists: #5950
(and there are older, semi-related requests: #3594, #1857)

This is a similar pitch, but I am creating a new issue because I noticed newer developments in the codebase. The pitch is to support returning hidden states when generating sequences. This enables many potential use cases such as output classification, guardrails, etc. Whereas #5950 suggested a separate step for embedding, I would suggest building it in as an option to EngineArgs, or as an option that can be passed in with each generation request.

I see that in v0.5.1 there is already some new code in ModelDriverBase to support return_hidden_states. However, I don't see it supported in the LLM engine yet (it is not an input to EngineArgs). Basically, it seems like this feature is under development. I am mainly wondering what the timeline for it is, and what approach is being taken, so that I and the community can develop accordingly.
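
To make the proposal concrete, here is a rough sketch of how such an option could look from the offline LLM API. The return_hidden_states argument and the hidden_states output field are hypothetical; they do not exist in vLLM today and only illustrate the shape of the proposed feature.

```python
from vllm import LLM, SamplingParams

# Hypothetical: `return_hidden_states` is NOT an existing EngineArgs/LLM option;
# it only illustrates how the proposed feature could be switched on.
llm = LLM(model="meta-llama/Llama-2-7b-hf", return_hidden_states=True)

outputs = llm.generate(
    ["The capital of France is"],
    SamplingParams(max_tokens=8),
)

for request_output in outputs:
    completion = request_output.outputs[0]
    # Hypothetical field: e.g. one final-layer hidden-state vector per
    # generated token, shape (num_generated_tokens, hidden_size).
    hidden = getattr(completion, "hidden_states", None)
    print(completion.text, None if hidden is None else hidden.shape)
```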

Alternatives

No response

Additional context

No response

@LiuXiaoxuanPKU
Collaborator

Thanks for the question! We currently use return_hidden_states for speculative decoding. You just need to pass it as a config, as shown here. Feel free to mimic the behavior there.
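
For anyone trying to follow that pointer before first-class support exists: the idea is that the model runner is constructed with a return_hidden_states flag and, when it is set, attaches the last-layer hidden states to the output it returns. A minimal toy sketch of that pattern in plain PyTorch follows (this is not the actual vLLM worker code; class and field names are illustrative):

```python
import torch
import torch.nn as nn


class TinyRunner(nn.Module):
    """Toy stand-in for a model runner that can optionally expose hidden states."""

    def __init__(self, vocab_size: int = 100, hidden_size: int = 16,
                 return_hidden_states: bool = False):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.lm_head = nn.Linear(hidden_size, vocab_size)
        self.return_hidden_states = return_hidden_states

    def forward(self, token_ids: torch.Tensor) -> dict:
        hidden = self.embed(token_ids)      # (seq_len, hidden_size)
        logits = self.lm_head(hidden)       # (seq_len, vocab_size)
        out = {"next_tokens": logits.argmax(dim=-1)}
        if self.return_hidden_states:
            # Attach the last-layer hidden states so downstream code
            # (classification, guardrails, interpretability) can consume them.
            out["hidden_states"] = hidden
        return out


runner = TinyRunner(return_hidden_states=True)
result = runner(torch.tensor([1, 2, 3]))
print(result["hidden_states"].shape)  # torch.Size([3, 16])
```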

@Hambaobao

Hi, I also have the same need. I hope to store the hidden_states during model inference so that I can conduct some interpretability research.

@PeterAdam2015

Same need; hope we can get this as an option to return embeddings.

@ummagumm-a

same need!

@freesunshine0316

> Thanks for the question! We currently use return_hidden_states for speculative decoding. You just need to pass it as a config, as shown here. Feel free to mimic the behavior there.

Hi, can you further specify, e.g. with demo code?

@J0hnArren commented Aug 1, 2024

same need

@Gxy-2001

same need

@zkwhandan

same need

@LiuXiaoxuanPKU self-assigned this Sep 24, 2024
@jvlinsta

Same need, to generate some attention heatmaps, akin to: [attention heatmap image]
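
As a rough illustration of the heatmap use case, assuming per-token attention weights (or a similarity matrix derived from hidden states) could be obtained from the engine, plotting is straightforward. The matrix below is random placeholder data, since vLLM does not currently expose attention weights:

```python
import numpy as np
import matplotlib.pyplot as plt

# Placeholder causal attention matrix (query tokens x key tokens); in practice
# this would come from the model rather than a random generator.
rng = np.random.default_rng(0)
attn = np.tril(rng.random((12, 12)))
attn = attn / attn.sum(axis=-1, keepdims=True)  # row-normalize like a softmax output

plt.imshow(attn, cmap="viridis")
plt.xlabel("key position")
plt.ylabel("query position")
plt.colorbar(label="attention weight")
plt.title("Toy attention heatmap")
plt.show()
```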
