Fix support of "inputs_embeds" input of VLMs in SDPAToPA transformation #28674

CuriousPanCake · 2025-01-24T17:22:00Z

VLMs already receive their token embeddings through the "inputs_embeds" input, so that input does not need to be transformed the way "input_ids" is.
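To illustrate the idea, here is a minimal standalone sketch of the name-based dispatch. It does not use OpenVINO: `ToyNode`, `process_input`, and the `"input_ids_adapter"` name are hypothetical stand-ins introduced only for this example, and the adapter node is a placeholder for whatever reshaping the actual pass applies to "input_ids".

```cpp
#include <memory>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a graph node; this is NOT an OpenVINO type.
struct ToyNode {
    std::string friendly_name;
    std::shared_ptr<ToyNode> source;  // non-null when this node wraps another node
};

// Returns the node that downstream consumers should read from.
// Assumption: dispatch is done on the input's friendly name, as in the diff under review.
std::shared_ptr<ToyNode> process_input(const std::shared_ptr<ToyNode>& input) {
    if (input->friendly_name == "input_ids") {
        // Token ids get an extra adapter node in front of their consumers
        // (placeholder for the real transformation of "input_ids").
        return std::make_shared<ToyNode>(ToyNode{"input_ids_adapter", input});
    }
    if (input->friendly_name == "inputs_embeds") {
        // Embeddings are already in the expected form and are passed through untouched.
        return input;
    }
    throw std::runtime_error("Couldn't process either input_ids or inputs_embeds.");
}
```

With this sketch, an "inputs_embeds" node comes back unchanged, an "input_ids" node comes back wrapped in an adapter whose `source` is the original node, and any other name raises an error.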

Tickets:

CVS-160598

Signed-off-by: Andrii Staikov <andrii.staikov@intel.com>

itikhono · 2025-01-27T08:21:57Z

src/core/src/pass/sdpa_to_paged_attention.cpp

+        // for "input_ids"
+        processed_input_ids = input_ids_node;
+    } else {
+        OPENVINO_ASSERT(processed_input_ids, "Counln't process neither input_ids, nor inputs_embeds.");


Suggested change

-        OPENVINO_ASSERT(processed_input_ids, "Counln't process neither input_ids, nor inputs_embeds.");
+        OPENVINO_ASSERT(processed_input_ids, "Couldn't process neither input_ids, nor inputs_embeds.");

itikhono · 2025-01-27T08:23:59Z

src/core/src/pass/sdpa_to_paged_attention.cpp

+        for (const auto& target : input_ids_target_inputs) {
+            target.replace_source_output(processed_input_ids);
+        }
+    } else if (input_ids_node->get_friendly_name() == "inputs_embeds") {


Could you add a new unit test for this scenario?

Yes, of course, I'm working on it rn

Created a test for the "inputs_embeds" input based on the nanoLLaVA pattern

preliminary fix

437dc90

CuriousPanCake requested a review from a team as a code owner January 24, 2025 17:22

CuriousPanCake requested review from itikhono and removed request for a team January 24, 2025 17:22

github-actions bot added the category: Core OpenVINO Core (aka ngraph) label Jan 24, 2025

ilya-lavrenov assigned itikhono Jan 24, 2025

ilya-lavrenov requested a review from popovaan January 24, 2025 17:26

ilya-lavrenov added this to the 2025.1 milestone Jan 24, 2025

CuriousPanCake changed the title ~~Fix support of "inputs_embeds" input of VLM in SDPAToPA transformation~~ Fix support of "inputs_embeds" input of VLMs in SDPAToPA transformation Jan 24, 2025

fix tests

f2df605

github-actions bot added the category: transformations OpenVINO Runtime library - Transformations label Jan 24, 2025

clear code

033e939

itikhono reviewed Jan 27, 2025

add test

18dba73

CuriousPanCake requested a review from itikhono January 28, 2025 09:08
