Add support to export ColPali Model to ONNX #2074
base: main
Conversation
@fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun Are you open to merging this?
Apologies for the delay @akshayballal95, could you add a test with a tiny random model like https://huggingface.co/hf-internal-testing/tiny-random-PaliGemmaForConditionalGeneration? It can be added here: https://github.com/huggingface/optimum/blob/main/tests/exporters/exporters_utils.py#L37
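For reference, that file maps each exportable model type to a tiny checkpoint used in the export tests. A minimal sketch of what the addition might look like is below; the PYTORCH_EXPORT_MODELS_TINY name matches the dictionary in that file, while the "colpali" key is an assumption that must match whatever model type this PR registers.

# Hypothetical entry in tests/exporters/exporters_utils.py; the key must match the
# model type registered by this PR for the ColPali export.
PYTORCH_EXPORT_MODELS_TINY = {
    # ... existing entries ...
    "colpali": "hf-internal-testing/tiny-random-PaliGemmaForConditionalGeneration",
}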
class ColPaliModelPatcher(ModelPatcher):
    def __init__(
        self,
        config: "OnnxConfig",
        model: Union["PreTrainedModel", "TFPreTrainedModel"],
        model_kwargs: Optional[Dict[str, Any]] = None,
    ):
        super().__init__(config, model, model_kwargs)

        def patched_forward(input_ids=None, pixel_values=None, attention_mask=None, **kwargs):
            outputs = self.orig_forward(
                input_ids=input_ids, pixel_values=pixel_values, attention_mask=attention_mask, **kwargs
            )
            return outputs

        self.patched_forward = patched_forward
why is it needed?
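For context, a patcher like this is normally wired in by overriding patch_model_for_export on the model's ONNX config, so the exporter traces patched_forward instead of the original forward. The sketch below is only illustrative: the ColPaliOnnxConfig name and import paths are assumptions, and a real config would also need to define its inputs and outputs.

# Hypothetical sketch of how the patcher could be hooked into the ONNX config.
# ColPaliModelPatcher is the class defined in this PR; ColPaliOnnxConfig is an assumed name.
from typing import Any, Dict, Optional

from optimum.exporters.onnx.base import OnnxConfig
from optimum.exporters.onnx.model_patcher import ModelPatcher


class ColPaliOnnxConfig(OnnxConfig):
    def patch_model_for_export(self, model, model_kwargs: Optional[Dict[str, Any]] = None) -> ModelPatcher:
        # Return the custom patcher so its patched_forward is used during ONNX tracing.
        return ColPaliModelPatcher(self, model, model_kwargs=model_kwargs)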
class ColPaliModelPatcher(ModelPatcher):
    def __init__(
        self,
        config: "OnnxConfig",
        model: Union["PreTrainedModel", "TFPreTrainedModel"],
        model_kwargs: Optional[Dict[str, Any]] = None,
    ):
        super().__init__(config, model, model_kwargs)
        def patched_forward(input_ids=None, pixel_values=None, attention_mask=None, **kwargs):
            outputs = self.orig_forward(
                input_ids=input_ids, pixel_values=pixel_values, attention_mask=attention_mask, **kwargs
            )
            return outputs
        self.patched_forward = patched_forward
already added above
)
dummy_inputs["input_ids"] = generator.concat_inputs([prefix_tensor, dummy_inputs["input_ids"]], dim=1)
dummy_inputs["attention_mask"] = generator.random_mask_tensor(
    shape=[generator.batch_size, generator.sequence_length + 1024],
Where does the value 1024 come from? Shouldn't it depend on the model's config?
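As a possible way to address this (a sketch, not necessarily the fix this PR should adopt): for PaliGemma-style models the number of image tokens is (image_size / patch_size)^2 of the vision backbone, which for ColPali's 448x448 inputs with patch size 14 gives 1024, so the prefix length could be derived from the config. The model_config handle and the surrounding call mirror the PR's snippet above; attribute names follow the transformers PaliGemmaConfig layout and should be treated as assumptions.

# Sketch (assumed attribute names): compute the image-token count from the vision config
# instead of hard-coding 1024. For 448x448 images with patch size 14 this is (448 // 14) ** 2 == 1024.
image_size = model_config.vision_config.image_size
patch_size = model_config.vision_config.patch_size
num_image_tokens = (image_size // patch_size) ** 2

dummy_inputs["attention_mask"] = generator.random_mask_tensor(
    shape=[generator.batch_size, generator.sequence_length + num_image_tokens],
)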
What does this PR do?
This PR adds support for exporting the ColPali merged model to ONNX format. The model is based on the PaliGemma model type, so I have added it under the "feature-extraction" task. Please suggest a better way to integrate this if there is one. If this looks fine with a few modifications, I can add support for the PaliGemma text-generation task as well.
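For reference, a merged ColPali checkpoint could then presumably be exported through the standard Python export entry point as sketched below; the checkpoint id is an example and not something this PR pins down.

# Sketch of exporting a ColPali checkpoint once this PR lands; the model id is an example.
from optimum.exporters.onnx import main_export

main_export(
    "vidore/colpali-v1.2-merged",  # example merged ColPali checkpoint
    output="colpali_onnx",
    task="feature-extraction",
)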
Before submitting
Who can review?
@fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun