Strategy to embed images #6

dcramer · 2024-09-11T18:05:50Z

There's a lot of visuals in rulebooks that'd be great to reference. Ignoring whether we can textual represent them, the marker parser we're using will actually transform text and extract images at the same time. I believe we could design a system that lets us easily reference those in outputs (and embed them).

The text was updated successfully, but these errors were encountered:

tian-yi · 2024-09-11T19:35:13Z

Yes, that's exactly copali for. Copali is used to embed the images. and direct semantically search the images using text. Also, the cool part is that you can then feed the retrieved image and text into a multi-modal LLMs like gpt-4o, gemini etc. they will answer you questions using both.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strategy to embed images #6

Strategy to embed images #6

dcramer commented Sep 11, 2024

tian-yi commented Sep 11, 2024

Strategy to embed images #6

Strategy to embed images #6

Comments

dcramer commented Sep 11, 2024

tian-yi commented Sep 11, 2024