I want to index a long document with multi-vector embeddings.
Say I have span annotations that define how my document should be chunked, and I want to apply late chunking to it. That is, I want to compute the embeddings of all my tokens over the full document before I perform the chunking.
This way I have contextualized chunks with multi-vector representation.
Essentially, what I want to do is late chunking without the chunk-wise vector pooling at the end.
How do I do this with RAGatouille?
I know how to get multi-vector embeddings for my document, but I can't figure out how to index them and still benefit from the optimizations RAGatouille provides.
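To make the request concrete, here is a minimal sketch of the chunking step described above. It is plain Python and does not call RAGatouille or ColBERT at all. The function names `late_chunk` and `maxsim`, the toy embeddings, and the assumption that spans are given as `(start, end)` token offsets with an exclusive end are all hypothetical and only serve as illustration.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def late_chunk(
    token_embeddings: Sequence[Vector],
    spans: Sequence[Tuple[int, int]],
) -> List[List[Vector]]:
    """Split full-document token embeddings into per-chunk multi-vector blocks.

    token_embeddings: one vector per token, computed over the whole document,
        so each vector already carries document-level context.
    spans: (start, end) token offsets, end exclusive (assumed format).
    """
    n_tokens = len(token_embeddings)
    chunks = []
    for start, end in spans:
        if not (0 <= start < end <= n_tokens):
            raise ValueError(f"span ({start}, {end}) does not fit 0..{n_tokens}")
        # No pooling: keep every token vector that falls inside the span.
        chunks.append(list(token_embeddings[start:end]))
    return chunks


def maxsim(query: Sequence[Vector], chunk: Sequence[Vector]) -> float:
    """Late-interaction score: for each query vector take its best dot
    product against the chunk's vectors, then sum over query vectors."""
    return sum(
        max(sum(q * d for q, d in zip(q_vec, d_vec)) for d_vec in chunk)
        for q_vec in query
    )


# Toy stand-in for contextualized token embeddings of one document.
doc_embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]
chunks = late_chunk(doc_embeddings, [(0, 2), (2, 5)])
score = maxsim([[1.0, 0.0]], chunks[1])
```

The same slicing works on a NumPy array or a torch tensor of shape `(n_tokens, dim)`. The open question is how to hand blocks like `chunks` to the indexer instead of raw text.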
BlueKiji77 changed the title from "Can colbert v2 take chunks with their embeddings as input for indexing?" to "[HELP] Can colbert v2 take chunks with their embeddings as input for indexing?" on Nov 23, 2024
BlueKiji77 changed the title from "[HELP] Can colbert v2 take chunks with their embeddings as input for indexing?" to "[HELP] Can Ragatouille take chunks with their embeddings as input for indexing?" on Nov 23, 2024