You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It looks like I can input multiple image queries with only one query_embedding. But, after I get predicted logits and bboxes, I can't figure out corresponding labels and bboxes.
The text was updated successfully, but these errors were encountered:
I want to detect multiple image-conditioned queries on a single image at one time.
I use the code from
OWL_ViT_minimal_example.ipynb
for Image-conditioned detection.It looks like I can input multiple image queries with only one query_embedding. But, after I get predicted logits and bboxes, I can't figure out corresponding labels and bboxes.
The text was updated successfully, but these errors were encountered: