New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Questions abouts u-space #1

Open

harveymannering opened this issue Nov 5, 2024 · 0 comments

harveymannering commented Nov 5, 2024

Hi,

Thank you for your work, this is a very interesting paper! I have a couple of question:

Does u-space contain Image Tokens + Text Tokens + Timestep? Or is u-space refereing to the input image itself (i.e. $x_t$)? Or is the u-space something completely different?
Are any pretrained models available to be downloaded? In the experiments you mentioned that you used a pretrained latent
flow-matching model on MS COCO. Is this model publicly available?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment