Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generating the same embeddings for all tokens #1

Open
makwana050 opened this issue Aug 10, 2023 · 0 comments
Open

Generating the same embeddings for all tokens #1

makwana050 opened this issue Aug 10, 2023 · 0 comments

Comments

@makwana050
Copy link

I am trying to use the SAILER_en model from the Hugging Face model hub to generate embeddings for my legal documents. However, I am having trouble generating proper embeddings using this model. The embeddings for each token in my document are the same, regardless of the content of the document. I am simply loading the model and trying to fetch the embedding from last hidden state while passing the input tokens.

I noticed that the sample code provided by Hugging Face for using the SAILER_en model with the transformers library references a BertForCotMAE class that does not exist in the library. I am not sure if this is related to the issue I am experiencing.

Can you provide any guidance on how to use the SAILER_en model correctly to generate embeddings for legal documents? Any help would be greatly appreciated.

Thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant