loading directly from hugginface #3

pengzhangzhi · 2024-11-25T19:07:42Z

Hi,
In the current codebase, we have to download the ckpt to the local and load it using the following method:

# download "{model}.safetensors" to the local 
# and load it like below
model = ESM2.from_pretrained("{model}.safetensors", device=0)

I wonder if we can directly load the ckpt from Hugginface?
such as

model = ESM2.from_pretrained("facebook/esm2_t30_150M_UR50D", device=0)

That way, it's more straightforward to replace existing codebase with a flash-attention version of esm2.

It seems doable to me bc eesm shares the same model architecture with ESM2 except for the use of flash attention?

Would love to hear ur thoughts!

The text was updated successfully, but these errors were encountered:

pengzhangzhi · 2024-11-25T21:55:21Z

I'm working on it. I guess the work is converting of the names defined in esm-efficient to be what's defined in esm2, which is the standrad hugginface names? Let me know if you are interested! We can talk more about that!!

MuhammedHasan · 2024-11-27T05:27:16Z

Just so you know, pull requests are welcome. Please make sure any change passes the test cases. I renamed and created safetensors from the checkpoints because pickles are not reliable in my experience.

Given we have the weights in the huggingface https://huggingface.co/mhcelik/esm-efficient/tree/main, it need to fetched to implement something like:

model = ESM2.from_pretrained("esm-efficient/esm2_8M", device=0)

I plan to support this at some point, but pull requests are appreciated and make it sooner.

MuhammedHasan · 2024-12-20T07:13:56Z

Fixed by #9

MuhammedHasan self-assigned this Nov 25, 2024

MuhammedHasan added the question Further information is requested label Nov 25, 2024

MuhammedHasan added the enhancement New feature or request label Dec 20, 2024

MuhammedHasan closed this as completed Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

loading directly from hugginface #3

loading directly from hugginface #3

pengzhangzhi commented Nov 25, 2024 •

edited

Loading

pengzhangzhi commented Nov 25, 2024

MuhammedHasan commented Nov 27, 2024

MuhammedHasan commented Dec 20, 2024

loading directly from hugginface #3

loading directly from hugginface #3

Comments

pengzhangzhi commented Nov 25, 2024 • edited Loading

pengzhangzhi commented Nov 25, 2024

MuhammedHasan commented Nov 27, 2024

MuhammedHasan commented Dec 20, 2024

pengzhangzhi commented Nov 25, 2024 •

edited

Loading