I'm working on it. I guess the work is converting the names defined in esm-efficient to the ones defined in ESM2, which are the standard Hugging Face names? Let me know if you are interested! We can talk more about that!
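A rough sketch of what that name conversion could look like, assuming it comes down to renaming state-dict keys with a few pattern rules. The rules below are purely illustrative placeholders (the source-side names in particular are made up, not taken from esm-efficient), and `rename_keys` is a hypothetical helper, not something in either codebase:

```python
import re
from typing import Any, Dict, List, Tuple

# Hypothetical (pattern, replacement) rules. The real mapping has to be read
# off the two models' actual state-dict keys; these are placeholders only.
RENAME_RULES: List[Tuple[str, str]] = [
    (r"^embed_tokens\.", "esm.embeddings.word_embeddings."),
    (r"^layers\.(\d+)\.", r"esm.encoder.layer.\1."),
]


def rename_keys(
    state_dict: Dict[str, Any],
    rules: List[Tuple[str, str]] = RENAME_RULES,
) -> Dict[str, Any]:
    """Return a copy of state_dict with every key rewritten by the rules."""
    renamed: Dict[str, Any] = {}
    for key, value in state_dict.items():
        new_key = key
        # Apply each rule in order; keys that match nothing pass through.
        for pattern, replacement in rules:
            new_key = re.sub(pattern, replacement, new_key)
        # Two source keys mapping to one target would silently drop a tensor.
        if new_key in renamed:
            raise ValueError(f"key collision on {new_key!r}")
        renamed[new_key] = value
    return renamed
```

Keys that match no rule are kept unchanged, so comparing the renamed keys against the target model's `state_dict().keys()` would show which names still need a rule.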
Just so you know, pull requests are welcome. Please make sure any change passes the test cases. I renamed and created safetensors from the checkpoints because pickles are not reliable in my experience.
Hi,
In the current codebase, we have to download the checkpoint to a local path and load it using the following method:
I wonder if we can load the checkpoint directly from Hugging Face?
such as
That way, it would be more straightforward to swap the ESM2 model in an existing codebase for a flash-attention version.
It seems doable to me because esm-efficient shares the same model architecture as ESM2, except for the use of flash attention?
Would love to hear your thoughts!
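To make the request concrete, here is a minimal sketch of the kind of helper that could sit in front of the existing local loader. It assumes the weights are published on the Hub as a single safetensors file; `resolve_checkpoint`, the default filename, and the repo id in the usage comment are all hypothetical. Only `hf_hub_download` (from `huggingface_hub`) and `load_file` (from `safetensors.torch`) are existing library calls:

```python
import os
from typing import Callable, Optional


def resolve_checkpoint(
    source: str,
    filename: str = "model.safetensors",  # assumed file name inside the repo
    downloader: Optional[Callable[..., str]] = None,
) -> str:
    """Return a local file path for a local checkpoint or a Hub repo id."""
    # An existing local file is used as-is, so current workflows keep working.
    if os.path.isfile(source):
        return source
    if downloader is None:
        # Imported lazily so local-only use does not require huggingface_hub.
        from huggingface_hub import hf_hub_download

        downloader = hf_hub_download
    # hf_hub_download caches the file locally and returns the cached path.
    return downloader(repo_id=source, filename=filename)


# Usage (hypothetical repo id; needs network access and safetensors installed):
#   from safetensors.torch import load_file
#   path = resolve_checkpoint("some-user/some-esm2-checkpoint")
#   state_dict = load_file(path)
```

The returned path could then be handed to whatever loading method the codebase already uses, so the model code itself would not need to change.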