
Add ESM to huggingface #13662

Closed · liujas000 wants to merge 27 commits

Conversation


@liujas000 commented on Sep 21, 2021

What does this PR do?

This PR adds ESM-1b to Hugging Face Transformers, following the steps in https://huggingface.co/transformers/add_new_model.html.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@sgugger

@liujas000 liujas000 marked this pull request as ready for review September 21, 2021 00:17
@sgugger (Collaborator) left a comment

Thanks a lot for your PR! I left a few comments but it's already in pretty good shape.

I have a few more general questions:

  • The fast tokenizer is imported in the main inits and used in the docs, but it does not exist in the files, so you should either add the fast tokenizer or remove any mention of it.
  • Do all the model heads make sense for this new architecture? From what I understand it's linked to proteins, so I don't know whether the task-specific heads like multiple choice or question answering are really useful.
  • There should be a file to test the tokenizer (see the sketch after this list).
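For reference, a minimal tokenizer test file could look like the sketch below. This is only a sketch of the shape such a test might take: the ESMTokenizer import path, its constructor signature, and the per-residue tokenization behavior are all assumptions for illustration, not confirmed details of this PR.

```python
# tests/test_tokenization_esm.py -- a minimal sketch, not this PR's actual test.
# Assumptions: the tokenizer class is ESMTokenizer, it takes a vocab file as its
# first argument, and it splits a protein string into one token per residue.
import os
import tempfile
import unittest

from transformers.models.esm.tokenization_esm import ESMTokenizer


class ESMTokenizationTest(unittest.TestCase):
    def setUp(self):
        # Write a tiny vocabulary: a few special tokens plus four amino acids.
        self.tmpdir = tempfile.mkdtemp()
        self.vocab_file = os.path.join(self.tmpdir, "vocab.txt")
        with open(self.vocab_file, "w") as f:
            f.write("\n".join(["<cls>", "<pad>", "<eos>", "<unk>", "L", "A", "G", "V"]))

    def test_tokenize_protein_sequence(self):
        tokenizer = ESMTokenizer(self.vocab_file)
        # Assumes one token per amino-acid residue.
        self.assertEqual(tokenizer.tokenize("LAGV"), ["L", "A", "G", "V"])


if __name__ == "__main__":
    unittest.main()
```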

Review threads were opened on:
  • docs/source/index.rst (outdated)
  • docs/source/model_doc/esm.rst (outdated)
  • docs/source/model_doc/esm.rst (outdated)
  • src/transformers/__init__.py (outdated)
  • src/transformers/__init__.py (outdated)
  • src/transformers/models/esm/modeling_esm.py (outdated)
  • src/transformers/models/esm/modeling_esm.py (outdated)
  • tests/test_modeling_esm.py
  • tests/test_modeling_esm.py
@liujas000 (Author)

@sgugger, thanks for the feedback! There are two common tests that I'm failing; do you have any insight into what the proper fix would be?

❯ pytest tests/test_modeling_esm.py --disable-warnings
==================================================== test session starts ====================================================
platform linux -- Python 3.7.10, pytest-6.2.4, py-1.10.0, pluggy-0.13.1
rootdir: /private/home/jasonliu/work-huggingface/transformers-dev, configfile: setup.cfg
plugins: dash-1.21.0, forked-1.3.0, xdist-2.3.0, timeout-1.4.2, hydra-core-1.1.0
collected 67 items

tests/test_modeling_esm.py .....................................s..............FF.....sss..ss.                        [100%]

========================================================= FAILURES ==========================================================
______________________________________ ESMModelTest.test_save_load_fast_init_from_base ______________________________________

self = <tests.test_modeling_esm.ESMModelTest testMethod=test_save_load_fast_init_from_base>

    def test_save_load_fast_init_from_base(self):
        config, inputs_dict = self.model_tester.prepare_config_and_inputs_for_common()
>       base_class = MODEL_MAPPING[config.__class__]

tests/test_modeling_common.py:208:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

self = _LazyAutoMapping(), key = <class 'transformers.models.esm.configuration_esm.ESMConfig'>

    def __getitem__(self, key):
>       model_type = self._reverse_config_mapping[key.__name__]
E       KeyError: 'ESMConfig'

src/transformers/models/auto/auto_factory.py:513: KeyError
_______________________________________ ESMModelTest.test_save_load_fast_init_to_base _______________________________________

self = <tests.test_modeling_esm.ESMModelTest testMethod=test_save_load_fast_init_to_base>

    def test_save_load_fast_init_to_base(self):
        config, inputs_dict = self.model_tester.prepare_config_and_inputs_for_common()
>       base_class = MODEL_MAPPING[config.__class__]

tests/test_modeling_common.py:253:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

self = _LazyAutoMapping(), key = <class 'transformers.models.esm.configuration_esm.ESMConfig'>

    def __getitem__(self, key):
>       model_type = self._reverse_config_mapping[key.__name__]
E       KeyError: 'ESMConfig'

src/transformers/models/auto/auto_factory.py:513: KeyError
================================================== short test summary info ==================================================
FAILED tests/test_modeling_esm.py::ESMModelTest::test_save_load_fast_init_from_base - KeyError: 'ESMConfig'
FAILED tests/test_modeling_esm.py::ESMModelTest::test_save_load_fast_init_to_base - KeyError: 'ESMConfig'
============================== 2 failed, 59 passed, 6 skipped, 30 warnings in 99.74s (0:01:39) ==============================

@sgugger (Collaborator) commented on Oct 19, 2021

It doesn't look like you added your model in the configuration_auto mappings, just the modeling_auto mappings. That's why you get this error.
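Concretely, the fix amounts to registering the new model type in the auto-mapping dictionaries in src/transformers/models/auto/configuration_auto.py. A minimal sketch of the kind of entries that were missing (the exact dictionary names can vary between versions):

```python
# src/transformers/models/auto/configuration_auto.py -- sketch of the entries
# the error points at; only the "esm" lines are new.
from collections import OrderedDict

CONFIG_MAPPING_NAMES = OrderedDict(
    [
        # ... existing entries ...
        ("esm", "ESMConfig"),  # model_type -> config class name
    ]
)

MODEL_NAMES_MAPPING = OrderedDict(
    [
        # ... existing entries ...
        ("esm", "ESM"),  # model_type -> display name used in the docs
    ]
)
```

With the config registered, the _LazyAutoMapping lookup in the failing tests can resolve ESMConfig back to the "esm" model type and find the base model class that was already registered in modeling_auto.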

@liujas000 (Author)

Thanks! I think this is ready for review again (rebased onto upstream).

The review threads below are attached to this excerpt from the conversion script (the import list is truncated in the diff view):

import torch

import esm as esm_module
from transformers.models.bert.modeling_bert import (
A contributor commented:
could we maybe replace those classes by the corresponding ESM... classes, e.g. ESMIntermediate, ...

The contributor followed up:
This would still be nice to change before merging
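For context, the usual Transformers pattern here is to copy the BERT sublayers into modeling_esm.py under ESM names with a "# Copied from" marker, rather than importing BertIntermediate and friends directly. A sketch of what that looks like, assuming the ESM feed-forward sublayer is identical to BERT's:

```python
# src/transformers/models/esm/modeling_esm.py -- sketch of the rename pattern;
# assumes ESM reuses BERT's intermediate (feed-forward) sublayer unchanged.
import torch.nn as nn

from transformers.activations import ACT2FN


# Copied from transformers.models.bert.modeling_bert.BertIntermediate with Bert->ESM
class ESMIntermediate(nn.Module):
    def __init__(self, config):
        super().__init__()
        self.dense = nn.Linear(config.hidden_size, config.intermediate_size)
        if isinstance(config.hidden_act, str):
            self.intermediate_act_fn = ACT2FN[config.hidden_act]
        else:
            self.intermediate_act_fn = config.hidden_act

    def forward(self, hidden_states):
        # Standard transformer feed-forward expansion followed by the nonlinearity.
        hidden_states = self.dense(hidden_states)
        hidden_states = self.intermediate_act_fn(hidden_states)
        return hidden_states
```

The conversion script can then import ESMIntermediate and the other ESM-named classes instead of the BERT ones.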

@patrickvonplaten (Contributor) left a comment

I fixed a couple of things (corrected the config and removed the dead ESMForQuestionAnswering code). Hope that was OK!

PR is good for merge IMO. It would be nice if we could replace the BERT modules with ESM modules in the conversion script and then we should probably also flip the private model repo to public so that the integration test can run :-)

Should we upload the other checkpoints as well? https://github.com/facebookresearch/esm#pre-trained-models-

@LysandreJik (Member) left a comment

Nice, this looks good to me! Thank you for working on this!

@patrickvonplaten (Contributor)

Hey @liujas000,

Can we help you in any way to get this PR merged? :-)

@liujas000 (Author)

@patrickvonplaten sorry for the delay; I will land this week!

@patrickvonplaten (Contributor)

Test failure seems unrelated

@patrickvonplaten (Contributor)

Thanks a lot for making this PR more or less mergeable, @liujas000. I think there are just some final comments from @sgugger and @patrickvonplaten to be taken care of and then the PR is good to go :-)

@github-actions bot commented on Feb 8, 2022

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions bot closed this on Feb 17, 2022
@gianhiltbrunner

I'd love to use this model, what still needs to be done to get this merged?

@patrickvonplaten (Contributor)

cc @liujas000 @Rocketknight1

@Rocketknight1 (Member)

@gianhiltbrunner We're still waiting on an internal review from the contributors at Facebook, I believe! I'll let you know if there's any update.

@franzigeiger

I would also be very happy if this gets merged! Any progress?

@patrickvonplaten (Contributor)

Any updates here?
