Skip to content
This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

make RoBERTa usable in more tasks including QA #1017

Closed
wants to merge 1 commit into from

Conversation

borguz
Copy link
Contributor

@borguz borguz commented Oct 1, 2019

Summary:
Currently Roberta encoder, model and tensorizer are pretty stand-alone, not conforming to other PyText tasks. This diff is an attempt to better integrate it.

It involves the following:

  • Make GPT-2 BPE act like a proper tokenizer and also return char indices. This makes Roberta tensorizer more modular so code can be re-used
  • Make Roberta tensorizer conform more closely to BERTTensorizer so that the TransformerSentenceEncoder interfaces are better aligned.
  • Add a Roberta tensorizer for question answering

Differential Revision: D17690805

@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Oct 1, 2019
borguz added a commit to borguz/pytext-1 that referenced this pull request Oct 2, 2019
Summary:
Pull Request resolved: facebookresearch#1017

Currently Roberta encoder, model and tensorizer are pretty stand-alone, not conforming to other PyText tasks.  This diff is an attempt to better integrate it.

It involves the following:
- Make GPT-2 BPE act like a proper tokenizer and also return char indices.  This makes Roberta tensorizer more modular so code can be re-used
- Make Roberta tensorizer conform more closely to BERTTensorizer so that the TransformerSentenceEncoder interfaces are better aligned.
- Add a Roberta tensorizer for question answering

Differential Revision: D17690805

fbshipit-source-id: fc7872098e5a654e1da2c8d4878be56535991c60
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D17690805

Summary:
Pull Request resolved: facebookresearch#1017

Currently Roberta encoder, model and tensorizer are pretty stand-alone, not conforming to other PyText tasks.  This diff is an attempt to better integrate it.

It involves the following:
- Make GPT-2 BPE act like a proper tokenizer and also return char indices.  This makes Roberta tensorizer more modular so code can be re-used
- Make Roberta tensorizer conform more closely to BERTTensorizer so that the TransformerSentenceEncoder interfaces are better aligned.
- Add a Roberta tensorizer for question answering

Differential Revision: D17690805

fbshipit-source-id: a7c6d10497f2c0a0cdf7bdffe843e4c64384d5c1
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D17690805

borguz added a commit to borguz/pytext-1 that referenced this pull request Oct 3, 2019
Summary:
Pull Request resolved: facebookresearch#1017

Currently Roberta encoder, model and tensorizer are pretty stand-alone, not conforming to other PyText tasks.  This diff is an attempt to better integrate it.

It involves the following:
- Make GPT-2 BPE act like a proper tokenizer and also return char indices.  This makes Roberta tensorizer more modular so code can be re-used
- Make Roberta tensorizer conform more closely to BERTTensorizer so that the TransformerSentenceEncoder interfaces are better aligned.
- Add a Roberta tensorizer for question answering

Differential Revision: D17690805

fbshipit-source-id: 3c774429ed97598094d657c3069bd9865dc46f27
borguz added a commit to borguz/pytext-1 that referenced this pull request Oct 3, 2019
Summary:
Pull Request resolved: facebookresearch#1017

Currently Roberta encoder, model and tensorizer are pretty stand-alone, not conforming to other PyText tasks.  This diff is an attempt to better integrate it.

It involves the following:
- Make GPT-2 BPE act like a proper tokenizer and also return char indices.  This makes Roberta tensorizer more modular so code can be re-used
- Make Roberta tensorizer conform more closely to BERTTensorizer so that the TransformerSentenceEncoder interfaces are better aligned.
- Add a Roberta tensorizer for question answering

Differential Revision: D17690805

fbshipit-source-id: e97fe1352047e8d915005a3fcdd76bd53b268a03
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 3fd9764.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
CLA Signed Do not delete this pull request or issue due to inactivity. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants