Experiments with Transformers

A bunch of experiments with Transformers, BERT and GPT-2, including question answering and conditional text generation.

Read the article at https://arxiv.org/abs/2009.04968

Infer masked token with BERT

Description

BERT's authors pre-trained it on two tasks, one of which is the Masked Language Model (MLM) task: some input tokens are masked at random and the model has to predict them. Since BERT is already pre-trained on this task, no further training is needed and we can test it directly.

In this repository, I have implemented a basic script to infer the masked token in a sentence.

Read the BERT paper: https://arxiv.org/pdf/1810.04805.pdf

Code

Along with this repository, there are some Bash commands to run the Python code in a Docker container. In any case, you can find all the Python scripts in ./code.

You can see the code execution step by step in the notebook Infer masked token with BERT. The notebooks are also available on Kaggle.

Commands

Note: these commands have been tested on macOS and in Git Bash (Windows).

You can start/stop the Docker container with these two commands:

sh manager.sh docker:run
sh manager.sh docker:down

Once the Docker container is running, execute the BERT script as follows:

sh manager.sh bert

It will then display a prompt where you can type sentences with a masked token. For example:

$ sh manager.sh bert
Downloading: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████| 996k/996k [00:02<00:00, 485kB/s]
Downloading: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████| 625/625 [00:00<00:00, 176kB/s]
Downloading: 100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████| 714M/714M [01:07<00:00, 10.7MB/s]

Input text: This is a [MASK] model
----------------------------------------
mathematical 0.04100513085722923
model 0.03972144424915314
single 0.01666860282421112
similar 0.014901496469974518
common 0.01419056300073862
========================================

...
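
Under the hood, the script does something along these lines. This is a minimal sketch based on the Hugging Face transformers library; the checkpoint name and output formatting are illustrative and may differ from the actual code in ./code:

import torch
from transformers import BertForMaskedLM, BertTokenizer

# Illustrative checkpoint; the script may load a different one.
MODEL_NAME = "bert-base-uncased"

tokenizer = BertTokenizer.from_pretrained(MODEL_NAME)
model = BertForMaskedLM.from_pretrained(MODEL_NAME)
model.eval()

text = "This is a [MASK] model"
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Find the position of [MASK] and take the 5 most likely tokens for it.
mask_position = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
probabilities = torch.softmax(logits[0, mask_position], dim=-1).squeeze(0)
top_probs, top_ids = probabilities.topk(5)

for token_id, prob in zip(top_ids.tolist(), top_probs.tolist()):
    print(tokenizer.convert_ids_to_tokens(token_id), prob)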

Question-answering with BERT

Description

In question-answering tasks, the model receives a text and a question about that text, and it has to mark the beginning and the end of the answer within the text.

In this case, it is necessary to fine-tune BERT. We use XQuAD (Cross-lingual Question Answering Dataset) for this, which consists of a subset of 240 paragraphs and 1190 question-answer pairs from the development set of SQuAD v1.1, together with their professional translations into 10 languages. Source: https://github.com/deepmind/xquad
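
XQuAD files follow the SQuAD v1.1 JSON layout (articles contain paragraphs, and each paragraph has a context plus a list of question-answer pairs). As a rough sketch, the data can be flattened like this; the file name is illustrative:

import json

# Each answer stores its text and its character offset within the context,
# which is what the model learns to predict as a start/end span.
with open("xquad.en.json", encoding="utf-8") as f:
    dataset = json.load(f)

examples = []
for article in dataset["data"]:
    for paragraph in article["paragraphs"]:
        context = paragraph["context"]
        for qa in paragraph["qas"]:
            answer = qa["answers"][0]
            examples.append({
                "context": context,
                "question": qa["question"],
                "answer_text": answer["text"],
                "answer_start": answer["answer_start"],
            })

print(len(examples), "question-answer pairs loaded")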

Code

You can see the code execution step by step in the notebook Question answering with BERT.
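
For reference, extractive question answering with a fine-tuned BERT model boils down to predicting a start and an end position within the context. Below is a minimal sketch with the Hugging Face transformers library, where the checkpoint name is a placeholder for a model already fine-tuned on XQuAD:

import torch
from transformers import BertForQuestionAnswering, BertTokenizerFast

# Placeholder: in practice, load the checkpoint fine-tuned on XQuAD.
MODEL_NAME = "bert-base-multilingual-cased"

tokenizer = BertTokenizerFast.from_pretrained(MODEL_NAME)
model = BertForQuestionAnswering.from_pretrained(MODEL_NAME)
model.eval()

question = "Where is the Eiffel Tower located?"
context = "The Eiffel Tower is a wrought-iron lattice tower located in Paris, France."

# Question and context are encoded together as a single sequence pair.
inputs = tokenizer(question, context, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# The model scores every token as a possible start/end of the answer span.
start_index = outputs.start_logits.argmax(dim=-1).item()
end_index = outputs.end_logits.argmax(dim=-1).item()

answer_ids = inputs["input_ids"][0][start_index : end_index + 1]
print(tokenizer.decode(answer_ids))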

Conditional Text Generation

Description

The goal of this task is to generate a text about a specific topic or situation. We used two models in this experiment: GPT-2 and a Transformer model.

On the one hand, although GPT-2 was only trained to predict the next token in a sentence, it surprisingly learned basic competence in tasks such as translating between languages and answering questions. On the other hand, we implemented a basic Transformer encoder-decoder architecture to analyse whether it is able to generate text similar to the training data.
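
Conditioning GPT-2 on a prompt and sampling a continuation takes only a few lines with the Hugging Face transformers library. This is a minimal sketch; the generation settings used in the notebook may differ:

from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

# The prompt conditions the model on the topic we want it to write about.
prompt = "In a shocking finding, scientists discovered"
input_ids = tokenizer.encode(prompt, return_tensors="pt")

# Sample a continuation (top-k sampling as an example strategy).
output_ids = model.generate(
    input_ids,
    max_length=60,
    do_sample=True,
    top_k=50,
    pad_token_id=tokenizer.eos_token_id,
)

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))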

Code

There are two notebooks covering these experiments.
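
For the custom model, the general shape of a Transformer encoder-decoder for text generation looks roughly as follows. This PyTorch sketch uses hypothetical sizes and is not the exact architecture from the notebook; the conditioning sequence goes into the encoder, and the decoder predicts the output text token by token:

import torch
import torch.nn as nn

# Hypothetical sizes for illustration only.
VOCAB_SIZE, D_MODEL, MAX_LEN = 8000, 256, 128

class Seq2SeqTransformer(nn.Module):
    def __init__(self):
        super().__init__()
        self.tok_emb = nn.Embedding(VOCAB_SIZE, D_MODEL)
        self.pos_emb = nn.Embedding(MAX_LEN, D_MODEL)
        self.transformer = nn.Transformer(
            d_model=D_MODEL, nhead=4,
            num_encoder_layers=2, num_decoder_layers=2,
            dim_feedforward=512, batch_first=True,
        )
        self.lm_head = nn.Linear(D_MODEL, VOCAB_SIZE)

    def embed(self, ids):
        positions = torch.arange(ids.size(1), device=ids.device)
        return self.tok_emb(ids) + self.pos_emb(positions)

    def forward(self, src_ids, tgt_ids):
        # Causal mask so each target position only attends to earlier positions.
        tgt_mask = self.transformer.generate_square_subsequent_mask(tgt_ids.size(1))
        hidden = self.transformer(self.embed(src_ids), self.embed(tgt_ids), tgt_mask=tgt_mask)
        return self.lm_head(hidden)  # next-token logits: (batch, tgt_len, VOCAB_SIZE)

model = Seq2SeqTransformer()
src = torch.randint(0, VOCAB_SIZE, (2, 16))  # conditioning sequence (e.g. topic tokens)
tgt = torch.randint(0, VOCAB_SIZE, (2, 32))  # shifted target text during training
print(model(src, tgt).shape)                 # torch.Size([2, 32, 8000])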


Have fun! ᕙ (° ~ ° ~)
