GitHub - vivek-rd/tinystoriesGPT: Train GPT style model on tinystories dataset

Objective

The aim of this project is to familiarize myself with few things -

Training a GPT style decoder only model which generates grammatically correct sentences.
Learn how to use CUDA enabled environment for running PyTorch models.
Understand the ins and outs of transformer architecture.

TODO

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
gpt_neo		gpt_neo
Dockerfile		Dockerfile
README.md		README.md
data_prepare.py		data_prepare.py
model.py		model.py
requirements.txt		requirements.txt
sample.py		sample.py
train_gpt.py		train_gpt.py