💜 See also the article on how to build a minimalistic GPT in Paged Out! issue #5, page 6: "GPT in PyTorch"
# Generative Pre-trained Transformer in PyTorch from scratch
## Training

```sh
python src/train.py
```

Options:

- `--batch_size 64`
- `--num-epochs 100`
- `--lr 0.0001`
- `--from-checkpoint checkpoint_path.pth`

The model is checkpointed after each epoch and stored in the `checkpoints/` directory.
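Since checkpoints accumulate in `checkpoints/`, the most recent one can be located by modification time and passed to `--from-checkpoint`. A minimal sketch, assuming checkpoints keep the `.pth` extension (the exact filenames are not specified here):

```python
from pathlib import Path

# Sketch: locate the newest checkpoint in checkpoints/ by modification time.
# Only the directory name comes from this README; the *.pth pattern is an
# assumption based on the checkpoint_path.pth example above.
checkpoints = sorted(Path("checkpoints").glob("*.pth"),
                     key=lambda p: p.stat().st_mtime)
if checkpoints:
    print(f"Resume with: python src/train.py --from-checkpoint {checkpoints[-1]}")
```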
Training can also be started from Python:

```python
from train import train

train()
```
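If `train()` also accepts keyword arguments mirroring the CLI flags, a configured run could look like the sketch below; these parameter names are an assumption, not confirmed by this README:

```python
from train import train

# Hypothetical keyword arguments mirroring the CLI flags above;
# the actual signature of train() may differ.
train(
    batch_size=64,
    num_epochs=100,
    lr=0.0001,
)
```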
## Inference

```sh
python src/run.py --from-checkpoint checkpoint_path.pth
```

or from Python:

```python
from run import run

run(model_path="checkpoint_path.pth", prompt="Rick:\nMorty, where are you?")
```
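Because `run()` takes the checkpoint path and prompt as arguments, it is straightforward to generate from several prompts in one script. A sketch (the second prompt is illustrative, and whether `run()` returns the text or prints it is not specified here):

```python
from run import run

# Generate from a few prompts against the same checkpoint.
# The second prompt is illustrative only.
prompts = [
    "Rick:\nMorty, where are you?",
    "Morty:\nAw geez, Rick.",
]

for prompt in prompts:
    run(model_path="checkpoint_path.pth", prompt=prompt)
```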
## Citation

If you use this software in your research, please use the following citation:

```bibtex
@misc{Maczan_GPT_2024,
  title = {Generative Pre-trained Transformer in PyTorch},
  author = {Maczan, Jędrzej Paweł},
  howpublished = {\url{https://github.com/jmaczan/gpt}},
  year = {2024},
  publisher = {GitHub}
}
```
## License

GPL v3

Jędrzej Maczan, 2024