Skip to content
/ minilm Public

A tiny transformer language model that learns to babble its own code

License

Notifications You must be signed in to change notification settings

neftlon/minilm

Repository files navigation

minilm

A tiny transformer language model that learns to babble its own code.

Demo animation

requirements

The program is written in Python using jax. It requires regex and tqdm for training.

usage

The application consists of two trainable components: a tokenizer and the language model. The following code first trains a tokenizer, then trains a model, and then uses both to generate text.

python3 tokenizer.py
python3 train.py
python3 generate.py

About

A tiny transformer language model that learns to babble its own code

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages