a smol course

This is a practical course on aligning language models for your specific use case. It's a handy way to get started with aligning language models, because everything runs on most local machines. There are minimal GPU requirements and no paid services. The course is based on the SmolLM2 series of models, but you can transfer the skills you learn here to larger models or other small language models.

Participation is open, free, and now!

This course is open and peer reviewed. To get involved with the course open a pull request and submit your work for review. Here are the steps:

Fork the repo here
Read the material, make changes, do the exercises, add your own examples.
Open a PR on the december_2024 branch
Get it reviewed and merged

This should help you learn and to build a community-driven course that is always improving.

We can discuss the process in this discussion thread.

Course Outline

This course provides a practical, hands-on approach to working with small language models, from initial training through to production deployment.

Module	Description	Status	Release Date
Instruction Tuning	Learn supervised fine-tuning, chat templating, and basic instruction following	✅ Complete	Dec 3, 2024
Preference Alignment	Explore DPO and ORPO techniques for aligning models with human preferences	🚧 In Progress	Dec 6, 2024
Parameter-efficient Fine-tuning	Learn LoRA, prompt tuning, and efficient adaptation methods	🚧 In Progress	Dec 9, 2024
Evaluation	Use automatic benchmarks and create custom domain evaluations	🚧 In Progress	Dec 16, 2024
Vision-language Models	Adapt multimodal models for vision-language tasks	📝 Planned	Dec 20, 2024
Synthetic Datasets	Create and validate synthetic datasets for training	📝 Planned	Dec 23, 2024
Inference	Infer with models efficiently	📝 Planned	Dec 27, 2024
Deployment	Deploy and serve models at scale	📝 Planned	Dec 30, 2024

Why Small Language Models?

While large language models have shown impressive capabilities, they often require significant computational resources and can be overkill for focused applications. Small language models offer several advantages for domain-specific applications:

Efficiency: Require significantly less computational resources to train and deploy
Customization: Easier to fine-tune and adapt to specific domains
Control: Better understanding and control of model behavior
Cost: Lower operational costs for training and inference
Privacy: Can be run locally without sending data to external APIs

Prerequisites

Before starting, ensure you have the following:

Basic understanding of machine learning and natural language processing.
Familiarity with Python, PyTorch, and the transformers library.
Access to a pre-trained language model and a labeled dataset.

Installation

All the examples run in the same environment, so you only need to install the dependencies once like this:

pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
1_instruction_tuning		1_instruction_tuning
2_preference_alignment		2_preference_alignment
3_parameter_efficient_finetuning		3_parameter_efficient_finetuning
4_evaluation		4_evaluation
5_vision_language_models		5_vision_language_models
6_synthetic_datasets		6_synthetic_datasets
7_inference		7_inference
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
banner.png		banner.png
pull_request_template.md		pull_request_template.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

a smol course

Participation is open, free, and now!

Course Outline

Why Small Language Models?

Prerequisites

Installation

About

Releases

Packages

Languages

License

Gibs0111/smol-course

Folders and files

Latest commit

History

Repository files navigation

a smol course

Participation is open, free, and now!

Course Outline

Why Small Language Models?

Prerequisites

Installation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages