Fine-tune Meta's lightweight Llama small language models using document understanding and knowledge extraction (DUKE) for model distillation
This project uses DUKE (Document Understanding and Knowledge Extraction), a novel technique I've coined, together with Rank-Stabilized LoRA (rsLoRA), a variant of LoRA (Low-Rank Adaptation) and a Parameter-Efficient Fine-Tuning (PEFT) method for supervised fine-tuning, to fine-tune Meta Llama 3.2 3B Instruct, a lightweight 3-billion-parameter instruction-tuned generative model. We will fine-tune the Llama model on an entirely new domain, similar to the trade show example.
- LoRA adapter: garystafford/Llama-3.2-3B-Instruct-lora-nvidia-blackwell
- Fine-tuning dataset: garystafford/fine-tune-nvidia-blackwell
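For illustration only, here is a minimal sketch of what rsLoRA supervised fine-tuning with PEFT and TRL can look like for this base model and dataset. The rank, alpha, dropout, target modules, and output directory below are assumptions for the example, not this project's actual settings.

```python
# Minimal sketch (not the project's exact training script): rsLoRA fine-tuning
# of Llama 3.2 3B Instruct with PEFT + TRL. Hyperparameters are illustrative.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTConfig, SFTTrainer

model_id = "meta-llama/Llama-3.2-3B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Fine-tuning dataset published with this project; assumed to contain a
# text/messages column that SFTTrainer can consume directly.
dataset = load_dataset("garystafford/fine-tune-nvidia-blackwell", split="train")

# Rank-Stabilized LoRA: use_rslora=True scales the adapter by alpha/sqrt(r)
# instead of alpha/r, which stabilizes training at higher ranks.
peft_config = LoraConfig(
    r=16,                      # illustrative rank
    lora_alpha=32,             # illustrative alpha
    lora_dropout=0.05,
    use_rslora=True,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=peft_config,
    args=SFTConfig(output_dir="./llama-3.2-3b-rslora", num_train_epochs=1),
    processing_class=tokenizer,
)
trainer.train()
trainer.save_model("./llama-3.2-3b-rslora")
```

The published adapter listed above can later be attached to the base model at inference time with PEFT's `PeftModel.from_pretrained`.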
Create a virtual Python environment on Mac/Linux.
python --version # I am using Python 3.13.2
python -m pip install virtualenv -Uqqq
python -m venv .venv
source .venv/bin/activate
Install Python package dependencies.
python -m pip install pip -Uqqq
python -m pip install -r requirements.txt -Uqqq
Deactivate and delete the virtual environment once you are done.
deactivate
rm -rf .venv
Tested with an NVIDIA GPU with a minimum of 12-16 GB of VRAM.
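As a quick, optional check (not part of the project's code), you can confirm PyTorch sees a CUDA-capable GPU and report its total VRAM:

```python
# Optional sanity check: confirm an NVIDIA GPU is visible and report its VRAM.
import torch

if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"GPU: {props.name}, VRAM: {props.total_memory / 1024**3:.1f} GB")
else:
    print("No CUDA-capable GPU detected.")
```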
Tested with the following Python package versions:
accelerate 1.7.0
bitsandbytes 0.45.5
peft 0.15.2
torch 2.7.0+cu128
torchaudio 2.7.0+cu128
torchvision 0.22.0+cu128
transformers 4.51.3
trl 0.17.0
xformers 0.0.30
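To compare your environment against this list, an illustrative snippet like the following (not part of the project) prints the installed versions:

```python
# Illustrative check: print installed versions of the key dependencies.
from importlib.metadata import PackageNotFoundError, version

for pkg in ["accelerate", "bitsandbytes", "peft", "torch", "torchaudio",
            "torchvision", "transformers", "trl", "xformers"]:
    try:
        print(f"{pkg:<15} {version(pkg)}")
    except PackageNotFoundError:
        print(f"{pkg:<15} not installed")
```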