Are-SLMs-Performant?

This repository houses the Python notebooks used to fine-tune, evaluate, and compare the performance of Small Language Models (SLMs) in order to answer the following question:

Can Small Language Models (SLMs) achieve performance comparable to LLMs in extracting information from HTML?

I have also published a Substack article discussing the approach I take to tackle this question, my findings, thoughts, and more!

📦 Installation

git clone https://github.com/pradyGn/are-SLMs-performant.git
cd are-SLMs-performant
pip install -r requirements.txt

🧪 Notebooks

2025-05-26_finetuning-SML.ipynb: Contains code to fine-tune a language model of your choice.
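
Roughly, the fine-tuning step looks like the sketch below. This is illustrative only, not the notebook's exact code: the base model, dataset format, prompt template, and hyperparameters are assumptions made for the example (Llama-3.2-1B appears here simply because it is one of the models compared in `results/`).

```python
# Illustrative LoRA fine-tune of a small causal LM on HTML-to-structured-output
# pairs. Model name, dataset file, columns, and hyperparameters are assumptions.
import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "meta-llama/Llama-3.2-1B"  # gated on the Hub; requires access
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM"))

# Hypothetical training file with "html" and "extracted" columns.
dataset = load_dataset("json", data_files="train.jsonl", split="train")

def to_features(example):
    # Hypothetical prompt template pairing raw HTML with its extracted target.
    prompt = (f"Extract the key information from this HTML:\n{example['html']}\n"
              f"### Answer:\n{example['extracted']}")
    return tokenizer(prompt, truncation=True, max_length=2048)

tokenized = dataset.map(to_features, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned-slm", per_device_train_batch_size=1,
                           gradient_accumulation_steps=8, num_train_epochs=1,
                           learning_rate=2e-4, logging_steps=10),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
trainer.save_model("finetuned-slm")
tokenizer.save_pretrained("finetuned-slm")
```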

2025-05-20_evaluate-SML.ipynb: Contains code to run inference with the fine-tuned language model.
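
As an illustration of this evaluation step, the sketch below loads the fine-tuned adapter and runs greedy decoding on one HTML snippet; the paths, prompt template, and generation settings are assumptions rather than the notebook's exact configuration.

```python
# Illustrative only: load the adapter saved by the fine-tuning sketch above and
# generate an answer for a single HTML snippet.
import torch
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

model = AutoPeftModelForCausalLM.from_pretrained("finetuned-slm", torch_dtype=torch.bfloat16)
model.eval()
tokenizer = AutoTokenizer.from_pretrained("finetuned-slm")

html = "<html><body><h1>ACME Widget</h1><p>Price: $19.99</p></body></html>"
prompt = f"Extract the key information from this HTML:\n{html}\n### Answer:\n"

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=256, do_sample=False)

# Decode only the tokens generated after the prompt, i.e. the model's answer.
answer = tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:],
                          skip_special_tokens=True)
print(answer)
```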

2025-04-07_results-comparison.ipynb: Contains helper functions and usage examples to compare the performance of the fine-tuned language models.
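
A minimal sketch of this comparison step, reading the parquet outputs under `results/` (see the folder structure below): the column names and the exact-match metric are assumptions, not necessarily what the notebook computes.

```python
# Illustrative only: score each model's saved outputs. The "prediction" and
# "ground_truth" column names are assumptions about the parquet schema.
import pandas as pd

def exact_match_rate(path: str) -> float:
    df = pd.read_parquet(path)
    return (df["prediction"].str.strip() == df["ground_truth"].str.strip()).mean()

for path in [
    "results/Llama-3.2-1B_test_dataset_output.parquet",
    "results/ReaderLM-v2_test_dataset_output.parquet",
]:
    print(f"{path}: exact match = {exact_match_rate(path):.3f}")
```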

πŸ“ Folder Structure

are-SLMs-performant/
├── notebooks/
│   ├── 2025-05-26_finetuning-SML.ipynb
│   ├── 2025-05-20_evaluate-SML.ipynb
│   └── 2025-04-07_results-comparison.ipynb
├── results/
│   ├── Llama-3.2-1B_test_dataset_output.parquet
│   ├── Llama-3.2-1B_unseen_test_dataset_output.parquet
│   ├── ReaderLM-v2_test_dataset_output.parquet
│   └── ReaderLM-v2_unseen_test_dataset_output.parquet
├── README.md
├── .gitignore
└── requirements.txt

🔗 References

πŸ™‹β€β™‚οΈ Contact
