# This project evaluates PEFT approaches, BitFit and LoRA, for large language models (LLMs), with an emphasis on small-scale models and ultra-low-resource fine-tuning.
To start, create a Hugging Face token to access the OPT-125m model (via the transformers library) and the Sentiment140 dataset (via the datasets library).
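A minimal sketch of this setup, assuming the public Hub identifiers `facebook/opt-125m` and `sentiment140` (the notebooks' exact loading code may differ):

```python
# Sketch: authenticate with a Hugging Face token, then load OPT-125m and Sentiment140.
from huggingface_hub import login
from transformers import AutoTokenizer, AutoModelForSequenceClassification
from datasets import load_dataset

login(token="hf_...")  # paste your Hugging Face access token here

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-125m")
model = AutoModelForSequenceClassification.from_pretrained(
    "facebook/opt-125m", num_labels=2  # binary sentiment labels
)

dataset = load_dataset("sentiment140")  # tweets with sentiment labels
print(dataset)
```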
Next, run the notebook "OPT125M_Baseline_PEFT.ipynb", which evaluates the base OPT-125m model on binary sentiment (tweet) analysis, both as-is and with each of the two PEFT techniques (BitFit and LoRA) applied.
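For orientation, an illustrative sketch of the two PEFT setups; the hyperparameters (rank, alpha, target modules) and the classification-head handling are assumptions, not the notebook's exact configuration. Each setup would be applied to a fresh copy of the base model.

```python
from peft import LoraConfig, get_peft_model

# BitFit: freeze everything except bias terms (and, here, the classification head "score").
def apply_bitfit(model):
    for name, param in model.named_parameters():
        param.requires_grad = ("bias" in name) or name.startswith("score")
    return model

# LoRA: inject low-rank adapters into the attention projections.
lora_config = LoraConfig(
    r=8,                                   # adapter rank (assumed value)
    lora_alpha=16,                         # scaling factor (assumed value)
    target_modules=["q_proj", "v_proj"],   # OPT attention projections
    lora_dropout=0.05,
    task_type="SEQ_CLS",                   # sequence classification
)
lora_model = get_peft_model(model, lora_config)
lora_model.print_trainable_parameters()
```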
Next, to reproduce the visualizations included in the report, run the notebook in the visualizations folder; a pre-rendered PDF version is provided in the same folder as a guide.
Next, run the notebook "OPT125M_distilled.ipynb", which distills the OPT-350m model (acting as the teacher) into the student OPT-125m model. The expectation is that the distilled OPT-125m, having learned from the more powerful 350M teacher, will perform better on the binary sentiment analysis task.
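A rough sketch of the distillation objective (soft targets from the teacher plus hard labels); the temperature and loss weighting below are illustrative assumptions, not the notebook's settings.

```python
import torch
import torch.nn.functional as F
from transformers import AutoModelForSequenceClassification

teacher = AutoModelForSequenceClassification.from_pretrained(
    "facebook/opt-350m", num_labels=2
).eval()
student = AutoModelForSequenceClassification.from_pretrained(
    "facebook/opt-125m", num_labels=2
)

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft-target loss: KL divergence between temperature-scaled distributions.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target loss: ordinary cross-entropy on the true labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Inside the training loop (batch comes from the tokenized Sentiment140 data):
# with torch.no_grad():
#     teacher_logits = teacher(**batch).logits
# student_logits = student(**batch).logits
# loss = distillation_loss(student_logits, teacher_logits, batch["labels"])
```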
For documentation, contributions, and the publication, see the doc folder.