Skip to content

This is a project of my course "Deep learning Fundamentals and Practice". Note, code and other materials will be uploaded

Notifications You must be signed in to change notification settings

Toflamus/Small_language_model_project

Repository files navigation

Small_language_model_project

This is a project of my course "Deep learning Fundamentals and Practice". Note, code and other materials will be uploaded

Learning materials

Hugging face

Hugging face intro-Chinese version in Zhihu
Hugging face courses and tutorial

miniGPT: model archtecture

mingpt

Pretrain

Dataset

TinyStories

Tokenizer

gpt-2like Byte-Pair-Enconding(BPE) tokenizer: roneneldan

Pretrain code

See the file: Server_Training_a_causal_language_model_from_scratch_(PyTorch).ipynb This is adapted from the NLP course in hugging face. See here.

Model

Right now this model and relative information of the pretain process can be found here

Finetune

Dataset

standford alpaca

Finetune code

See file: Server_Fineture_process.ipynb

Model

Right now this model and relative information of the pretain process can be found here

Report

The final report can be found here

About

This is a project of my course "Deep learning Fundamentals and Practice". Note, code and other materials will be uploaded

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published