
Remove hf_auth_token use#1822

Open
Abhishek-Varma wants to merge 1 commit intonod-ai:mainfrom
Abhishek-Varma:hf_auth_removal

Conversation

@Abhishek-Varma
Contributor

-- This commit removes `--hf_auth_token` uses from vicuna.py.
-- It adds llama2 models based on daryl149's HF repo.

Signed-off-by: Abhishek Varma <abhishek@nod-labs.com>

@Abhishek-Varma
Contributor Author

Currently marking it as draft since 13B and 70B paths need testing.
CC: @powderluv

@powderluv
Contributor

If we only download the mlir we wouldn't hit the token right?

@Abhishek-Varma
Contributor Author

> If we only download the mlir we wouldn't hit the token right?

I did try that, but during the run we still hit the issue, because we use a tokenizer to decode each generated token, and that tokenizer is instantiated from the HF repo we use.

@Abhishek-Varma
Contributor Author

> If we only download the mlir we wouldn't hit the token right?
>
> I did try that, but during the run we still hit the issue, because we use a tokenizer to decode each generated token, and that tokenizer is instantiated from the HF repo we use.

Even this approach would work, since we're blocking the IR generation anyway.
It would then essentially download the tokenizer's config files from daryl149/llama-2-7b-hf, while we already have the MLIR generated from meta-llama/Llama-2-7b-chat-hf.

I verified it on CPU for llama2 7B.

With this PR we don't need to maintain config files for the tokenizer, but we're changing the base HF repo, which would impact the workflow once IR generation is given a green signal.

With the other PR we only incur the overhead of maintaining the config files, keeping the rest of the infra the same.
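For reference, the tokenizer-loading pattern under discussion can be sketched as follows. This is a minimal sketch assuming the `transformers` library; the helper name is hypothetical, the repo ids come from this thread, and older `transformers` versions spell the keyword `use_auth_token` rather than `token`:

```python
def load_llama2_tokenizer(repo_id="daryl149/llama-2-7b-hf", hf_auth_token=None):
    """Instantiate the tokenizer used to decode each generated token id.

    Loading from the public daryl149 mirror needs no auth token; a gated
    repo such as "meta-llama/Llama-2-7b-chat-hf" would raise an auth
    error unless hf_auth_token is supplied.
    """
    from transformers import AutoTokenizer  # deferred import

    return AutoTokenizer.from_pretrained(repo_id, token=hf_auth_token)
```

The decode loop would then call `tokenizer.decode(token_id)` on each generated id, which is why the tokenizer (and hence the repo it is fetched from) is needed even when the MLIR is downloaded prebuilt.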

@Abhishek-Varma Abhishek-Varma marked this pull request as ready for review September 8, 2023 14:23
