Add support for Llama and Qwen models by marswen · Pull Request #135 · FMInference/FlexLLMGen

marswen · 2024-03-29T03:19:22Z

This PR is to add support for Llama and Qwen models. Based on the scripts for OPT, RMSNorm and ROPE were added, and some parameters were adjusted for corresponding model architecture.

lztjy · 2024-11-18T13:28:55Z

我采纳了你的提交，在qwen测试的设置中默认把输入序列padding到128个tokens，当我修改这个输入序列长度时候会出现报错，是我没用对吗？

marswen added 2 commits July 16, 2024 18:15

support llama models

8e2cc94

support qwen models

8b89e02

marswen force-pushed the llama branch from 55767df to 8b89e02 Compare July 16, 2024 10:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Llama and Qwen models#135

Add support for Llama and Qwen models#135
marswen wants to merge 2 commits intoFMInference:mainfrom
marswen:llama

marswen commented Mar 29, 2024

Uh oh!

lztjy commented Nov 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

marswen commented Mar 29, 2024

Uh oh!

lztjy commented Nov 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants