
Fuse moe lora #3801

Open

cjw-d wants to merge 9 commits into PaddlePaddle:develop from cjw-d:fuse_moe_lora

Conversation

@cjw-d
Contributor

@cjw-d cjw-d commented Feb 3, 2026

PR types

Others

PR changes

Others

Description

fuse moe lora

@paddle-bot

paddle-bot bot commented Feb 3, 2026

Thanks for your contribution!

@cjw-d
Contributor Author

cjw-d commented Feb 4, 2026

/re-run all-failed

@codecov-commenter

codecov-commenter commented Feb 4, 2026

Codecov Report

❌ Patch coverage is 59.30233% with 105 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@1d6500a). Learn more about missing BASE report.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| paddleformers/peft/lora/lora_layers.py | 48.99% | 76 Missing ⚠️ |
| paddleformers/peft/lora/lora_model.py | 26.31% | 14 Missing ⚠️ |
| paddleformers/transformers/auto/modeling.py | 23.07% | 10 Missing ⚠️ |
| paddleformers/transformers/glm4_moe/modeling.py | 66.66% | 2 Missing ⚠️ |
| paddleformers/nn/experts.py | 97.22% | 1 Missing ⚠️ |
| paddleformers/transformers/deepseek_v3/modeling.py | 80.00% | 1 Missing ⚠️ |
| paddleformers/transformers/qwen2_moe/modeling.py | 83.33% | 1 Missing ⚠️ |

❌ Your patch status has failed because the patch coverage (59.30%) is below the target coverage (75.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files
@@            Coverage Diff             @@
##             develop    #3801   +/-   ##
==========================================
  Coverage           ?   32.22%           
==========================================
  Files              ?      433           
  Lines              ?    82298           
  Branches           ?        0           
==========================================
  Hits               ?    26524           
  Misses             ?    55774           
  Partials           ?        0           

☔ View full report in Codecov by Sentry.

lora_module = RowParallelQuantizationLoRALinear(module, lora_config)
# LoRA row parallel will split the lora_A matrix
self.add_lora_split_mapping(module_name + ".lora_A", is_column=False)
elif attribute_chain[-1] == "experts":
Collaborator

1. Is this matching rule general enough, or could it also replace modules in other existing models and cause problems?
2. If a model implements its experts in an unusual way, an interface should be kept so a custom LoRA expert can be plugged in.
3. Can it also match paddlefleet's experts?

Contributor Author

The matching rule has been revised, and an interface is kept for adapting custom LoRA experts.
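For illustration only, the snippet below is a minimal sketch of what such an adaptation hook could look like, assuming a registry keyed by expert class. Every name here (`register_lora_expert`, `wrap_experts`, `_CUSTOM_LORA_EXPERT_FACTORIES`) is hypothetical and not the actual PaddleFormers API.

```python
# Hypothetical registry-style hook for custom LoRA experts (sketch, not the real API).
_CUSTOM_LORA_EXPERT_FACTORIES = {}

def register_lora_expert(expert_cls):
    """Associate an expert layer class with a factory that builds its LoRA wrapper."""
    def decorator(factory):
        _CUSTOM_LORA_EXPERT_FACTORIES[expert_cls] = factory
        return factory
    return decorator

def wrap_experts(module, lora_config, default_factory):
    """Use a registered custom factory for this module type, otherwise the default wrapper."""
    factory = _CUSTOM_LORA_EXPERT_FACTORIES.get(type(module), default_factory)
    return factory(module, lora_config)
```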

@@ -1055,8 +1017,7 @@ def get_lora_model(self, model: Union[PretrainedModel, nn.Layer], lora_config: L
return model
if isinstance(lora_config.target_modules, str):
lora_config.target_modules = [lora_config.target_modules]
Collaborator

Related unit tests need to be added.

Contributor Author

Related unit tests have been added.

 lora_config.target_modules = [lora_config.target_modules]
-for i in model.named_sublayers():
-    module_name = i[0]
+for module_name, module in model.named_sublayers():
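As context for this hunk, the sketch below shows how target-module matching over `named_sublayers()` typically works. The helper name and the regex full-match are assumptions for illustration, not a quote of the PaddleFormers implementation.

```python
import re

def iter_target_modules(model, target_modules):
    """Yield (name, sublayer) pairs whose names match a target-module pattern (sketch only)."""
    if isinstance(target_modules, str):
        target_modules = [target_modules]
    for module_name, module in model.named_sublayers():
        if any(re.fullmatch(pattern, module_name) for pattern in target_modules):
            yield module_name, module
```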
Collaborator

LoRA merge support also needs to be considered.

Contributor Author

merge_model has been adapted.
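For readers unfamiliar with LoRA merging, the sketch below shows the standard merge formula `W' = W + A·B·(alpha/r)` with Paddle tensors. The function name and the assumption that `lora_A`/`lora_B` use the `(in_features, r)`/`(r, out_features)` layout are illustrative, not taken from merge_model itself.

```python
import paddle

def merge_lora_weight(weight, lora_A, lora_B, lora_alpha, r):
    """Return weight + lora_A @ lora_B * (lora_alpha / r)."""
    return weight + paddle.matmul(lora_A, lora_B) * (lora_alpha / r)

# Toy usage: a 4x8 linear with rank-2 LoRA adapters.
w = paddle.randn([4, 8])
a = paddle.randn([4, 2])   # lora_A: (in_features, r)
b = paddle.randn([2, 8])   # lora_B: (r, out_features)
merged = merge_lora_weight(w, a, b, lora_alpha=4, r=2)
```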

@@ -1055,8 +1017,7 @@ def get_lora_model(self, model: Union[PretrainedModel, nn.Layer], lora_config: L
return model
if isinstance(lora_config.target_modules, str):
lora_config.target_modules = [lora_config.target_modules]
Collaborator

The get_merge_state_dict function also needs to be adapted.

Contributor Author

Adapted.

@cjw-d
Contributor Author

cjw-d commented Feb 4, 2026

/re-run all-failed

Collaborator

@lugimzzz left a comment

lgtm

import paddle
import paddle.nn as nn

from .activation import ACT2FN
Collaborator

Replace the MoE in the other models with this as well.

from ...nn.attention.interface import ALL_ATTENTION_FUNCTIONS
from ...nn.criterion.interface import CriterionLayer
from ...nn.embedding import Embedding as GeneralEmbedding
from ...nn.experts import MoeExperts as Qwen3VLMoeTextExperts
Collaborator

Verify correctness with an experiment.
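One way to do such a check, purely as a sketch with hypothetical names and assuming both modules take only the hidden states as input, is to feed identical random activations through the fused experts and a reference per-expert implementation and compare outputs:

```python
import paddle

def check_experts_equivalence(fused_experts, reference_experts, hidden_size,
                              num_tokens=16, atol=1e-5):
    """Feed the same random hidden states through both modules and compare outputs."""
    x = paddle.randn([num_tokens, hidden_size])
    out_fused = fused_experts(x)
    out_ref = reference_experts(x)
    assert paddle.allclose(out_fused, out_ref, atol=atol).item(), \
        "fused MoE LoRA output diverges from the reference implementation"
```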

@cjw-d
Contributor Author

cjw-d commented Feb 5, 2026

/re-run all-failed
