
[Mergekit] update & add LoRA merge #9811

Open · wants to merge 11 commits into develop

Conversation

@lugimzzz (Contributor) commented Jan 22, 2025

PR types

New features

PR changes

APIs

Description

  1. Optimize mergekit.
  2. Add support for LoRA merging. Example usage:
# Assuming MergeConfig and MergeModel are exported from paddlenlp.mergekit,
# per this PR's module layout (paddlenlp/mergekit/).
from paddlenlp.mergekit import MergeConfig, MergeModel

merge_config = MergeConfig(
    lora_model_path=lora_model_path,  # path to the LoRA checkpoint
    base_model_path=base_model_path,  # path to the base model
    output_path=output_path,          # where the merged model is written
)
mergekit = MergeModel(merge_config)
mergekit.merge_model()

-    weights = paddle.to_tensor(weight_list, dtype=stacked_tensors.dtype)
-    weights = weights.reshape([-1] + [1] * (len(stacked_tensors.shape) - 1))
-    weighted_sum = paddle.sum(stacked_tensors * weights, axis=0)
+    weighted_sum = paddle.zeros_like(tensor_list[0])
Contributor Author:

Testing found the for-loop implementation runs faster than stacking first, so the stack-based approach was replaced.
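
For reference, a minimal sketch of the direct-accumulation form the PR moves to (illustrative names, not the PR's exact code):

import paddle

def weighted_sum(tensor_list, weight_list):
    # Accumulate in place instead of stacking first, which avoids
    # materializing an extra [N, *shape] intermediate tensor.
    weighted = paddle.zeros_like(tensor_list[0])
    for tensor, weight in zip(tensor_list, weight_list):
        weighted += tensor * weight
    return weighted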

-    tensor *= mask
-    if self.merge_config.rescale:
-        tensor /= self.merge_config.reserve_p
+    mode = "upscale_in_train" if self.merge_config.rescale else "downscale_in_infer"
Contributor Author:

Switched to the more efficient dropout-based implementation.
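
A minimal sketch of the idea, assuming reserve_p is the keep ratio (so the drop probability is 1 - reserve_p) and using the standard paddle.nn.functional.dropout API; the PR's actual call site may differ:

import paddle.nn.functional as F

def drop_and_rescale(tensor, reserve_p, rescale):
    # "upscale_in_train" divides the kept elements by reserve_p, which
    # reproduces the manual mask-then-divide path; "downscale_in_infer"
    # with training=True applies the random mask without rescaling.
    mode = "upscale_in_train" if rescale else "downscale_in_infer"
    return F.dropout(tensor, p=1 - reserve_p, mode=mode, training=True)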

        return tensor
    else:
        raise ValueError(f"Unknown tensor type {self.merge_config.tensor_type}")

def magprune(self, tensor):
    if self.merge_config.tensor_type == "np":
-        if np.all(tensor == 0):
+        if not np.any(tensor != 0):
Contributor Author:

Testing found the np.any form to be more efficient than np.all.
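
The two predicates are logically equivalent; a throwaway way to check the timing claim yourself (not part of the PR):

import numpy as np
from timeit import timeit

tensor = np.random.rand(1024, 1024).astype(np.float32)
# Both sides compute the same all-zero test.
assert np.all(tensor == 0) == (not np.any(tensor != 0))
print(timeit(lambda: np.all(tensor == 0), number=100))
print(timeit(lambda: not np.any(tensor != 0), number=100))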


codecov bot commented Jan 23, 2025

Codecov Report

Attention: Patch coverage is 66.98113% with 35 lines in your changes missing coverage. Please review.

Project coverage is 52.24%. Comparing base (ac095f5) to head (a2ac30d).
Report is 12 commits behind head on develop.

Current head a2ac30d differs from pull request most recent head 3407cad

Please upload reports for the commit 3407cad to get more accurate results.

Files with missing lines              Patch %   Missing lines
paddlenlp/mergekit/merge_model.py     59.21%    31 ⚠️
paddlenlp/mergekit/merge_method.py    80.95%    4 ⚠️

❌ Your patch check has failed because the patch coverage (66.98%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
❌ Your project check has failed because the head coverage (52.24%) is below the target coverage (58.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9811      +/-   ##
===========================================
+ Coverage    51.27%   52.24%   +0.96%     
===========================================
  Files          735      730       -5     
  Lines       121550   115724    -5826     
===========================================
- Hits         62329    60458    -1871     
+ Misses       59221    55266    -3955     


-    weights = weights.reshape([-1] + [1] * (len(stacked_tensors.shape) - 1))
-    weighted_sum = paddle.sum(stacked_tensors * weights, axis=0)
-    return weighted_sum
+    tensor_output = paddle.zeros_like(tensor_list[0])
Contributor Author:

Stacking tensors takes longer than processing them directly; the same rationale applies to the later changes.

@@ -29,3 +30,44 @@ def divide_positions(m, n):
        positions.append(positions[-1] + base_value)
    positions.append(m)
    return positions


def divide_lora_key_list(key_list, n, lora_config):
Contributor Author:

Splitting the key_list directly can produce an uneven distribution, leaving different cards running at different speeds, so the partitioning was optimized.
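
One simple way to even out the split is round-robin assignment instead of contiguous slicing; a minimal sketch of the idea (illustrative only, not the PR's divide_lora_key_list):

def divide_key_list_balanced(key_list, n):
    # Deal keys out one at a time so every worker ends up with a
    # similar count even when len(key_list) is not divisible by n.
    buckets = [[] for _ in range(n)]
    for i, key in enumerate(key_list):
        buckets[i % n].append(key)
    return buckets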

        self.mergekit()
        self.copy_file()

    def copy_file(self):
Contributor Author:

Used to copy the tokenizer-related files.
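
A minimal sketch of what such a helper could look like; the file names and source directory here are assumptions, not the PR's code:

import os
import shutil

def copy_file(self):
    # Copy tokenizer-related files into the merged output directory so
    # the result loads as a complete, self-contained model.
    src = self.merge_config.base_model_path  # assumed source checkpoint
    for name in ("tokenizer_config.json", "special_tokens_map.json", "vocab.txt"):
        path = os.path.join(src, name)
        if os.path.exists(path):
            shutil.copy(path, self.merge_config.output_path)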

tensor_mem = int(np.prod(tensor_list[0].shape) * self.numpy_dtype_map[str(tensor_list[0].dtype)]) / (
    1024**3
)
if self.merge_config.tensor_type == "pd" and tensor_mem > self.merge_config.max_tensor_mem:
Contributor Author:

Handles extremely large tensors such as word embeddings, to prevent OOM.
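
One way to cap peak memory is to merge an oversized tensor slice by slice along its first axis; a sketch assuming the merge function is elementwise (the PR's actual strategy may differ):

import paddle

def merge_in_chunks(tensor_list, merge_fn, num_chunks=4):
    # Split each input along axis 0 and merge chunk i across all
    # inputs, so only 1/num_chunks of the tensor is live per step.
    chunked = [paddle.split(t, num_chunks, axis=0) for t in tensor_list]
    parts = [merge_fn([c[i] for c in chunked]) for i in range(num_chunks)]
    return paddle.concat(parts, axis=0)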

    tensor_list = [paddle.Tensor(tensor, zero_copy=True) for tensor in tensor_list]
elif self.merge_config.tensor_type == "np" and is_bf16:
    tensor_list = [
        paddle.Tensor(tensor, zero_copy=True).astype("float32").numpy() for tensor in tensor_list
    ]
Contributor Author:

paddle.Tensor(tensor, zero_copy=True) is faster than paddle.to_tensor in the CPU scenario.
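
A throwaway way to see the difference on your own machine (timings vary; the zero_copy constructor call is the one the PR itself uses):

import numpy as np
import paddle
from timeit import timeit

arr = np.random.rand(4096, 4096).astype("float32")
# zero_copy=True wraps the existing numpy buffer on CPU, while
# paddle.to_tensor copies the data into a new allocation.
print(timeit(lambda: paddle.Tensor(arr, zero_copy=True), number=10))
print(timeit(lambda: paddle.to_tensor(arr), number=10))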

-    if use_gpu:
-        positions = divide_positions(len(key_list), dist.get_world_size())
+    num = self.merge_config.n_process if self.is_cpu else dist.get_world_size()
+    if file_type_list[0] == "safetensors" and len(set(index_list[0]["weight_map"].values())) >= num:
Contributor Author:

If the safetensors checkpoint has multiple shards, splitting by shard count gives a more balanced distribution.
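
A minimal sketch of shard-based splitting; weight_map maps parameter names to shard files in a safetensors index, and the helper name is illustrative:

from collections import defaultdict

def divide_keys_by_shard(weight_map, num_workers):
    # Group keys by the shard file that stores them, then deal whole
    # shards to workers so each worker reads a disjoint set of files.
    shards = defaultdict(list)
    for key, shard_file in weight_map.items():
        shards[shard_file].append(key)
    buckets = [[] for _ in range(num_workers)]
    for i, shard_file in enumerate(sorted(shards)):
        buckets[i % num_workers].extend(shards[shard_file])
    return buckets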

@lugimzzz changed the title from "update mergekit" to "[Mergekit] update & add LoRA merge" on Feb 10, 2025