[Model] Add Qwen2 model. #330

marvin-Yu · 2024-04-19T07:38:46Z

No description provided.

changqi1 · 2024-04-19T09:23:17Z

What is the differences between Qwen1.0 vs Qwen 2.0?

pujiang2018 · 2024-04-20T08:05:43Z

src/models/qwen2.h

+    RmsNorm finalLN;
+};
+
+REGISTER_DECODER(Qwen2LLM, qwen2, float)


What's the error when put such macro to cpp file?

What's the error when put such macro to cpp file?

Putting in a CPP file won't trigger the execution of this macro, it confused me. It instantiates a static object of a registrar.

not the scope of this PR.

changqi1 · 2024-04-22T01:54:55Z

examples/model_config/qwen2-0_5b/config.json

+  "initializer_range": 0.02,
+  "intermediate_size": 2816,
+  "max_position_embeddings": 32768,
+  "max_window_layers": 21,


what is the meaning for max_window_layers?

Duyi-Wang

copy right's year

Duyi-Wang · 2024-04-22T01:48:46Z

benchmark/run.sh

@@ -1,5 +1,5 @@
 #!/bin/bash
-# set -x


Duyi-Wang · 2024-04-22T01:50:35Z

examples/model_config/qwen2-14b/model.safetensors.index.json

unnecessary

Duyi-Wang · 2024-04-22T01:52:08Z

examples/model_config/qwen2-4b/model.safetensors.index.json

Unnecessary

Duyi-Wang · 2024-04-22T01:52:21Z

examples/model_config/qwen2-72b/model.safetensors.index.json

Unnecessary

Duyi-Wang · 2024-04-22T01:52:34Z

examples/model_config/qwen2-7b/model.safetensors.index.json

Unnecessary

changqi1 · 2024-04-22T01:56:27Z

What is the differences between Qwen1.0 vs Qwen 2.0?

If Qwen2.0 have no difference w/ LLama2. Do we need to inherit from LLama2?

marvin-Yu requested review from changqi1, pujiang2018 and Duyi-Wang April 19, 2024 07:38

pujiang2018 reviewed Apr 20, 2024

View reviewed changes

changqi1 reviewed Apr 22, 2024

View reviewed changes

Duyi-Wang reviewed Apr 22, 2024

View reviewed changes

marvin-Yu force-pushed the model/add_qwen2 branch from 1cb752f to e368bf6 Compare April 23, 2024 03:45

[Model] Add Qwen2 model.

89cf4dc

marvin-Yu force-pushed the model/add_qwen2 branch from e368bf6 to 89cf4dc Compare April 23, 2024 05:30

marvin-Yu requested a review from Duyi-Wang April 23, 2024 05:31

changqi1 approved these changes Apr 23, 2024

View reviewed changes

pujiang2018 merged commit d37178d into main Apr 23, 2024
1 check passed

marvin-Yu mentioned this pull request Apr 23, 2024

qwen2支持吗 #300

Closed

Duyi-Wang deleted the model/add_qwen2 branch April 23, 2024 06:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add Qwen2 model. #330

[Model] Add Qwen2 model. #330

marvin-Yu commented Apr 19, 2024

changqi1 commented Apr 19, 2024

pujiang2018 Apr 20, 2024

Duyi-Wang Apr 22, 2024 •

edited

Loading

marvin-Yu Apr 23, 2024

changqi1 Apr 22, 2024

marvin-Yu Apr 23, 2024

Duyi-Wang left a comment

Duyi-Wang Apr 22, 2024

marvin-Yu Apr 23, 2024

Duyi-Wang Apr 22, 2024

marvin-Yu Apr 23, 2024

Duyi-Wang Apr 22, 2024

marvin-Yu Apr 23, 2024

Duyi-Wang Apr 22, 2024

marvin-Yu Apr 23, 2024

Duyi-Wang Apr 22, 2024

marvin-Yu Apr 23, 2024

changqi1 commented Apr 22, 2024

[Model] Add Qwen2 model. #330

[Model] Add Qwen2 model. #330

Conversation

marvin-Yu commented Apr 19, 2024

changqi1 commented Apr 19, 2024

Choose a reason for hiding this comment

Duyi-Wang Apr 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Duyi-Wang left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

changqi1 commented Apr 22, 2024

Duyi-Wang Apr 22, 2024 •

edited

Loading