Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[Optimization] Text-generation support qwen #513

Merged
merged 43 commits into from
Oct 23, 2023
Merged

[Optimization] Text-generation support qwen #513

merged 43 commits into from
Oct 23, 2023

Conversation

changwangss
Copy link
Contributor

@changwangss changwangss commented Oct 20, 2023

Type of Change

Qwen/Qwen-7B, Qwen/Qwen-14B, Qwen/Qwen-7B-Chat, Qwen/Qwen-14B-Chat pass,

optimum: huggingface/optimum#1470

optimum-intel: huggingface/optimum-intel#458

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

zhewang1-intc and others added 15 commits October 19, 2023 13:22
Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>
Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>
* Support StreamingLLM on CPU

Signed-off-by: zhenwei-intel <zhenwei.liu@intel.com>
Signed-off-by: changwangss <chang1.wang@intel.com>
Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>
Signed-off-by: Wang, Chang <chang1.wang@intel.com>
Co-authored-by: Wenxin Zhang <wenxin.zhang@intel.com>
Signed-off-by: changwangss <chang1.wang@intel.com>
* Fix ChatGLM2 model loading issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: VincyZhang <wenxin.zhang@intel.com>
Signed-off-by: changwangss <chang1.wang@intel.com>
* support Memcpy2D

* support gelu fusion

---------

Co-authored-by: luoyu-intel <yu.luo@intel.com>
xin3he and others added 10 commits October 20, 2023 16:18
Signed-off-by: Xin He <xin3.he@intel.com>
* Update README.md to new added docker setup session

Signed-off-by: Louie Tsai <louie.tsai@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Wang, Chang <chang1.wang@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
louie-tsai and others added 3 commits October 21, 2023 18:07
Signed-off-by: Louie Tsai <louie.tsai@intel.com>
convertion -> conversion

Signed-off-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
hshen14 and others added 12 commits October 21, 2023 18:50
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: Haihao Shen <haihao.shen@intel.com>
Signed-off-by: ayushrakesh <115995339+ayushrakesh@users.noreply.github.com>
Signed-off-by: Surav Shrestha <suravshresth@gmail.com>
Signed-off-by: Aditya Aryaman Das <128703909+alienishi@users.noreply.github.com>
@VincyZhang
Copy link
Contributor

Unit Test failed with lines coverage decrease -0.064%
Unit Test failed with branches coverage decrease -0.158%

@changwangss
Copy link
Contributor Author

changwangss commented Oct 23, 2023

Unit Test failed with lines coverage decrease -0.064% Unit Test failed with branches coverage decrease -0.158%

yes,it is as expected. qwen doesn't have tiny model to add ut. After qwen is officially included by transformers, the newly added code in generate dummy past-kv func can be deleted, the coverage will improve.
PR is ready, please merge. @VincyZhang

@hshen14 hshen14 changed the title Text-generation support qwen [WIP] Text-generation support qwen Oct 23, 2023
@VincyZhang VincyZhang changed the title [WIP] Text-generation support qwen [Optimize] Text-generation support qwen Oct 23, 2023
@VincyZhang VincyZhang changed the title [Optimize] Text-generation support qwen [Optimization] Text-generation support qwen Oct 23, 2023
@VincyZhang VincyZhang merged commit 8f41d49 into main Oct 23, 2023
15 of 16 checks passed
@VincyZhang VincyZhang deleted the wangchang/qwen branch October 23, 2023 14:50
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.