This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[Optimization] Text-generation support qwen #513

Merged

VincyZhang merged 43 commits into main from wangchang/qwen

Oct 23, 2023

Contributor

changwangss commented Oct 20, 2023 •

edited

Loading

Type of Change

Qwen/Qwen-7B, Qwen/Qwen-14B， Qwen/Qwen-7B-Chat， Qwen/Qwen-14B-Chat pass，

optimum: huggingface/optimum#1470

optimum-intel: huggingface/optimum-intel#458

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

zhewang1-intc and others added 15 commits

October 19, 2023 13:22


          [CPP Graph] Opt qbits dequant (#465)

f04d0fd


          use INC 2.3.1

4adacf1

Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>


          use INC 2.3.1 (#500)

d962f58

Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>


          [RUNTIME] Enabing streaming llm for Runtime (#501)

66238a5

* Support StreamingLLM on CPU

Signed-off-by: zhenwei-intel <zhenwei.liu@intel.com>


          Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

ea112e7

…ransformers


          Reduce the UT evaluation time (#498)

51485c6

Signed-off-by: changwangss <chang1.wang@intel.com>
Signed-off-by: Wenxin Zhang <wenxin.zhang@intel.com>
Signed-off-by: Wang, Chang <chang1.wang@intel.com>
Co-authored-by: Wenxin Zhang <wenxin.zhang@intel.com>


          Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

ff4abb8

…ransformers


          Minor fix (#507)

9bdc764


          support qwen

6bd2b60

Signed-off-by: changwangss <chang1.wang@intel.com>


          Fix ChatGLM2 model loading issue (#510)

ea720c2

* Fix ChatGLM2 model loading issue

Signed-off-by: lvliang-intel <liang1.lv@intel.com>


          Update README.md

02523e9

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Remove OneDNN env setint for BF16 inference (#509)

0cff05a

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Co-authored-by: VincyZhang <wenxin.zhang@intel.com>


          remove invalid code

1bee379

Signed-off-by: changwangss <chang1.wang@intel.com>


          support Avx2 (#493)

ea69f9a

* support Memcpy2D

* support gelu fusion

---------

Co-authored-by: luoyu-intel <yu.luo@intel.com>


          add neuralchat ut for audio util (#466)

f7d0d97

changwangss requested a review from PenghuiCheng as a code owner

October 20, 2023 05:59

xin3he and others added 10 commits

October 20, 2023 16:18


          reduce ut time consumption (#499)

b9155ef

Signed-off-by: Xin He <xin3.he@intel.com>


          update python api readme (#504)

5f4175a


          Add docker setup session for neuralchat finetuning sample (#496)

a8873ea

* Update README.md to new added docker setup session

Signed-off-by: Louie Tsai <louie.tsai@intel.com>


          Update README.md

22fe7ad

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update run_generation.py

53b1b61

Signed-off-by: Wang, Chang <chang1.wang@intel.com>


          Update README.md

b38241d

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md

1d91245

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md

18d9c57

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md

f98d72a

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md

0f6aee6

Signed-off-by: Haihao Shen <haihao.shen@intel.com>

hshen14 approved these changes

View reviewed changes

louie-tsai and others added 3 commits

October 21, 2023 18:07


          Update README.md for fast token issue (#515)

a8db98f

Signed-off-by: Louie Tsai <louie.tsai@intel.com>


          Fix typo in README.md (#516)

52717e4

convertion -> conversion

Signed-off-by: Ikko Eltociear Ashimine <eltociear@gmail.com>


          Update README.md

3cf68ee

Signed-off-by: Haihao Shen <haihao.shen@intel.com>

hshen14 and others added 12 commits

October 21, 2023 18:50


          Update README.md

7fed478

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md

dc81e4c

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          improve Avx2 (#511)

dcfbcfd


          Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

a615905

…ransformers


          Revert "update python api readme (#504)"

61993cc

This reverts commit 5f4175a.


          Merge branch 'main' into wangchang/qwen


          Update README.md

5b01e95

Signed-off-by: Haihao Shen <haihao.shen@intel.com>


          Update README.md (#519)

bfb6a25

Signed-off-by: ayushrakesh <115995339+ayushrakesh@users.noreply.github.com>


          docs: fix typos in question answering of pytorch (#520)

0e0a9eb

Signed-off-by: Surav Shrestha <suravshresth@gmail.com>


          fixed typos (#522)

ec29f2f


          Updated README.md (#517)

1357a02

Signed-off-by: Aditya Aryaman Das <128703909+alienishi@users.noreply.github.com>


          Merge branch 'main' into wangchang/qwen

b3e4b25

VincyZhang force-pushed the main branch from 1357a02 to f04d0fd Compare

October 23, 2023 03:40

VincyZhang requested review from VincyZhang, lvliang-intel, zhenwei-intel and airMeng as code owners

October 23, 2023 03:40

VincyZhang force-pushed the main branch from f04d0fd to 1ab6ce3 Compare

October 23, 2023 03:47

VincyZhang requested a review from a32543254 as a code owner

October 23, 2023 03:47


          Merge branch 'main' into wangchang/qwen

572ecbf

Contributor

VincyZhang commented Oct 23, 2023

Unit Test failed with lines coverage decrease -0.064%
Unit Test failed with branches coverage decrease -0.158%


          Merge branch 'main' into wangchang/qwen

2e77b6b

VincyZhang approved these changes

View reviewed changes

Contributor Author

changwangss commented Oct 23, 2023 •

edited

Loading

Unit Test failed with lines coverage decrease -0.064% Unit Test failed with branches coverage decrease -0.158%

yes，it is as expected. qwen doesn't have tiny model to add ut. After qwen is officially included by transformers, the newly added code in generate dummy past-kv func can be deleted, the coverage will improve.
PR is ready, please merge. @VincyZhang

hshen14 changed the title ~~Text-generation support qwen~~ [WIP] Text-generation support qwen

VincyZhang changed the title ~~[WIP] Text-generation support qwen~~ [Optimize] Text-generation support qwen

VincyZhang changed the title ~~[Optimize] Text-generation support qwen~~ [Optimization] Text-generation support qwen

VincyZhang merged commit 8f41d49 into main

15 of 16 checks passed

VincyZhang deleted the wangchang/qwen branch

October 23, 2023 14:50

VincyZhang pushed a commit that referenced this pull request


          [Optimization] Text-generation support qwen (#513)

f78d114

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Reviewers

VincyZhang VincyZhang approved these changes

hshen14 hshen14 approved these changes

PenghuiCheng Awaiting requested review from PenghuiCheng

lvliang-intel Awaiting requested review from lvliang-intel

zhenwei-intel Awaiting requested review from zhenwei-intel

airMeng Awaiting requested review from airMeng

a32543254 Awaiting requested review from a32543254

Labels

None yet