LinearInt8 layer for inference of int8-quantized LLMs and Arm intrinsics #5007
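For context, a minimal sketch (not ncnn's actual implementation) of the kind of computation a LinearInt8 layer performs: an int8 fully-connected forward pass with per-output-channel weight scales and a single input scale, dequantizing the int32 accumulator back to float. The function and parameter names below are illustrative assumptions, not ncnn API. On Arm, the inner int8 dot product is the part that would typically be vectorized with NEON intrinsics (e.g. smlal or sdot).

```cpp
// Sketch of an int8 linear (fully-connected) forward pass.
// Assumptions: weights quantized per output channel, input quantized with one scale.
#include <cstdint>
#include <vector>

std::vector<float> linear_int8(const std::vector<int8_t>& weights,       // out_features x in_features, row-major
                               const std::vector<float>& weight_scales,  // one scale per output channel
                               const std::vector<int8_t>& input,         // quantized activations, in_features
                               float input_scale,
                               int in_features,
                               int out_features)
{
    std::vector<float> output(out_features);
    for (int j = 0; j < out_features; j++)
    {
        // accumulate int8 x int8 products in int32 to avoid overflow
        int32_t acc = 0;
        const int8_t* w = &weights[j * in_features];
        for (int k = 0; k < in_features; k++)
            acc += (int32_t)w[k] * (int32_t)input[k];

        // dequantize: real value ~= acc * weight_scale * input_scale
        output[j] = acc * weight_scales[j] * input_scale;
    }
    return output;
}
```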
Conversation
Codecov Report
Attention: Patch coverage is

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5007       +/-   ##
===========================================
- Coverage   94.72%   89.52%    -5.20%
===========================================
  Files         772      303      -469
  Lines      228777    89191   -139586
===========================================
- Hits       216705    79848   -136857
+ Misses      12072     9343     -2729

☔ View full report in Codecov by Sentry.
oh no, it didn't work
Closing following @nihui's instructions.
Please provide feedback on how the patch could be improved.