[BUGFix] Fix MatmulDequantize with FP4 Format #254

LeiWang1999 · 2024-11-29T05:25:23Z

PR Fix ref to #253 , Example:

matmul_config = bitblas.MatmulConfig(
    M=M,  # M dimension
    N=N,  # N dimension
    K=K,  # K dimension
    A_dtype=A_dtype,  # activation A dtype
    W_dtype=W_dtype,  # weight W dtype
    accum_dtype=accum_dtype,  # accumulation dtype
    out_dtype=out_dtype,  # output dtype
    layout="nt",  # matrix layout, "nt" indicates the layout of A is non-transpose and the layout of W is transpose
    propagate_b=False,  # propagate B matrix
    storage_dtype=storage_dtype,
)

matmul = bitblas.Matmul(config=matmul_config, enable_tuning=False)

…_test_fix

LeiWang1999 and others added 27 commits November 10, 2024 16:10

relax transform update

2a0f59c

End2end Fix

b475407

Merge branch 'main' of https://github.com/microsoft/BitBLAS into relax

b0738ba

lint fix

f23a2ec

Merge branch 'main' of https://github.com/microsoft/BitBLAS into relax

79826b6

bf16 test fix

1961bc4

format fix

3aa5d82

lint fix

353e279

test fix

7eb315f

test fix

c1b452f

update commits

fe93429

test fix

ccac456

Merge branch 'main' of https://github.com/microsoft/BitBLAS into bf16…

ddaeba2

…_test_fix

submodule update

4b6fddb

Implement FP4

a8ccb17

lint fix

e2632e6

lint fix

47abe0a

testfix

1b5a336

test fix

02c09eb

lint fix

ec0e00c

lint fix

667b36c

bugfix

2193164

support dp4a and fix test

478a0c7

format fix

c323c79

implement simt

a9559a2

submodule update

32e8141

lint fix

017b0a7

LeiWang1999 merged commit 1569c95 into microsoft:main Nov 29, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUGFix] Fix MatmulDequantize with FP4 Format #254

[BUGFix] Fix MatmulDequantize with FP4 Format #254

LeiWang1999 commented Nov 29, 2024

[BUGFix] Fix MatmulDequantize with FP4 Format #254

[BUGFix] Fix MatmulDequantize with FP4 Format #254

Conversation

LeiWang1999 commented Nov 29, 2024