pack_int32_to_int4 #29

YangNuoCheng · 2022-01-16T11:50:31Z

In 'HAWQ-main/tvm_benchmark/hawq_utils_resnet50.py' ,we pack 8 'int4' number to 1 'int32' number, so we got int4 speedup.
Can we pack 16 'int2' to 1 'int32', to got int2 speedup?

zachzzc · 2022-01-18T18:01:49Z

Yes. The purpose of the packing is to handle with memory movement with a datatype that is supported in the target hardware (int8, int32 in cpu/gpu). If you want to further reduce the precision to int2, in cpu/gpu you also need to pack them into a byte-addressable data type (int8, int32) before the memory movement

YangNuoCheng · 2022-01-19T01:37:58Z

Yes. The purpose of the packing is to handle with memory movement with a datatype that is supported in the target hardware (int8, int32 in cpu/gpu). If you want to further reduce the precision to int2, in cpu/gpu you also need to pack them into a byte-addressable data type (int8, int32) before the memory movement

Thank you for your reply!
Actually I am reproducing your great project, and I Try to reply it in my research.
Thanks a lot!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pack_int32_to_int4 #29

pack_int32_to_int4 #29

YangNuoCheng commented Jan 16, 2022

zachzzc commented Jan 18, 2022

YangNuoCheng commented Jan 19, 2022 •

edited

Loading

pack_int32_to_int4 #29

pack_int32_to_int4 #29

Comments

YangNuoCheng commented Jan 16, 2022

zachzzc commented Jan 18, 2022

YangNuoCheng commented Jan 19, 2022 • edited Loading

YangNuoCheng commented Jan 19, 2022 •

edited

Loading