In quantize.py I found the function `apply_custom_rules_to_quantizer`, which is used in qat.py. Why do we need to find the quantizer pairs, and why do we set the following?

```python
major = bottleneck.cv1.conv._input_quantizer
bottleneck.addop._input0_quantizer = major
bottleneck.addop._input1_quantizer = major
```

Here is the function:

```python
def apply_custom_rules_to_quantizer(model: torch.nn.Module, export_onnx: Callable):
    # apply rules to graph
    export_onnx(model, "quantization-custom-rules-temp.onnx")
    pairs = find_quantizer_pairs("quantization-custom-rules-temp.onnx")
    print(pairs)
    for major, sub in pairs:
        print(f"Rules: {sub} match to {major}")
        # why use the same input_quantizer??
        get_attr_with_path(model, sub)._input_quantizer = get_attr_with_path(model, major)._input_quantizer
    os.remove("quantization-custom-rules-temp.onnx")

    for name, bottleneck in model.named_modules():
        if bottleneck.__class__.__name__ == "Bottleneck":
            if bottleneck.add:
                print(f"Rules: {name}.add match to {name}.cv1")
                major = bottleneck.cv1.conv._input_quantizer
                bottleneck.addop._input0_quantizer = major
                bottleneck.addop._input1_quantizer = major
```
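For context, here is a condensed sketch of the Bottleneck block these rules target (a simplified YOLOv5-style definition, not the exact code from this repo; the real block wraps the convolutions with BN and an activation):

```python
import torch
import torch.nn as nn

class Bottleneck(nn.Module):
    """Simplified YOLOv5-style residual bottleneck (illustrative only)."""
    def __init__(self, c1: int, c2: int, shortcut: bool = True):
        super().__init__()
        self.cv1 = nn.Conv2d(c1, c2, kernel_size=1, stride=1)
        self.cv2 = nn.Conv2d(c2, c2, kernel_size=3, stride=1, padding=1)
        self.add = shortcut and c1 == c2

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Note: the same tensor x feeds both cv1 and the residual add.
        return x + self.cv2(self.cv1(x)) if self.add else self.cv2(self.cv1(x))
```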
Thanks.
If we use https://github.com/NVIDIA-AI-IOT/cuDLA-samples/tree/main/export#option1, the generated model can also run on the GPU. However, if the Q/DQ nodes of these tensors are inconsistent, the QAT model ends up with many useless int8->fp16 and fp16->int8 data conversions, which slow down inference.
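To make this concrete, here is a minimal, self-contained sketch (plain PyTorch, not the repo's code) of why the two inputs of a residual Add should share one quantization scale:

```python
# Illustrative sketch: the scale values below are made up.
import torch

def fake_quant(x: torch.Tensor, scale: float) -> torch.Tensor:
    """int8 fake-quantization: round to [-128, 127] steps, then dequantize."""
    q = torch.clamp(torch.round(x / scale), -128, 127)
    return q * scale

x = torch.randn(4)
y = torch.randn(4)

# Different scales on the two Add inputs: the int8 values represent
# different step sizes and are not directly addable, so the runtime must
# dequantize to fp16, add, and requantize -> extra reformat kernels.
bad = fake_quant(x, 0.1) + fake_quant(y, 0.025)

# Shared scale (what apply_custom_rules_to_quantizer enforces by assigning
# the same _input_quantizer object to both inputs): the int8 values share
# one step size, so the Add can run in int8 end to end.
good = fake_quant(x, 0.1) + fake_quant(y, 0.1)
```

Assigning the same quantizer object to `_input0_quantizer` and `_input1_quantizer`, and tying it to `cv1`'s input quantizer (the skip tensor feeding the Add is the same tensor feeding `cv1`), guarantees that the exported Q/DQ pairs carry identical scales, so TensorRT/DLA can keep the Add in int8 without inserting reformat nodes.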
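If you want to verify the effect after export, a small check like this (the file name is hypothetical, and it only handles the common case where the Q/DQ scales are stored as graph initializers) prints whether the two DequantizeLinear inputs of each Add carry the same scale:

```python
import onnx
from onnx import numpy_helper

model = onnx.load("qat-model.onnx")  # hypothetical file name
graph = model.graph
inits = {t.name: numpy_helper.to_array(t) for t in graph.initializer}
producer = {out: node for node in graph.node for out in node.output}

for node in graph.node:
    if node.op_type != "Add":
        continue
    scales = []
    for inp in node.input:
        dq = producer.get(inp)
        # DequantizeLinear inputs are (x, x_scale, x_zero_point); only
        # scales that are initializers are checked here.
        if dq is not None and dq.op_type == "DequantizeLinear" and dq.input[1] in inits:
            scales.append(float(inits[dq.input[1]]))
    if len(scales) == 2:
        status = "shared" if abs(scales[0] - scales[1]) < 1e-9 else "MISMATCH"
        print(f"{node.name}: {scales} -> {status}")
```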