Enhance and optimize the model ops test generation and script for updating failures reason for the failed models ops test #1198
Conversation
Force-pushed from 7474899 to 3ccbbd1
forge/forge/python_codegen.py (Outdated)
self.indent -= 1
self.wl("")
if "pcc" not in exclude_record_property:
    self.wl("pcc = 0.99")
if "integer_tensor_high_value" not in exclude_record_property:
Let's think about having these hard-coded configurations as part of a separate configuration file, e.g. a specific config object for this tool.
I assume we'll have more of this kind, so it might be good to think about how to manage them. Also, as this PR is quite big, let's create issues for this and tackle it as a separate PR.
Okay Nikola.
Can you give some idea of the config object's functionality, i.e. whether it should be used inside the generated models ops tests, or be a utility used by the write_pytest_method method in ForgeWritter?
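For discussion purposes, a minimal sketch of what such a tool-specific config object could look like, assuming a plain dataclass. The class name and the generic fallback for integer_tensor_high_value are made up here; only the 0.99 PCC default and the excluded "pcc" property come from the diff:

from dataclasses import dataclass, field
from typing import List

@dataclass
class ModelsOpsTestConfig:
    """Single place for defaults emitted into generated models ops tests."""
    pcc: float = 0.99
    integer_tensor_high_value: int = 100  # generic fallback; ops like embedding override it
    exclude_record_properties: List[str] = field(default_factory=lambda: ["pcc"])

# write_pytest_method in ForgeWritter could then take an instance of this config and
# emit config.pcc / skip config.exclude_record_properties instead of literal values.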
@@ -2784,7 +2762,23 @@ def delete_unneeded_outputs(ops, returns):
    # Generate unique op tests based on requested model. Currently only supported
    # for PyTorch framework.
    if compiler_cfg.extract_tvm_unique_ops_config or compiler_cfg.tvm_generate_unique_ops_tests:

        # Commenting the below verification between framework outputs and generated forge module outputs
Hmm, the majority of them are failing during framework vs codegen?
@dgolubovicTT something to think about during initial support for verification after different compile stages. Maybe we should review the TVM side of verification as well during this push, before pushing the next stage of intermediate verification.
FYI, I have tried generating the models ops tests on the latest main for 20 models; out of the 20, 2 models' test cases are failing the framework vs codegen verification.
Failed model tests:
forge/test/models/pytorch/text/codegen/test_codegen.py::test_codegen[Salesforce/codegen-350M-mono]
forge/test/models/pytorch/text/gptneo/test_gptneo.py::test_gptneo_causal_lm[EleutherAI/gpt-neo-125M]
Note: I haven't triggered the models ops test generation pipeline for all the models in forge; I have only tested a sample of 20 text-based models to check the behaviour of the pipeline.
@@ -1011,6 +1033,27 @@ def generate_models_ops_test(unique_operations: UniqueOperations, models_ops_tes
            pytest_input_shapes_dtypes.append((operand_shape, operand_dtype))
        pytest_input_shapes_and_dtypes_list.append(pytest_input_shapes_dtypes)

        # To avoid recording pcc in the record_property pytest fixture, add pcc to the exclude metadata property list
        exclude_record_property = ["pcc"]
Also a candidate for tooling configuration object.
Will include it.
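As a reference point for the discussion, a hedged sketch of what a generated test could look like with this exclusion in place. The metadata keys mirror the ones this PR records; the concrete values and the test body are placeholders:

def test_generated_module(record_property):  # record_property is a built-in pytest fixture
    # Metadata recorded for reporting.
    metadata = {
        "frontend": "tt-forge-fe",              # placeholder values
        "op_name": "embedding",
        "model_name": ["pt_codegen_350M_mono"],
    }
    for property_name, property_value in metadata.items():
        record_property(property_name, property_value)

    # "pcc" is in the exclude list, so it is emitted as a plain value consumed by
    # the verification step rather than recorded as metadata.
    pcc = 0.99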
if op_name == "embedding":
    # Calculate the embedding op indices tensor maximum value based upon the num_embeddings of the weight tensor.
    pytest_metadata["integer_tensor_high_value"] = int(operand_shapes[1][0]) - 1
    exclude_record_property.append("integer_tensor_high_value")
This can probably be generally excluded as part of the configuration object.
Okay.
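To illustrate the bound being recorded here: indices passed to an embedding op must lie in [0, num_embeddings), so the maximum valid index is num_embeddings - 1. A small self-contained example; the helper name and shapes are illustrative, only the num_embeddings - 1 bound comes from the diff:

import torch

def make_embedding_indices(shape, num_embeddings):
    integer_tensor_high_value = num_embeddings - 1
    # torch.randint's upper bound is exclusive, hence the + 1.
    return torch.randint(0, integer_tensor_high_value + 1, shape, dtype=torch.int64)

weight = torch.randn(32000, 768)  # operand_shapes[1] -> (num_embeddings, embedding_dim)
indices = make_embedding_indices((1, 128), num_embeddings=weight.shape[0])
output = torch.nn.functional.embedding(indices, weight)  # every index stays in range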
Force-pushed from 3ccbbd1 to 9c85e4f
Force-pushed from 9c85e4f to 2c4932e
@nvukobratTT FYI,
Force-pushed from 2c4932e to e1f0e2c
…ating failures reason for the failed models ops test
Force-pushed from e1f0e2c to aa85b55
Fixes #1122
Removed the dependency on .pt files in the model ops test generation pipeline, which resolves the disk-space issue on CI and local machines by avoiding storing .pt files.
Fixes #1199
Created a script for updating the failed model ops tests with an xfail marker and failure reason, using the pytest log files saved as artifacts in CI (a rough sketch follows after this list).
Fixes #795
Enabled the option for recording properties such as frontend, op_name, model_name and op_params
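A rough sketch of how such a failure-update step can work, not the actual script from this PR: parse the pytest log artifact for failed test ids and their short failure summaries, then annotate the matching generated tests with xfail markers. The log path and the regex for the short-summary format are assumptions:

import re
from collections import OrderedDict

def collect_failures(pytest_log_path):
    """Map failed test node ids to a one-line failure reason taken from the
    pytest short test summary (lines like 'FAILED <nodeid> - <reason>')."""
    failures = OrderedDict()
    pattern = re.compile(r"^FAILED (\S+) - (.*)$")
    with open(pytest_log_path, encoding="utf-8") as log_file:
        for line in log_file:
            match = pattern.match(line.strip())
            if match:
                failures[match.group(1)] = match.group(2)
    return failures

if __name__ == "__main__":
    for node_id, reason in collect_failures("pytest_ci_artifact.log").items():
        # The real update script would rewrite the matching pytest.param entry as
        #   pytest.param(..., marks=[pytest.mark.xfail(reason=reason)])
        print(f"{node_id} -> xfail(reason={reason!r})")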
TODO:
Need to update other record properties; will work on adding them in a separate PR.
Attached are the generated model ops test folders before and after running the model ops test failure update script.
models_ops_before_xfail_update.zip
models_ops_after_xfail_update.zip
Note:
Added verification between framework outputs and generated forge module outputs before the extract_and_generate_unique_ops_tests function in forge/forge/tvm_to_python.py, which is used for extracting the unique ops configurations and generating the unique ops tests. In the current commit, however, lots of models are failing this verification, so it needs to be re-checked against the latest main.
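For context on what this verification checks, the framework-vs-generated-module comparison boils down to a PCC check between corresponding outputs. A minimal standalone sketch; the helper names are illustrative, only the 0.99 threshold appears in this PR:

import torch

def compute_pcc(expected, actual):
    """Pearson correlation coefficient between two flattened tensors."""
    expected = expected.flatten().to(torch.float64)
    actual = actual.flatten().to(torch.float64)
    return torch.corrcoef(torch.stack([expected, actual]))[0, 1].item()

def outputs_match(framework_outputs, forge_outputs, pcc_threshold=0.99):
    # All corresponding output pairs must correlate above the threshold.
    return all(
        compute_pcc(fw, fo) >= pcc_threshold
        for fw, fo in zip(framework_outputs, forge_outputs)
    )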