- Adding New Operator or Function to ONNX
Operators are the basic building blocks used to define ONNX models. With a rich set of operators, ONNX can describe most DNN and ML models from various frameworks. Functions enable expressing complex operators in terms of more primitive operators. The ONNX specification includes a core set of operators that enable many models. It is a non-goal to add all possible operators, however more operators are added as needed to cover evolving needs.
In this document, we describe the process of accepting a new proposed operator and how to properly submit a new operator as part of ONNX standard. The goal is to improve on what we currently have based on our experience, learning and feedbacks we gathered from the community.
- Decide what to propose
- Submit PR for new operator/function
- Review of PR by Operators SIG
- Merging of PR and inclusion in next ONNX release
In order to propose a new operator/function, the following is needed:
- If the operator can be expressed in terms of other ONNX operators, then it should be a function and not an operator (we have a function in ONNX : MeanVarianceNormalization).
- If the operators can be split to new primitives, propose those primitives instead and make the operator a function.
- Based on a model. This will help us understand the usage and that it solves an actual problem. For the case of the model being private or IP and can't be shared, the operator doesn't belong to the standard and should be implemented as custom OP.
- The operator needs to be implemented by at-least one (well-known) framework. This help us to understand the actual behavior of the operator and its usage.
- Operator signature and behavior:
- If the operator is available in numpy, prefer numpy semantics.
- If the operator is available in more than one frameworks, make sure that your design is general and cover those frameworks.
- Prefer attributes over inputs.
- The operator should not be made more complex than is required by the use-cases. However, the operator should be made as general as possible, as long as it does not make the implementation more complex. This requires carefully balancing generality and complexity. For example, generalizing from 3-D tensors to N-D tensors is straight-forward (implementation-wise) for some operators, but complex for other operators. The choice in such cases will be made based on the complexity of such a generalization.
Once the criteria of proposing new operator/function has been satisfied, you will need to submit a PR for the new operator/function. Here the expectation of what the PR should include. The reviewer is expected to verify the completeness of the PR before signoff.
- Description:
- Write a detailed description about the operator, and its expected behavior. Pretty much, the description should be clear enough to avoid confusion between implementors.
- Add an example in the description to illustrate the usage.
- Add reference to the source of the operator in the corresponding framework in the description (if possible).
- Write the mathematic formula or a pseudocode in the description. The core algorithm needs to be very clear.
- Write a reference implementation in Python, this reference implementation should cover all the expected behavior of the operator. Only in extremely rare case, we will waive this requirement.
- Operator version: check out our versioning doc
- Write unit test, that cover main usage and corner cases.
- The testing examples will be extracted to the doc.
- We also generate binary data for it.
- Example: onnx/backend/test/case/node/abs.py
- Add at least one automatic upgrade test for your operator in onnx/test/automatic_upgrade_test.py using
_test_op_upgrade
. These tests create a given operator at a given opset version (usually the version the operator was introduced in) and test that the version converter is able to convert them to the highest available version. So for a new operator_test_op_upgrade
will not test anything, but as soon as the operator gets updated in a future opset the test will automatically become nontrivial. - Update the documentation and generate the test data.
- Running the script. If you have files under
onnx/backend/test/data/node
which cannot be generated by the scripts fromonnx/backend/test/case/node
, please further usepython onnx/backend/test/cmd_tools.py generate-data --clean
to cleanup the directory and only preserve needed test data. to update the doc and generate the test data.
- Running the script. If you have files under
- Shape Inference function
- Please provide a shape inference function in cases where it is meaningful and applicable.
- In cases where shape inference is not possible, it must have logic to perform rank inference at the very least (adding right amount of dimensions to the output shape)
- Shape inference functions must be accompanied by unit tests (onnx/test/shape_inference_test.py).
- You can refer to the shape inference function for the
TopK
operator while implementing your own function (onnx/defs/math/defs.cc)
PR 1959 is a good example to follow.
The Operators SIG is responsible for the operators/functions in the ONNX specification. The SIG regularly meets and reviews PRs.
At least two sign-off from the Operators SIG contributors.
Once the PR is reviewed and signed off by the Operators SIG, it will be merged. Your new operator/function will be part of the main branch and available to anyone building from source. These are not official releases. ONNX periodically releases official new versions that are a snapshot of the main branch. Your new operator/function will be part of that release.
There are a lot of reasons for removing existing ONNX operator or function, such us being replaced with different operator or can be decomposed by a set of other operators. This document describes the criteria of removing an existing ONNX operator from the standard.
Any operator in ONNX was added because it was required by a model and/or framework. In order to deprecate such an operator we need to do the following.
- Operator can’t be deprecated unless there is a replacement.
- Replacement can be a more general operator that supersedes the old one.
- Or a set of primitive operators that together can implement the same functionality and behavior of the deprecated operator (Function).
- If the deprecated operator can be decomposed by existing operators then it must be converted to a function.
- If replacement isn’t in ONNX standard yet, then add the replacement operator or set of operators first.
- Add a version adapter which turns the operator into its replacement for the version converter. Example: onnx/version_converter/adapters/upsample_9_10.h
- No grace period is needed for deprecated operators.
Function, by definition, is composed of ONNX primitives; however, function could have been accelerated by framework or runtime that support ONNX. So, removing function is not recommended, with the exception of adding another single function which supersede its functionality.
To make sure everyone is aware of the deprecation, the following need to happen:
- Any removed operator or function from ONNX need to be mentioned in the release note.
- Their old documentation needs to be updated to show the new replacement and the mapping between the old to the new.
- Only
def.cc
need to be remove,old.cc
will remain. old.cc
need to be updated with the mapping to the replacement.
- Only
- ONNX checker need to be updated to error with a proper message.
- All removed operators need to be appended at the end of the
operator.md
file.