-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix TVM compilation with USE_LLVM=OFF #194
Commits on Jan 13, 2021
-
[µTVM] Fix two warnings when deprecated forms are used (apache#7269)
* [µTVM] Specify loader for yaml.load Specify the loader to be used by yaml.load as the current form used without specifying explicitly a loader is deprecated since PyYAML 5.1 and will throw a noisy warning. For details, please see: https://github.com/yaml/pyyaml/wiki/PyYAML-yaml.load(input)-Deprecation Signed-off-by: Gustavo Romero <gustavo.romero@linaro.org> * [µTVM] Avoid using tvm.target.create Avoid using tvm.target.create as it's deprecated and use tvm.target.Target directly instead. Signed-off-by: Gustavo Romero <gustavo.romero@linaro.org>
Configuration menu - View commit details
-
Copy full SHA for 35dabd6 - Browse repository at this point
Copy the full SHA 35dabd6View commit details -
Adding aten::unsqueeze_ to PT Frontend (apache#7231)
* Added Ops * Regular * Remove copy * Remove copy * Tests * Black Co-authored-by: Ubuntu <ubuntu@ip-172-31-27-149.us-east-2.compute.internal> Co-authored-by: Ubuntu <ubuntu@ip-172-31-19-34.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 259652b - Browse repository at this point
Copy the full SHA 259652bView commit details
Commits on Jan 14, 2021
-
Configuration menu - View commit details
-
Copy full SHA for af716e5 - Browse repository at this point
Copy the full SHA af716e5View commit details -
[FIX] Remove leftovers from check_correctness (apache#7272)
* [FIX] Remove leftovers from check_correctness * remove unused numpy import
Configuration menu - View commit details
-
Copy full SHA for c11959d - Browse repository at this point
Copy the full SHA c11959dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8d3c0e7 - Browse repository at this point
Copy the full SHA 8d3c0e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7f4aa24 - Browse repository at this point
Copy the full SHA 7f4aa24View commit details -
Configuration menu - View commit details
-
Copy full SHA for 51a2d66 - Browse repository at this point
Copy the full SHA 51a2d66View commit details -
[PatternLang][Bugfix] Ensure CallNode attrs are not undefined before …
…checking (apache#7278) * Correct handling of call node attrs to handle non-operator calls (attrs may be undefined) * Linting fix
Configuration menu - View commit details
-
Copy full SHA for 1677bb2 - Browse repository at this point
Copy the full SHA 1677bb2View commit details -
Configuration menu - View commit details
-
Copy full SHA for bb6c26b - Browse repository at this point
Copy the full SHA bb6c26bView commit details -
Add MicroTVM support for the STM32F746 Discovery board (apache#7225)
* Add MicroTVM support for the STM32F746 Discovery board Signed-off-by: Tom Gall <tom.gall@linaro.org> * Add reference to the discovery board in the docs Signed-off-by: Tom Gall <tom.gall@linaro.org>
Configuration menu - View commit details
-
Copy full SHA for ac29624 - Browse repository at this point
Copy the full SHA ac29624View commit details -
Configuration menu - View commit details
-
Copy full SHA for f503d82 - Browse repository at this point
Copy the full SHA f503d82View commit details -
[PatternLang] Add If pattern (apache#7282)
* Add if pattern commit 1ee052fd494a5bdd881c242c3ea0c95cf2a613e5 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 22:19:17 2020 +0900 add comment commit c846a6999e9c9e48fbc019780e705a990f46cb22 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 21:14:20 2020 +0900 max_out_size rewrite added to the test commit 2c7c7fbd0e6563aba694e7fb6baa7bda8e4fadca Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 20:57:55 2020 +0900 max_out_size rewrite working commit 319e930acb8162c1ec4a5d4fb71d134580a68f13 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 20:43:16 2020 +0900 refactor dyn strided slice pattern commit fb6917b703440748800bde624bc20efaf5798b8a Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 11:21:33 2020 +0900 update NMS pattern following frontend change commit 255a98f1da8f300d4fe417cce3587c0d71e38ed3 Author: Masahiro Masuda <masahi129@gmail.com> Date: Thu Dec 24 05:19:31 2020 +0900 add some comment to explain the pattern commit 52cea1cc2bff533ca60acfc2416477fc8b058428 Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 08:35:14 2020 +0900 revert tutorial change commit d3e0e0d7e2427c40067d6ad2680ec5b3f0076223 Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 08:02:29 2020 +0900 test fixed by setting force_surpress=False commit 2fa1a574f932001be2d8f601338a342dab92f79c Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 07:22:32 2020 +0900 fixed coord_start commit 6ba88f27dec1bdb0b0ba746c268591a59264088e Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 06:50:46 2020 +0900 add doc commit 8d386b6a1c92ce4fe3349ff20e320199a1b5b310 Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 05:27:26 2020 +0900 updated tutorial commit 3206b49ecfdd874e0ff8feb0fa586c4c4282f705 Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 05:04:44 2020 +0900 update object detection test to add rewrite commit 74bebb2f4376aeb67d8c4aad395f9f2661fe6b3e Author: Masahiro Masuda <masahi129@gmail.com> Date: Wed Dec 23 05:02:15 2020 +0900 add a pattern to rewrite nms to batched nms commit f410e6dde0ed949b90312c5a7ddbb6c234f9acc1 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 22:20:16 2020 +0900 add comment commit f1e078b0724bd22e7be0a812055e1c7c650d94da Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 26 19:54:22 2020 +0900 Add if pattern * add doc * add test * doc formatting * cpplint fix
Configuration menu - View commit details
-
Copy full SHA for d7a9a7c - Browse repository at this point
Copy the full SHA d7a9a7cView commit details
Commits on Jan 15, 2021
-
[Frontend][Tensorflow] Sparse_Dense Op CSR scheduling issue resolved …
…for Cuda & X86 (apache#7148) * [Frontend][Tensorflow] Sparse_Dense Op CSR scheduling issue resolved for both cuda & x86 * [1] Review comments handled * [2] Review comments handled * [3] Review comments handled
ANSHUMAN TRIPATHY authoredJan 15, 2021 Configuration menu - View commit details
-
Copy full SHA for c947463 - Browse repository at this point
Copy the full SHA c947463View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4c5c086 - Browse repository at this point
Copy the full SHA 4c5c086View commit details -
[µTVM] Add ST STM32F746 disco board to tflite tutorial script (apache…
…#7254) Currently tutorial script 'micro_tflite.py' assumes that all boards with target STM32F746 are Nucleo boards. As a consequence once that target is selected the script automatically defaults to the Nucleo board. However, the STM32F746 is also used on Discovery Kit boards (aka disco) which are quite similar but have some differences, so Nucleo config and final image don't work on the disco boards. That commit adds a way to select a different dev board and adds comments accordingly, informing how to use the script with STM32F746 disco boards. Signed-off-by: Gustavo Romero <gustavo.romero@linaro.org>
Configuration menu - View commit details
-
Copy full SHA for c3f50ff - Browse repository at this point
Copy the full SHA c3f50ffView commit details -
Bring back numbered lists to TVM docs. (apache#7290)
* Upstream fix in tlc-pack/tlcpack-sphinx-addon@995178d
Configuration menu - View commit details
-
Copy full SHA for 4f1f591 - Browse repository at this point
Copy the full SHA 4f1f591View commit details -
[VM] Per-input, data dependence specification for shape func (apache#…
…7210) * made TShapeDataDependant array * add stub * dyn strided slice working * reshape also working * remove log * works on maskrcnn * lint fix * fix cpp test * remove stale pop back * add more doc * dependant -> dependent * remove redundant check * remove data_dependent_
Configuration menu - View commit details
-
Copy full SHA for 2992e9b - Browse repository at this point
Copy the full SHA 2992e9bView commit details -
[uTVM] Initial BYOC support with c-source module (apache#6950)
This commit mainly introduces a byoc c-source module example to uTVM. Moreover, it carries certain modifications to the example codegen_c external module generator code to generate utvm friendly c-source. Change-Id: I09f3a42017d518dd5b6c89e3fe0a0332b80088b0
Configuration menu - View commit details
-
Copy full SHA for b52267e - Browse repository at this point
Copy the full SHA b52267eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 637c9da - Browse repository at this point
Copy the full SHA 637c9daView commit details -
Change const to used dtype if it is passed in (apache#7285)
* Add fix and unit test for const autoconvert dtype. * formatting * Address review comment, casting input value to int32 * Fix failing test * Augment unit test
Configuration menu - View commit details
-
Copy full SHA for ccccac2 - Browse repository at this point
Copy the full SHA ccccac2View commit details
Commits on Jan 16, 2021
-
[TEST] Fix test_topi_batch_matmul_tensorcore.py:test_batch_matmul req…
…uirement (apache#7294) * this test current sets a requirement to "uses_gpu", which causes it to fail in cpu-only machine * this patch changes it to be "requires_tensorcore", as per discussion on issue apache#7277
Configuration menu - View commit details
-
Copy full SHA for 3f15d06 - Browse repository at this point
Copy the full SHA 3f15d06View commit details -
Configuration menu - View commit details
-
Copy full SHA for 052ad3d - Browse repository at this point
Copy the full SHA 052ad3dView commit details
Commits on Jan 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 09bb60a - Browse repository at this point
Copy the full SHA 09bb60aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6fb1019 - Browse repository at this point
Copy the full SHA 6fb1019View commit details
Commits on Jan 19, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 5e92eed - Browse repository at this point
Copy the full SHA 5e92eedView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d95105 - Browse repository at this point
Copy the full SHA 5d95105View commit details -
[TIR][REFACTOR] ForNode introduce thread binding and remove legacy fi…
…eld (apache#7306) [TIR][REFACTOR] ForNode update - Remove deprecated device_api. - Add ThreadBinding for_type. - Add additional annotations. More style consistency refactor to make the ForNode to be consistent with rest of the codebase. - ForType => ForKind - Add constant prefix k to enum consts per Google C style - Introduce ForKind to the python side.
Configuration menu - View commit details
-
Copy full SHA for 7340c02 - Browse repository at this point
Copy the full SHA 7340c02View commit details -
[Relay][Frontend][Onnx] Compare against onnxruntime more consistently…
… during testing (apache#7300) Co-authored-by: Josh Fromm <jwfromm@uw.edu>
Configuration menu - View commit details
-
Copy full SHA for f91b51d - Browse repository at this point
Copy the full SHA f91b51dView commit details -
[TOPI] Minor perf improvement for GPU scatter (apache#7233)
* improve scatter 4d init * do not launch sorting based scatter for small input * do not use hard coded num threads * separate sort based implementation * register scatter as autotvm task * add missing import * fix strategy * add dedicated schedule and dummy flop * add test tuning script * try adding dummy knob * skip random_fill when a tuning workload is from scatter This reverts commit 1fed883. * cleanup memcpy ir * remove scatter tuning script * make sure zero init arguments * add comment on why skip random init for scatter * restore ctx sync Co-authored-by: masa <masa@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 2290cc0 - Browse repository at this point
Copy the full SHA 2290cc0View commit details
Commits on Jan 20, 2021
-
[TFLite] Added ability to infer shapes for arguments (apache#7293)
Added an ability to infer argument shapes if shapes are not present in TFLite files. The set of networks on which the patch was tested is internal to Arm. Any help with creating unit tests would be appreciated.
Configuration menu - View commit details
-
Copy full SHA for f8c55db - Browse repository at this point
Copy the full SHA f8c55dbView commit details -
[TOPI] Make cumsum IR reusable, add thrust scan (apache#7303)
* import changes from scan branch commit cf0d4fd Author: Masahiro Masuda <masahi129@gmail.com> Date: Fri Dec 25 10:12:01 2020 +0900 get valid count test working commit eb142d3 Author: Masahiro Masuda <masahi129@gmail.com> Date: Fri Dec 25 07:22:00 2020 +0900 integrate new cumsum change commit f89684d Author: Masahiro Masuda <masahi129@gmail.com> Date: Fri Dec 25 06:56:46 2020 +0900 remove ceil_div from nms commit a2ad4de Author: Masahiro Masuda <masahi129@gmail.com> Date: Sun Dec 20 20:36:34 2020 +0900 add api for returning reduction from ex scan output commit b7f4ef7 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sun Dec 20 19:49:07 2020 +0900 move ceil_div to utils commit a9a57e3 Author: Masahiro Masuda <masahi129@gmail.com> Date: Sun Dec 20 19:38:15 2020 +0900 rename prefix_scan.py to scan.py commit 03ed43f Author: Masahiro Masuda <masahi129@gmail.com> Date: Sat Dec 19 06:12:55 2020 +0900 surpress cpplint commit abceac9 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 20:36:24 2020 +0900 support more data type commit 3e7d1f8 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 20:09:51 2020 +0900 1d thrust scan working commit ac13b40 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 19:49:25 2020 +0900 adding thrust scan support commit 65634e8 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 19:01:11 2020 +0900 add thrust scan python stub commit 9876c90 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 20:55:14 2020 +0900 introduce prefix_scan.py and move scan ir in nms.py commit 667bdd3 Author: masa <masa@pop-os.localdomain> Date: Fri Dec 18 15:06:18 2020 +0900 make the scan loop exclusive commit 480787b Author: mbrookhart <mbrookhart@octoml.ai> Date: Thu Dec 17 10:01:11 2020 -0700 Parallelize cumsum in get_valid_counts * fix for 1d scan * rename * cast to out dtype * do not run return reduction for inclusive scan * remove another ceil_div definition * adding scan test * add scheduling for scan op, fixed scan 1d test * pylint fix * add doc string * add more thrust scan test * add dynamic get valid count test, including empty size tensor * fix hard coded gpu targets for cpu only env * try retunring early if scan_size is 0 * another change for empty tensor and thrust path Co-authored-by: masa <masa@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 62f251b - Browse repository at this point
Copy the full SHA 62f251bView commit details -
[BYOC][ACL] removed ACL 20.05 limitations (apache#7251)
Removed checks for padding in according with changes in ACL 20.11 *ACL stands for "Compute Library for the Arm® Architecture"
Configuration menu - View commit details
-
Copy full SHA for 969b77a - Browse repository at this point
Copy the full SHA 969b77aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 727345e - Browse repository at this point
Copy the full SHA 727345eView commit details
Commits on Jan 21, 2021
-
[TFLite] Strided slice handling of shrink_axis_mask improved (apache#…
…6998) * [TFLite] Strided slice handlig of shrink_axis_mask improved 1. Added removal of dimensions if result is a scalar to mimic TensorFlow behaviour. E.g.: tf.strided_slice([1,2,3], [0], [1], [1], shrink_axis_mask=0) <tf.Tensor: shape=(1,), dtype=int32, numpy=array([1], dtype=int32)> tf.strided_slice([[[1,2,3],[4,5,6],[7,8,9]]], [0, 0, 0], [3, 3, 3], [1, 1, 1], shrink_axis_mask=7) <tf.Tensor: shape=(), dtype=int32, numpy=1> 2. Added extra check to assert_allclose to check shape equalities as np.testing.assert_allclose() does not distinguish between cases like: np.testing.assert_allclose(1, np.array(1)) np.testing.assert_allclose(1, np.array([1])) np.testing.assert_allclose(np.array(1), np.array([1])) * unit tests fixed
Configuration menu - View commit details
-
Copy full SHA for e8ab607 - Browse repository at this point
Copy the full SHA e8ab607View commit details -
[TOPI] Rewrite GPU argwhere using exclusive scan (apache#7314)
* use ex scan to write argwhere * add doc
Configuration menu - View commit details
-
Copy full SHA for f829403 - Browse repository at this point
Copy the full SHA f829403View commit details -
[COMMUNITY] @jwfromm -> Committer (apache#7316)
* [COMMUNITY] @jwfromm -> Committer * add areas
Configuration menu - View commit details
-
Copy full SHA for 20e03bc - Browse repository at this point
Copy the full SHA 20e03bcView commit details -
[µTVM] Add TVMPlatformGenerateRandom, a non-cryptographic random numb…
…er generator. (apache#7266) * [uTVM] Add TVMPlatformGenerateRandom, and use with Session nonce. * This change is preparation to support autotuning in microTVM. It also cleans up a loose end in the microTVM RPC server implementation. * Randomness is needed in two places of the CRT: 1. to initialize the Session nonce, which provides a more robust way to detect reboots and ensure that messages are not confused across them. 2. to fill input tensors when timing AutoTVM operators (once AutoTVM support lands in the next PR). * This change adds TVMPlatformGenerateRandom, a platform function for generating non-cryptographic random data, to service those needs.
Configuration menu - View commit details
-
Copy full SHA for 8524b28 - Browse repository at this point
Copy the full SHA 8524b28View commit details -
Made tensorflow IsNan actually work (apache#7320)
* Made tensorflow IsNan actually work IsNan was added to tensorflow.rst in fa1b859 but this commit makes IsNan actually work * Added test case for tensorflow.is_nan
Configuration menu - View commit details
-
Copy full SHA for fc9e264 - Browse repository at this point
Copy the full SHA fc9e264View commit details
Commits on Jan 22, 2021
-
Fix an issue with dynamic functions overwritting call arg types (apac…
…he#7295) * Fix an issue with dynamic functions overwritting call arg types * fix a bug for un-annotated inputs * normalize names in TypeSolver::Unifier * fix name normalization
Matthew Brookhart authoredJan 22, 2021 Configuration menu - View commit details
-
Copy full SHA for 7b6a1a7 - Browse repository at this point
Copy the full SHA 7b6a1a7View commit details -
add a shape function and dynamic test for round (apache#7324)
Matthew Brookhart authoredJan 22, 2021 Configuration menu - View commit details
-
Copy full SHA for 17ae44d - Browse repository at this point
Copy the full SHA 17ae44dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 790344c - Browse repository at this point
Copy the full SHA 790344cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6787d74 - Browse repository at this point
Copy the full SHA 6787d74View commit details -
[BYOC][Verilator] add support to dynamically load hardware library (a…
…pache#7286) * add files * remove import * remove os import * reorder header * fix header order cpplint * lint fix
Configuration menu - View commit details
-
Copy full SHA for af9d1d2 - Browse repository at this point
Copy the full SHA af9d1d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3ec67f0 - Browse repository at this point
Copy the full SHA 3ec67f0View commit details
Commits on Jan 23, 2021
-
[PatternLang] Add a relay LetPattern (apache#7332)
* Add a relay LetPattern * fix If copy Co-authored-by: Cody Yu <comaniac0422@gmail.com> * fix If copy Co-authored-by: Cody Yu <comaniac0422@gmail.com> Co-authored-by: Cody Yu <comaniac0422@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for e889def - Browse repository at this point
Copy the full SHA e889defView commit details -
Configuration menu - View commit details
-
Copy full SHA for 218048e - Browse repository at this point
Copy the full SHA 218048eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 42eb55d - Browse repository at this point
Copy the full SHA 42eb55dView commit details
Commits on Jan 25, 2021
-
[Tutorial] Autoscheduler on ARM devices (apache#7326)
* arm tuning tutorial * adjustment to get RPC working * fix lint * fix target * integrate Leandros comments * dont request remote in CI * use API from auto_scheduler, not autoTVM and updated comments * make ci-runnable * fix the formatting * address Zhaos comments * full run stats * taking Zhaos comments into consideration
Configuration menu - View commit details
-
Copy full SHA for 5d33491 - Browse repository at this point
Copy the full SHA 5d33491View commit details -
[AutoScheduler] Separate shapes from DAG hash and enable schedule sha…
…ring (apache#7317) * [AutoScheduler] Separate shapes from DAG hash and enable schedule sharing * Update CI logs * lint * fix registry * add message; fix layout rewrite mismatch * update message * support other formats
Configuration menu - View commit details
-
Copy full SHA for e6d5318 - Browse repository at this point
Copy the full SHA e6d5318View commit details -
[FIX] Infer input shape in sparse_dense_padded's alter_op if one does…
… not exist (apache#7308) * [FIX] Infer input shape in sparse_dense_padded's alter_op if one does not exist If there are multiple alter_ops in a model, the first alteration does not run type inference for the subsequent ones. In this case, we don't have the shape information, so we run the inferencer manually. * add todo
Configuration menu - View commit details
-
Copy full SHA for f3b852d - Browse repository at this point
Copy the full SHA f3b852dView commit details -
Fix warning showed with GCC10 (apache#7336)
catching polymorphic type 'struct dmlc::Error' by value
Configuration menu - View commit details
-
Copy full SHA for da446af - Browse repository at this point
Copy the full SHA da446afView commit details -
[Relay][Training] Add more gradients (apache#7323)
* add more gradients * add documentation
Configuration menu - View commit details
-
Copy full SHA for 6f75cff - Browse repository at this point
Copy the full SHA 6f75cffView commit details
Commits on Jan 26, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 3d13809 - Browse repository at this point
Copy the full SHA 3d13809View commit details -
[CMake] use wrong flag name (apache#7341)
Signed-off-by: windclarion <windclarion@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for c53030f - Browse repository at this point
Copy the full SHA c53030fView commit details -
Add resource_handle to TVM_DLL_EXPORT_TYPED_FUNC. (apache#7338)
* In apache#5921, resource_handle was added as a parameter to TVMBackendPackedCFunc, which is the typedef for functions called by LibraryModule's function lookup. * It appears TVM_DLL_EXPORT_TYPED_FUNC was overlooked in that PR, although there don't seem to be any runtime affects known so far. However, making this definition proper to avoid any compiler warnings/debug tool problems. * See also https://discuss.tvm.apache.org/t/rfc-misra-c-changes-for-rpc-support/7098/5
Configuration menu - View commit details
-
Copy full SHA for ab8bc0a - Browse repository at this point
Copy the full SHA ab8bc0aView commit details -
[Relay, TOPI] Add numpy style cumsum op (apache#7334)
* Add cumsum relay/topi op * relay tests working * add torch frontend converter * fix for importing detr * fix bad merge * begin cuda cumsum * support non innermost axis * support rank higher than 3 * making binop parameter * fix overflow issue in thrust scan * generic binop parameter working * relay test working * fixed for bool input * remove pytorch change * fix pylint * doc update * Update python/tvm/topi/cumsum.py Co-authored-by: Tristan Konolige <tristan.konolige@gmail.com> * Update tests/python/relay/test_op_level3.py Co-authored-by: Tristan Konolige <tristan.konolige@gmail.com> * add example outputs * add supported input and output dtype in thrust log * adding more loop var names * fix cpplint * fix missing check for the cuda target in nms thrust sort * parallelize cpu cumsum * making binop argument tir function * update doc for binop * doc update Co-authored-by: Tristan Konolige <tristan.konolige@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1e0d356 - Browse repository at this point
Copy the full SHA 1e0d356View commit details
Commits on Jan 27, 2021
-
Add resource_handle to both TVM_DLL_EXPORT_TYPED_FUNC and TVM_DLL_EXP…
…ORT_PACKED_FUNC macros in packed_func.h. This is a patch PR for apache#7388. (apache#7343) Co-authored-by: JC Li <jinli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for eeec538 - Browse repository at this point
Copy the full SHA eeec538View commit details -
[FIX] Don't add $TVM_HOME/.. to the include path when compiling code. (…
…apache#7342) If the user has a dmlc-core directory next to the tvm directory, this dmlc-core directory will be incorrectly used when compiling files with cc.py.
Configuration menu - View commit details
-
Copy full SHA for 38fa420 - Browse repository at this point
Copy the full SHA 38fa420View commit details -
[PRNG] Add check to PRNG to make sure that unsigned integer arithmeti…
…c is wrapping (apache#7287) * [PRNG] Add check to PRNG to make sure that unsigned integer arithmetic is wrapping * Add threefry_test_wrapping: a manual test for wrapping unsigned arithmetic. * fix test to actually run on the target * formatting * lint
Configuration menu - View commit details
-
Copy full SHA for eae21b0 - Browse repository at this point
Copy the full SHA eae21b0View commit details -
[Torch] Various updates for PyTorch frontend (apache#7348)
* add conversion for detr * remove explicit broadcast_to before batched matmul * use take with wrap mode * add test for transformer and negative indices * add sort and argsort * add logical_and * support masked_select * add gpu targets to masked_select test * improve sort conversion
Configuration menu - View commit details
-
Copy full SHA for 59e0a4a - Browse repository at this point
Copy the full SHA 59e0a4aView commit details -
[AutoScheduler] Enable schedule sharing in dispatch context (apache#7344
Configuration menu - View commit details
-
Copy full SHA for fd39122 - Browse repository at this point
Copy the full SHA fd39122View commit details -
[Torch] More graph rewrites for Faster RCNN / MaskRCNN (apache#7346)
* add post nms topk to max_out_size rewrite * add argsort conversion * scatter pattern first cut * matching seems to working * dup matching fixed * add converter * conversion seems working * add reshape, use take * remove pytorch argsort converter * update test * add doc
Configuration menu - View commit details
-
Copy full SHA for 4006bde - Browse repository at this point
Copy the full SHA 4006bdeView commit details
Commits on Jan 28, 2021
-
[Autodiff] Deterministic gradient compute (apache#7321)
* fix unstable compute * fix * fix * lint * sort linear equation * sort inequalities * fix * fix find * lint * fix find * lint
Configuration menu - View commit details
-
Copy full SHA for 00257f3 - Browse repository at this point
Copy the full SHA 00257f3View commit details -
Configuration menu - View commit details
-
Copy full SHA for d8efe70 - Browse repository at this point
Copy the full SHA d8efe70View commit details -
[Relay][Frontend][Onnx] Robustify Loop Importer (apache#7353)
* Add test for array loop. * Fixed scalar issue. * Formatting. * Fix injective schedule for dynamic shapes.
Configuration menu - View commit details
-
Copy full SHA for 913abe0 - Browse repository at this point
Copy the full SHA 913abe0View commit details -
If an expression has two branches, and the pattern ignores one with a…
… wildcard, allow grouping via dominator analysis (apache#7355)
Matthew Brookhart authoredJan 28, 2021 Configuration menu - View commit details
-
Copy full SHA for 02fefbc - Browse repository at this point
Copy the full SHA 02fefbcView commit details -
Fold If when the condition is Constant (apache#7354)
Matthew Brookhart authoredJan 28, 2021 Configuration menu - View commit details
-
Copy full SHA for 8b84e33 - Browse repository at this point
Copy the full SHA 8b84e33View commit details -
Update uTVM code to work with the nRF5340DK dev board. (apache#7331)
* Various fixes to get nRF5340 working. Not yet there. * nRF5340 test runs locally. * Various fixes to get nRF5340 working. Not yet there. * nRF5340 test runs locally. * Add `nrfjprog --recover` for nRF5340DK * Cleanup. * Remove debugging code. * Revert submodule update. * Remove debugging code. * Fix comment. * Remove -keys argument. * Adding some debugging code * Fix passing west command to ZephyrFlasher. * Various fixes to get nRF5340 working. Not yet there. * nRF5340 test runs locally. * Add `nrfjprog --recover` for nRF5340DK * Cleanup. * Various fixes to get nRF5340 working. Not yet there. * nRF5340 test runs locally. * Remove debugging code. * Fix comment. * Remove -keys argument. * Fix merge.
Configuration menu - View commit details
-
Copy full SHA for cbc035f - Browse repository at this point
Copy the full SHA cbc035fView commit details -
[Frontend][Tensorflow] Sparse dense matmul adjoint option added (apac…
…he#7267) * [Frontend][Tensorflow] Sparse dense matmul adjoint option added * [1] Review comments handled * [2] Review comments handled * [3] Review comments handled
ANSHUMAN TRIPATHY authoredJan 28, 2021 Configuration menu - View commit details
-
Copy full SHA for dda8f5d - Browse repository at this point
Copy the full SHA dda8f5dView commit details -
[Relay][PatternLang] Bug fix of rewrite func attr (apache#7358)
When using pattern with attr of functions, such attrs mostly does not exist for op node. Therefore, hasattr check has to be done for op nodes. Change-Id: Ia313ab34be95ccc793c32fd8e5e5ef566b78685b
Configuration menu - View commit details
-
Copy full SHA for 67acad3 - Browse repository at this point
Copy the full SHA 67acad3View commit details -
[RUNTIME] Improve error messages for TypedPackedFunc (apache#7152)
* [RUNTIME] Improve error messages for TypedPackedFunc - TypedPackedFunc now prints the function name when the incorrect number of arguments is passed. - TypedPackedFunc now prints the function name and which argument when an argument cannot be converted to the correct type. * check argument conversion by template deducing argument types * switch from template approach to TVMMovableArgValueWithContext * move passes back into cc files * remove error message prefixes * Remove TVM_ICHECK_TYPE_CODE. Rename name to optional_name. * revert changes to module pass for later PR * reverted too much * documentation * formatting * more docs * unify error message language. TypedPackedFunc contrustor that does not take a name * Update include/tvm/runtime/packed_func.h Co-authored-by: Junru Shao <junrushao1994@gmail.com> Co-authored-by: Junru Shao <junrushao1994@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for f17cba7 - Browse repository at this point
Copy the full SHA f17cba7View commit details -
[Relay] Type Relation Fixes (apache#7362)
* fix an error in the dynamic Full Type Relation * Add Diagnostic Errors to Broadcast Type Relations
Matthew Brookhart authoredJan 28, 2021 Configuration menu - View commit details
-
Copy full SHA for b8ad146 - Browse repository at this point
Copy the full SHA b8ad146View commit details
Commits on Jan 29, 2021
-
Remove MemoryPlan from VM passes (apache#7361)
Matthew Brookhart authoredJan 29, 2021 Configuration menu - View commit details
-
Copy full SHA for ef032b3 - Browse repository at this point
Copy the full SHA ef032b3View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7275f9 - Browse repository at this point
Copy the full SHA f7275f9View commit details -
[Relay][Frontend[Onnx] Add testing for output datatypes and fix relat…
…ed bugs. (apache#7364) * Add testing for datatypes and fix related bugs. * Fix lint issue in onnx.
Configuration menu - View commit details
-
Copy full SHA for f7862e7 - Browse repository at this point
Copy the full SHA f7862e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4f414fd - Browse repository at this point
Copy the full SHA 4f414fdView commit details -
[BYOC][Verilator] change runtime registry function name (apache#7351)
* use lowercase for verilator runtime registry function * lint fix * update comment
Configuration menu - View commit details
-
Copy full SHA for 4142128 - Browse repository at this point
Copy the full SHA 4142128View commit details -
Configuration menu - View commit details
-
Copy full SHA for 02c764c - Browse repository at this point
Copy the full SHA 02c764cView commit details -
[Refactor][VM] Port memory_alloc to c++ (apache#7369)
* Port memory_alloc to c++ * remove memory python pass
Configuration menu - View commit details
-
Copy full SHA for 44a071a - Browse repository at this point
Copy the full SHA 44a071aView commit details -
[CUDA][PASS]Legalize tensorcore (apache#7147)
* add pad_to_tensorcore & legalize for dense/bmm/conv2d * fix pad & slice * fix comments * fix comments * resolve conflict * resolve conflict * support only fp16 * add tests/python/relay/test_pass_legalize_tensorcore.py * add tests for legalize tensorcore * fix pylint * fix pylint * code format * use_gpu test only; fix conv2d_alter_op * fix tests params * revert transform fix
Configuration menu - View commit details
-
Copy full SHA for 3734d5f - Browse repository at this point
Copy the full SHA 3734d5fView commit details
Commits on Jan 31, 2021
-
Configuration menu - View commit details
-
Copy full SHA for e892619 - Browse repository at this point
Copy the full SHA e892619View commit details
Commits on Feb 1, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 96b0981 - Browse repository at this point
Copy the full SHA 96b0981View commit details -
fix duplicated symbol bug in external codegen (apache#7383)
Co-authored-by: 袁航剑 <yuanhangjian@bytedance.com>
Configuration menu - View commit details
-
Copy full SHA for 2365c7e - Browse repository at this point
Copy the full SHA 2365c7eView commit details -
[Parser] Fix tokenizing inf (apache#7370)
* fix tokenizing inf * use ParseNumber to parse inf, handle -inf * fix neg handling * fixed multi negation * refactor * use while loop * simplyfing * fix lint * simpler implementation per altan's suggestion * disable flaky test
Configuration menu - View commit details
-
Copy full SHA for 0d303b4 - Browse repository at this point
Copy the full SHA 0d303b4View commit details -
Configuration menu - View commit details
-
Copy full SHA for f7e05c3 - Browse repository at this point
Copy the full SHA f7e05c3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0bd259a - Browse repository at this point
Copy the full SHA 0bd259aView commit details -
Refactor Dynamic to Static (apache#7368)
* DynamicToStatic Refactor * fix test * add regression tests * cleanup * skip PrepareInput if the arg is already a constant * fix an issue with type inference with global functions
Matthew Brookhart authoredFeb 1, 2021 Configuration menu - View commit details
-
Copy full SHA for 3635945 - Browse repository at this point
Copy the full SHA 3635945View commit details
Commits on Feb 2, 2021
-
[Relay][Passes] Iterative A-normal Traversals (apache#7374)
* [WIP][Relay][Passes] non-recursive a-normal traversals * fix clang warning * Refactor ANormal Iterative traversal into a higher order function utility with lambdas * refactor missed pass * add explict use of to lamdbas
Matthew Brookhart authoredFeb 2, 2021 Configuration menu - View commit details
-
Copy full SHA for 0ab9c95 - Browse repository at this point
Copy the full SHA 0ab9c95View commit details -
Configuration menu - View commit details
-
Copy full SHA for de0ab4c - Browse repository at this point
Copy the full SHA de0ab4cView commit details -
[FFI] Improve error messages when array/map types do not match in fun…
…ction calls (apache#7330) * [FIX] Improve error messages when array/map types do not match in function calls * missed some places for renaming * Rename Mismatch to CheckAndGetMismatch. Add Check back in. Use Optional::defined. * Optional<String> -> String * formatting * move ObjectTypeChecker template specializations into where thier respective classes are defined so they will always be found correctly
Configuration menu - View commit details
-
Copy full SHA for da42924 - Browse repository at this point
Copy the full SHA da42924View commit details
Commits on Feb 3, 2021
-
[TOPI] Add einsum operator (apache#6370)
* [TOPI] Einsum * Fix tuple * fix oshape * * test * * Fix lint * * Remove useless define * * Move to einsum header file * * Fix single value situation * * Fix CamelASE * * Print stride * * Fix single input bug * * fix lint * * Fix lint and add comments * * create test einsum * * Fix lint * * Fix comments
Configuration menu - View commit details
-
Copy full SHA for 618ef9e - Browse repository at this point
Copy the full SHA 618ef9eView commit details
Commits on Feb 4, 2021
-
[TFLite] Added check for dynamic range quantization (apache#7114)
* [TFLite] Added check for dynamic range quantization Added check to prevent optimized with "dynamic range quantization" tflite files to be loaded as the optimization is not fully supported. https://www.tensorflow.org/lite/performance/post_training_quantization#dynamic_range_quantization * linter * linter * unit test fix
Configuration menu - View commit details
-
Copy full SHA for 2e8133d - Browse repository at this point
Copy the full SHA 2e8133dView commit details -
Generate requirements.txt from Python spec (apache#7289)
* Generate requirements.txt from Python spec. * add tests, collect actual requirements (first cut). * add tornado and cloudpickle * add xgboost * add xgboost version restriction * cleanup and prepare for merge * black format * add type annotations and docstrings * remove example requirements.txt * fix setup.py extras_require * use typing. classes for type annotations, python 2 compatible :) * fix python2 typing.Pattern * retrigger CI * address comaniac comments * retrigger ci
Configuration menu - View commit details
-
Copy full SHA for 1de98be - Browse repository at this point
Copy the full SHA 1de98beView commit details -
[Bugfix][AutoScheduler] Fail to register ComputeDAG when deserializin…
…g tasks (apache#7395) * [Bugfix][AutoScheduler] Fail to register ComputeDAG when deserialize tasks * fix test * trigger ci
Configuration menu - View commit details
-
Copy full SHA for 30c110c - Browse repository at this point
Copy the full SHA 30c110cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9aec474 - Browse repository at this point
Copy the full SHA 9aec474View commit details
Commits on Feb 5, 2021
-
Configuration menu - View commit details
-
Copy full SHA for f1b9663 - Browse repository at this point
Copy the full SHA f1b9663View commit details -
Support negative pad values (apache#7375)
* Support negative pad values * Update test_op_level2.py * Update pad.cc * Update test_op_level2.py * PR Comments * Update pad.cc * Address PR Comments * CI Error * CI Error * CI Error Co-authored-by: Ubuntu <ubuntu@ip-172-31-28-115.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for c118b08 - Browse repository at this point
Copy the full SHA c118b08View commit details -
Fix Bug in Bilinear Interpolation and Add Deform Conv to PT FrontEnd (a…
…pache#7397) * Fix Bug in Bilinear Interpolation * Add NHWC Tests * clean * Fix Bug and Add Deformable Conv PyTorch for completeness * Add Tensor Utils * Remove stuff * Include vector * PR Comments * Empty Commit for CI Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 38c9eb1 - Browse repository at this point
Copy the full SHA 38c9eb1View commit details -
[AutoScheduler] Support early_stopping per task (apache#7377)
* [AutoScheduler] Support early_stopping per task * address comment * fix test * Update python/tvm/auto_scheduler/task_scheduler.py * Update python/tvm/auto_scheduler/task_scheduler.py * trigger ci * trigger ci
Configuration menu - View commit details
-
Copy full SHA for d8313d0 - Browse repository at this point
Copy the full SHA d8313d0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 132cf6b - Browse repository at this point
Copy the full SHA 132cf6bView commit details -
[Relay][Frontend][Onnx] Refactor where importer to support dynamic sh…
…apes. (apache#7394) * Refactor where importer to support dynamic shapes. * Add a test for dynamic where.
Configuration menu - View commit details
-
Copy full SHA for 91e07e1 - Browse repository at this point
Copy the full SHA 91e07e1View commit details -
Add cuda tags and unit test (apache#7410)
* Add cuda tags and unit test * Add missing space * Remove extra indent * Modify macro def position * Fix clang format * Fix clang format for set_config
Configuration menu - View commit details
-
Copy full SHA for 4df530d - Browse repository at this point
Copy the full SHA 4df530dView commit details
Commits on Feb 6, 2021
-
check for dynamic rank before accessing value (apache#7414)
Matthew Brookhart authoredFeb 6, 2021 Configuration menu - View commit details
-
Copy full SHA for fc08430 - Browse repository at this point
Copy the full SHA fc08430View commit details -
[VM] Minor refactor for C++ memory alloc (apache#7413)
* started moving things to header * directly call InvokeTVMOp * done all memory op * also refactor AllocTensor * declare Prod * remove cached func for Add, Multiply, Divide * lint fix * revert test change * remove tensor.h and declare Prod in pattern_utils.h
Configuration menu - View commit details
-
Copy full SHA for 1f846f0 - Browse repository at this point
Copy the full SHA 1f846f0View commit details
Commits on Feb 7, 2021
-
Fix AutoScheduler for anaconda python (apache#7387)
In case of non cpython flavour of python, the task passed to measure process should be serialized using pickle approach. The task includes workload which is a list of Tensors. The list should be serialized and deserialized as an atomic object.
Configuration menu - View commit details
-
Copy full SHA for 9daf3fe - Browse repository at this point
Copy the full SHA 9daf3feView commit details -
Fix compilation when Arm FP16 extensions are enabled (apache#7386)
Fixes incorrect number of template parameters in call to sort() Signed-off-by: Matthew Bentham <matthew.bentham@arm.com>
Configuration menu - View commit details
-
Copy full SHA for 33f30af - Browse repository at this point
Copy the full SHA 33f30afView commit details
Commits on Feb 8, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 5103bb6 - Browse repository at this point
Copy the full SHA 5103bb6View commit details -
[µTVM] Add VMWare to Reference VM instructions (apache#7221)
* support vmware_desktop provider for microTVM reference VM * update tutorial * python format * try to fix sphinx warning * fix sphinx warning * retrigger CI
Configuration menu - View commit details
-
Copy full SHA for 0e7e2dc - Browse repository at this point
Copy the full SHA 0e7e2dcView commit details -
Generate JUnitXML from pytest (apache#7407)
* Generate JUnitXML from pytest. * address tkonolige comments
Configuration menu - View commit details
-
Copy full SHA for c789a29 - Browse repository at this point
Copy the full SHA c789a29View commit details
Commits on Feb 9, 2021
-
[FIX,CMAKE] Only compile runtime files once (apache#7417)
* [FIX,CMAKE] Only compile runtime files once * copy defines to tvm_runtime_objs
Configuration menu - View commit details
-
Copy full SHA for 79b6ef7 - Browse repository at this point
Copy the full SHA 79b6ef7View commit details -
[TVMC] Allow manual shape specification in tvmc (apache#7366)
* add ability to optionally overide tvm shapes * add help documentation for --shapes * improve documentation * reformat test_compiler using black * Incorporate feedback from ekalda for better pytorch support and testing. * address feedback * switch input shape syntax to be more pythonic * add commentary * reformat common.py * fix lint issue * format common.py with black * torch/pytorch test hiccup * add -s to setup-pytest-env.sh for clearer error msgs Co-authored-by: Jocelyn <jocelyn@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 2b8d113 - Browse repository at this point
Copy the full SHA 2b8d113View commit details -
[AutoScheduler] Add sampling to dispatcher (apache#7376)
* [AutoScheduler] Add sampling to dispatcher * address comment * make measurment configurable
Configuration menu - View commit details
-
Copy full SHA for 0716c2a - Browse repository at this point
Copy the full SHA 0716c2aView commit details -
[ONNX] Add CumSum operator to ONNX frontend (apache#7391)
* [ONNX] Add CumSum operator to ONNX frontend * Fix lint and add attributes to CumSum * Fix CumSum test * Add support exclusive attribute * Add support reverse attribute * Fix clang-format * Fix lint * Move reverse calculation to ONNX frontend and add exclusive to GPU * Add test for int type
Configuration menu - View commit details
-
Copy full SHA for 2999d03 - Browse repository at this point
Copy the full SHA 2999d03View commit details -
[Relay][Topi][CPU] Dense with weight transform (apache#7404)
* Add CPU dense weight transform * Fix format * Fix python format * Fix pylint * Minor fix * Add test * Do not need to infer layout for dense * Fix test * Rename dense_pack * Fix test * Fix lint * Fix dynamic shape dense * Fix lint * Fix autotvm task extraction test * Disable AlterOpLayout in micro_tflite.py tutorial
Configuration menu - View commit details
-
Copy full SHA for 68b7e71 - Browse repository at this point
Copy the full SHA 68b7e71View commit details
Commits on Feb 10, 2021
-
[FIX,CMAKE] Only set Clang flags for C++ files (apache#7424)
Clang flags were set for all file types, causing nvcc to error out.
Configuration menu - View commit details
-
Copy full SHA for 3863e09 - Browse repository at this point
Copy the full SHA 3863e09View commit details -
TRT Dynamic Reshape Fix (apache#7412)
* Dynamic Reshape * Changes * Add test cases * Add test cases * PR COmments * CI Error * EmptyCommitCIError Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 9175c6d - Browse repository at this point
Copy the full SHA 9175c6dView commit details -
Simplify full broadcast (apache#7423)
* convert argwhere(full(const)) to reshape(arange()) * Add IsWildcard syntatic sugar * add a simplify expression to fold full into broadcast ops * Allow constant folding of full-like ops after SimplifyExpr * fix a bug with the Attr Pattern matching * remove skip_list
Matthew Brookhart authoredFeb 10, 2021 Configuration menu - View commit details
-
Copy full SHA for 12c6b70 - Browse repository at this point
Copy the full SHA 12c6b70View commit details
Commits on Feb 11, 2021
-
Configuration menu - View commit details
-
Copy full SHA for b7808fb - Browse repository at this point
Copy the full SHA b7808fbView commit details -
Configuration menu - View commit details
-
Copy full SHA for d05d75d - Browse repository at this point
Copy the full SHA d05d75dView commit details -
[BYOC][TRT] Fix small bug preventing TRT runtime compilation for vers…
…ions < 6 (apache#7372) * Fix small bug preventing TRT runtime compilation for versions < 6 * Trigger ci
Trevor Morris authoredFeb 11, 2021 Configuration menu - View commit details
-
Copy full SHA for 6b58321 - Browse repository at this point
Copy the full SHA 6b58321View commit details -
Make the TVM targets list available in Python (apache#7427)
* Make the TVM targets list available in Python Change-Id: I8602723fe57aaf32cee5392d4387a637115dd363 * Rename the APIs to get target kinds Change-Id: I2e6e32e025e3614a148a30a31e5a2c52fd3563cc
Nicola Lancellotti authoredFeb 11, 2021 Configuration menu - View commit details
-
Copy full SHA for 8b9005f - Browse repository at this point
Copy the full SHA 8b9005fView commit details -
Replace type punning with memcpy. (apache#7415)
The type punning in the existing code is undefined behaviour in C. In particular, the existing code fails when running on Arm Cortex-M devices. On Cortex-M, accessing a uint64_t that is not 8-byte aligned generates a hard fault. Change-Id: I2aecaa220e581af7c91a8bc7886499d70e2aa6f2
Configuration menu - View commit details
-
Copy full SHA for c52c176 - Browse repository at this point
Copy the full SHA c52c176View commit details -
Fix double compile of runtime sources for TRT, ACL (apache#7436)
Trevor Morris authoredFeb 11, 2021 Configuration menu - View commit details
-
Copy full SHA for 33829b3 - Browse repository at this point
Copy the full SHA 33829b3View commit details
Commits on Feb 12, 2021
-
[TIR][Printer] Fix SelectNode TIRTextPrinter bracket mismatch (apache…
…#7405) Co-authored-by: honghua.cao <honghua.cao@streamcomputing.com>
Configuration menu - View commit details
-
Copy full SHA for d5851dd - Browse repository at this point
Copy the full SHA d5851ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for c7c54de - Browse repository at this point
Copy the full SHA c7c54deView commit details -
Configuration menu - View commit details
-
Copy full SHA for d769727 - Browse repository at this point
Copy the full SHA d769727View commit details -
[AutoScheduler] Fix distill record (apache#7439)
* [AutoScheduler] Fix distill record * update comments
Configuration menu - View commit details
-
Copy full SHA for e426c87 - Browse repository at this point
Copy the full SHA e426c87View commit details -
[Relay][Op][Bug] Fix missing return in scatter_nd cuda strategy (apac…
…he#7447) * fix missing return in scatter_nd cuda strategy * add Relay test for scatter_nd, fix documentation
Configuration menu - View commit details
-
Copy full SHA for b36bdf6 - Browse repository at this point
Copy the full SHA b36bdf6View commit details -
Make keras reshape less restrictive (apache#7446)
Trevor Morris authoredFeb 12, 2021 Configuration menu - View commit details
-
Copy full SHA for a1260cc - Browse repository at this point
Copy the full SHA a1260ccView commit details -
[µTVM] Use standalone_crt build tree for all µTVM builds (apache#7333)
* Build microTVM using standalone_crt in build tree. * black format * pylint * try stashing entire standalone_crt in hopes it will not upset jenkins * Put standalone_crt in correct Jenkinsfile stash bundle * include build prefix * switch to python script for expanding globs * revert attempt to use globs in pack_libs, switch to building standalone_crt * properly revert pack_lib changes * fix typo * retrigger CI * revert pyproject.toml * update Jenkinsfile approach to use task_ci_setup.sh
Configuration menu - View commit details
-
Copy full SHA for b8a8340 - Browse repository at this point
Copy the full SHA b8a8340View commit details
Commits on Feb 13, 2021
-
[ONNX] Make the ONNX Importer More Static (apache#7429)
* Construct static Ops if inputs are Constant * Expose FoldConstant as a function in addition to the pass * refactor onnx importer to do more static imports by constant folding fix pylint * fix test regressions * fix style, two bugs * pipe freeze_params through sub_graphs when importing loops and control flow
Matthew Brookhart authoredFeb 13, 2021 Configuration menu - View commit details
-
Copy full SHA for 4e211a7 - Browse repository at this point
Copy the full SHA 4e211a7View commit details -
[VM] Move param bind to OptimizeModule (apache#7451)
* [VM] Move param bind to OptimizeModule * add test to verify the number of free vars after opt * remove const from OptimizeModule
Configuration menu - View commit details
-
Copy full SHA for 0aa90b0 - Browse repository at this point
Copy the full SHA 0aa90b0View commit details -
[Frontend][MXNet] Add support for MXNet GroupNorm (apache#7409)
* Add support for MXNet GroupNorm * Fix python lint * Fix lint
Trevor Morris authoredFeb 13, 2021 Configuration menu - View commit details
-
Copy full SHA for 2e6e7dc - Browse repository at this point
Copy the full SHA 2e6e7dcView commit details
Commits on Feb 14, 2021
-
update stm32mp1 arm_cpu target configuration (apache#7443)
Add the -mcpu information to complete the picture. Signed-off-by: Vincent ABRIOU <vincent.abriou@st.com>
Configuration menu - View commit details
-
Copy full SHA for 2af3ab1 - Browse repository at this point
Copy the full SHA 2af3ab1View commit details
Commits on Feb 15, 2021
-
[FRONTEND][TFLITE] get input tensor information from graph (apache#7400)
* [FRONTEND][TFLITE] get input tensor information from graph * remove bare-except * fix lint * delete empty line * comment change * move some of the tflite frontend code from tvmc to tflite.py * update shape and dtype when user provided them * remove unused var. pass user provided shape_dict * remove duplicate code
eric authoredFeb 15, 2021 Configuration menu - View commit details
-
Copy full SHA for 6187e1c - Browse repository at this point
Copy the full SHA 6187e1cView commit details -
[µTVM] Print .elf statistics for a model runtime built with Zephyr (a…
…pache#7449) * [µTVM] Print .elf statistics for a model runtime built with Zephyr Currently there isn't any statistics about the used resources by a model runtime built with Zephyr, making it difficult to have any idea about, for instance, the amount of memory taken by the operations necessary to run the model. Since Zephyr's SDK already exposes the statistics about various memory regions on linking by passing '--print-memory-usage' to the linker, it's possible to use it to have an idea about the amount of memory used by the model and how much memory is left on the device. That commit adds a simple method to extract the memory region information out of the build output and then uses it to show memory usage statistics for various memory regions when Zephyr finishes building the image to be flashed to the target device. Signed-off-by: Gustavo Romero <gustavo.romero@linaro.org> * v2: Fixes accordingly to Andrew review - Catch StopIteration in case of a weird output or no additional lines after the last memory region - Use of _LOG.info() instead of plain print() for better control over the output by the main script - Set log level in micro_tflite.py script as an example on how to get the new memory usage statistics and also because currently that's the main script used to test microTVM + Zephyr's SDK - Improve statistics header Signed-off-by: Gustavo Romero <gustavo.romero@linaro.org> * Fix build It seems build system is using Python < 3.7, so 'text' argument is not present as an alias for 'universal_newlines'. To satisfy it use old 'universal_newlines' argument which is available prior to Python 3.7. * Fix build Avoid exception anti-pattern when catching StopIteration * Retrigger CI
Configuration menu - View commit details
-
Copy full SHA for 32c4402 - Browse repository at this point
Copy the full SHA 32c4402View commit details -
Add IdentityN operator for TF Frontend (apache#7452)
* Add frontend code and tests * Add Frontend Code Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 0ebc820 - Browse repository at this point
Copy the full SHA 0ebc820View commit details -
docker/bash.sh: lookup docker image in Jenkinsfile. (apache#7453)
* This PR makes it possible to type `docker/bash.sh ci_cpu tests/scripts/task_config_build_cpu.sh` and the same version of ci_cpu as is used in Jenkins will be used to run the command.
Configuration menu - View commit details
-
Copy full SHA for 2264206 - Browse repository at this point
Copy the full SHA 2264206View commit details
Commits on Feb 16, 2021
-
[BYOC][Verilator] Refactor Verilator runtime (apache#7406)
* new experiment * save * refactor * refactor library * add profiler * refactor * refactor * add docs * update comment * add deallocator
Configuration menu - View commit details
-
Copy full SHA for fc48514 - Browse repository at this point
Copy the full SHA fc48514View commit details -
Configuration menu - View commit details
-
Copy full SHA for 413692e - Browse repository at this point
Copy the full SHA 413692eView commit details
Commits on Feb 17, 2021
-
[Relay][Topi] Add max mode to ROI align (apache#7440)
* ROI align with max on cpu passes * onnx test file was not running gpu testsgit status! * all passing * fix lint * lint again * lint * lint * typo * remove import * fix import * add inf, -inf to hybridscript and respond to comments * shorten code * make atol lower
Configuration menu - View commit details
-
Copy full SHA for 564aae0 - Browse repository at this point
Copy the full SHA 564aae0View commit details -
[ROCM] Add Thrust support (apache#7458)
* enable rocm thrust, confrimed to work on sort and scan * add rocm argsort strategy * Abort if CXX is not hipcc * add more strategy * add missing import * fix lint * show supported data type in err msg * try remove rocthrust * add missing include for rocthrust * more minor change Co-authored-by: Masahiro Masuda <masahi@129@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for e57e644 - Browse repository at this point
Copy the full SHA e57e644View commit details -
SparseFillEmptyRows Op (apache#7442)
* Initial Commit * Fix formats * Remove comments * Black * THreeops * Add Frontend Code * Add Default Value to feed dict * Add Frontend Code * New test Cases and new code to handle them * Add Python Implementation' ' * Remove stuff * Remove unused imports * Pylint * Pylint * PyLint Shape Func * Make tests cpu only * Add unsorted tests * Add frontend code * Row Major Sorting Only Test * Handle Dynamic Shapes * Add dynamic input shapes * Dynamic Shape Tests * Add documentation * Dtypes * PR Comments * Added comments and changed naming * Add comments * Comments to Shape Func * Documentation * PR Changes * PR Comments * Resolve input and output dtype compat Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 173f3fc - Browse repository at this point
Copy the full SHA 173f3fcView commit details -
[Bugfix][Relay] Crash in match_exhaustion.cc when given an empty tupl…
…e pattern or constructor with no args (apache#7459) * [match_exhaustion] Fix cartesian product to handle empty tuple patterns or constructors with no args * Test cases do not actually exhibit the fixed bug * Mistake in comment
Configuration menu - View commit details
-
Copy full SHA for 77d2fe8 - Browse repository at this point
Copy the full SHA 77d2fe8View commit details -
Report JUnit test results for all TVM Python tests (apache#7450)
* Enable JUnit parsing for Python tests * retrigger CI * prefix junit results with FFI type * remove - in junit prefix
Configuration menu - View commit details
-
Copy full SHA for fe398bf - Browse repository at this point
Copy the full SHA fe398bfView commit details -
[ETHOSN] Add support for default Ethos-N78 configuration. (apache#6982)
Note: 'ETHOSN_VARIANT_CONFIG' must be set to test against Ethos-N78 and this adds support for one configuration of Ethos-N78 in TVM.
Configuration menu - View commit details
-
Copy full SHA for 76647ca - Browse repository at this point
Copy the full SHA 76647caView commit details
Commits on Feb 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 143c88e - Browse repository at this point
Copy the full SHA 143c88eView commit details -
[TOPI, Relay] Support roi_align NHWC layout (apache#7463)
* begin nhwc roi align * integrate mode change from upstream * adding test * support nhwc shape func * update strategy * refactoring test * refactor test * refactoring * fix lint * update relay op tests
Configuration menu - View commit details
-
Copy full SHA for b7e0cfb - Browse repository at this point
Copy the full SHA b7e0cfbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 84c4b15 - Browse repository at this point
Copy the full SHA 84c4b15View commit details -
[RUNTIME] Fast path for single thread run to allow app level threading (
apache#7454) * Fast path for single thread run to allow app level threading * add sync counter to avoid error in one of tests
Configuration menu - View commit details
-
Copy full SHA for 944d8d1 - Browse repository at this point
Copy the full SHA 944d8d1View commit details -
[Torch] Add index_put operator (apache#7465)
* [Torch] Add index_put operator * Skip test_frontends.py::test_load_model__pth
Configuration menu - View commit details
-
Copy full SHA for 50e013d - Browse repository at this point
Copy the full SHA 50e013dView commit details
Commits on Feb 19, 2021
-
[Relay][Bugfix] Fix off-by-one error in BiasAddRel, use new reporting (…
…apache#7467) * Fix off-by-one in BiasAddRel, use new reporting * No need to mark xfail if the exception is caught * lint
Configuration menu - View commit details
-
Copy full SHA for b51973f - Browse repository at this point
Copy the full SHA b51973fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e204209 - Browse repository at this point
Copy the full SHA e204209View commit details -
Get tvmc version from tvm (apache#7478)
Change-Id: I6a6e78080f36e4e3e1689e03ea48e759fcd8e466
Configuration menu - View commit details
-
Copy full SHA for 256b9cf - Browse repository at this point
Copy the full SHA 256b9cfView commit details -
[TVMC] Add composite target passes for compilation and tuning (apache…
…#7304) * Extend --target syntax to cover multiple targets for compilation and tuning * Add a new composite_target module to implement custom codegen passes into TVMC * Provide implementation to integrate TVMC, to target Arm Ethos-N NPU and Compute Library for the Arm Architecture (ACL) Change-Id: Iaee53fe22f0c14eb4e4c8ec47e72bade0c5e32cc
Configuration menu - View commit details
-
Copy full SHA for d16f282 - Browse repository at this point
Copy the full SHA d16f282View commit details
Commits on Feb 20, 2021
-
[Frontend][Tensorflow] Support explicit_paddings for TF 2.x (apache#7445
) * Ignore some TF2.0 attributes * Support explicit padding for conv2d, max_pool, conv3d * Remove conv3d explicit padding test since TF API doesn't allow it
Trevor Morris authoredFeb 20, 2021 Configuration menu - View commit details
-
Copy full SHA for 5688068 - Browse repository at this point
Copy the full SHA 5688068View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5a0573b - Browse repository at this point
Copy the full SHA 5a0573bView commit details
Commits on Feb 21, 2021
-
[TIR] Specialize MutateArray in StmtFunctor. (apache#7486)
StmtFunctor applies context dependent copy on write, which requires check over all the dependency chain. Such function is better suited as a special implementation to avoid misuse. This PR refactors the code to specialize the function. Co-authored-by: Bohan Hou <32121147+spectrometerHBH@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for cfe88c1 - Browse repository at this point
Copy the full SHA cfe88c1View commit details
Commits on Feb 22, 2021
-
[CUDA][THRUST] Enforce -libs=thrust to allow thrust offload (apache#7468
) * add contrib/thrust.py * update cuda strategy * remove is_thrust_available, update nms, scan, sort and tests * remove unused import * trigger CI * update * add note on how to enable thrust in ssd tutorial * add warning * Revert "update" This reverts commit c1629b3. Co-authored-by: masa <masa@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 072c469 - Browse repository at this point
Copy the full SHA 072c469View commit details -
Fix cuda nms handling of additional per box features (apache#7483)
Trevor Morris authoredFeb 22, 2021 Configuration menu - View commit details
-
Copy full SHA for d666b41 - Browse repository at this point
Copy the full SHA d666b41View commit details -
Fixed minor misspelling (apache#7499)
Co-authored-by: mshr-h <mshr-h@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 84359a9 - Browse repository at this point
Copy the full SHA 84359a9View commit details
Commits on Feb 23, 2021
-
[Target] Add target host field for target specification (apache#7462)
* Add target host field in Target * Add host as a config field to target * Add target host support for Python api * Add unit tests * Adjust format for cpplint * Remove unnecessary after in Python file * Remove redundancy and add param description * Fix format issue * Fix param description * Add unit test for duplicate target hosts
Configuration menu - View commit details
-
Copy full SHA for 794f6c6 - Browse repository at this point
Copy the full SHA 794f6c6View commit details -
[RELAY][Parser] Optimize relay parser to restore calls attrs (apache#…
…7347) * [RELAY][Parser] Optimize relay parser to restore attrs for non-Operator calls * To avoid too much modification to the native class, only print out the attrs type key of non-Operator Call in relay printer. Then reconstruct the attrs object after parsing this attrs type key value in Relay parser. * fix lint * fix ci * add test case
Configuration menu - View commit details
-
Copy full SHA for a111803 - Browse repository at this point
Copy the full SHA a111803View commit details -
[Frontend]Make onnx gemm tensor C optional (apache#7489)
* Make onnx gemm tensor C optional * fix codestyle * add tests * fix codestyle
Configuration menu - View commit details
-
Copy full SHA for d94cbbb - Browse repository at this point
Copy the full SHA d94cbbbView commit details -
[CRT] Create C-runtime-style metadata module for llvm builds (apache#…
…7398) * Create C-runtime-style metadata module for llvm builds. * maybe address manupa's comment * lint * actually address manupa comments * comment and rename * git-clang-format * pylint * cpp warning * try to fix apps/bundle_deploy * black format * build correct file * Use save() for C++-runtime targeted artifacts. * fix build_module LLVM metadata module conditions * fix test comment * black format * further restrict CRT MetadataModule creation * Fix test_link_params * black format and address zhiics comments * fix test_link_params, i think?
Configuration menu - View commit details
-
Copy full SHA for 0ba3741 - Browse repository at this point
Copy the full SHA 0ba3741View commit details -
Fix stack overflow when partially-__init__ Node raises exception. (ap…
…ache#7481) * Fix stack overflow when partially-__init__ Node raises exception. * If a Node subclass raises an exception and ctypes is in use before __init_handle_by_constructor__ is called (or self.handle is otherwise set), a Python stack overflow could result. This is because the unset handle slot causes self.handle accesses to fallback on the getattr(self, 'handle') method, invoking NodeGetAttr. * Then I believe this causes an infinite loop. * The fix is to make Node.__getattr__ raise AttributeError for all attributes in __slots__, then make __del__ tolerant to missing self.handle. * I don't believe cython is affected because it implements a descriptor to access its underlying chandle and that shouldn't be unset. * black format * actually use handle instead of self.handle
Configuration menu - View commit details
-
Copy full SHA for 929717a - Browse repository at this point
Copy the full SHA 929717aView commit details
Commits on Feb 24, 2021
-
Configuration menu - View commit details
-
Copy full SHA for e67f15b - Browse repository at this point
Copy the full SHA e67f15bView commit details -
[Relay][Frontend][Onnx] Fix GEMM converter when C is not a parameter. (…
…apache#7509) * Fix onnx gemm with non parameter C. * Add gemm tests for C. * Fix formatting.
Configuration menu - View commit details
-
Copy full SHA for d425c14 - Browse repository at this point
Copy the full SHA d425c14View commit details -
[AutoScheduler] Fix the type inference for conv2d (apache#7501)
* fix type inference for conv2d * fix
Configuration menu - View commit details
-
Copy full SHA for 086dbfe - Browse repository at this point
Copy the full SHA 086dbfeView commit details -
[TVMC] rename composite target "acl" to "compute-library" (apache#7508)
* Renames the "acl" composite target to point to the specific library it represents
Configuration menu - View commit details
-
Copy full SHA for 9c5333e - Browse repository at this point
Copy the full SHA 9c5333eView commit details -
Support creating Bool constants in the pattern_utils (apache#7507)
Matthew Brookhart authoredFeb 24, 2021 Configuration menu - View commit details
-
Copy full SHA for 88a4fdd - Browse repository at this point
Copy the full SHA 88a4fddView commit details -
[Frontend][Tensorflow] Support range like axis in tf.raw_ops.All for …
…TF 2.x (apache#7502) * add TF2.x raw_ops.all axis range support * apply linting * fix range() func input
Xingyu Zhou authoredFeb 24, 2021 Configuration menu - View commit details
-
Copy full SHA for 7f86987 - Browse repository at this point
Copy the full SHA 7f86987View commit details -
[BYOC][VitisAI] Fix issue in Vitis AI codegen out tensor names matchi…
…ng & update docs and docker (apache#7350) * Fix bug in vitis ai codegen out tensor names matching & update docs & update docker * Update vitis_ai.rst * Move gpg-agent package installation to vitis ai core script * Refactor install_vitis_ai_core script * Update docs/deploy/vitis_ai.rst Co-authored-by: Cody Yu <comaniac0422@gmail.com> * Update docs/deploy/vitis_ai.rst Co-authored-by: Cody Yu <comaniac0422@gmail.com> * Update vitis-ai docs pynq/edge setup & adjustements for comments * Update python/tvm/contrib/target/vitis_ai.py Co-authored-by: Cody Yu <comaniac0422@gmail.com> * Reorg Vitis AI dockerfile to make sure gpg-agent is installed before llvm Co-authored-by: Jorn Tuyls <jornt.tuyls@gmail.com> Co-authored-by: Cody Yu <comaniac0422@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1455536 - Browse repository at this point
Copy the full SHA 1455536View commit details
Commits on Feb 25, 2021
-
Support CombinedNMS in TF frontend. (apache#7520)
Trevor Morris authoredFeb 25, 2021 Configuration menu - View commit details
-
Copy full SHA for 9e74f90 - Browse repository at this point
Copy the full SHA 9e74f90View commit details -
[Frontend] TF V2 sparse.todense() test added (apache#7473)
* [Frontend] TF V2 sparse.todense() test added * [1] Review comments handled
ANSHUMAN TRIPATHY authoredFeb 25, 2021 Configuration menu - View commit details
-
Copy full SHA for fc6f08a - Browse repository at this point
Copy the full SHA fc6f08aView commit details -
Configuration menu - View commit details
-
Copy full SHA for b77c019 - Browse repository at this point
Copy the full SHA b77c019View commit details -
Configuration menu - View commit details
-
Copy full SHA for 772fa6b - Browse repository at this point
Copy the full SHA 772fa6bView commit details -
Introduce module_loader to AutoTVM. (apache#7337)
* Introduce code_loader to AutoTVM. * Prepares for autotuning with microTVM, and provides extension hook for VTA. * add vta hook * git-black * pylint * Add missing import * Fix import problem * add missing import * rename code_loader to module_loader * rename remote_kw to remote_kwargs * black format
Configuration menu - View commit details
-
Copy full SHA for b111695 - Browse repository at this point
Copy the full SHA b111695View commit details -
Configuration menu - View commit details
-
Copy full SHA for 43b15a8 - Browse repository at this point
Copy the full SHA 43b15a8View commit details -
use checked_type instead of type_annotation (apache#7522)
Matthew Brookhart authoredFeb 25, 2021 Configuration menu - View commit details
-
Copy full SHA for 6274a7f - Browse repository at this point
Copy the full SHA 6274a7fView commit details
Commits on Feb 26, 2021
-
[Torch] Avoid adding unnecessary slicing (apache#7479)
* simplyfing * improved fast path for slice * update rewrite pattern for maskrcnn
Configuration menu - View commit details
-
Copy full SHA for e664b2f - Browse repository at this point
Copy the full SHA e664b2fView commit details -
[Relay] Enforce static dim for non-concat axis if one or more tensors…
… have static dim (apache#7487) * enforce static dim for non-concat axis * assign any when all dims are dyn * add missing case * simplify * add test * only enforce static dim constraint if concat output is dynamic * more update to concat type rel * update tests * fixed compile warning
Configuration menu - View commit details
-
Copy full SHA for 63ea8e1 - Browse repository at this point
Copy the full SHA 63ea8e1View commit details -
[Frontend][Tensorflow] Add unique operator (apache#7441)
* Initial commit of the unique operator Add unit tests for unique operator * Add tensorflow unique op * Refactor unique to use sort-based algorithm * Change relay.unique test to run only on cpu * Change topi.unique test to run only on cpu * Change range to parallel for parallelizable loops * Add return_counts option for relay.unique and topi.unique, add pytorch frontend * Fix pylint * Patch pytorch frontend * Initial support of topi.cuda.unique * Refactor to use ir_builder directly * Modularize adjacent difference * Refactor to simplify * Fix typo * Combine _unique and _unique_with_counts * Reuse indices_ptr to remove arange_ptr Co-authored-by: Yanming Wang <yanmwang@amazon.com>
Configuration menu - View commit details
-
Copy full SHA for 09b0c8e - Browse repository at this point
Copy the full SHA 09b0c8eView commit details -
[Torch] Pool ops, convert strides and pool_size to int (apache#7517)
* Convert strides and pool_size to int * Make helper function, add test * Fix lint
Configuration menu - View commit details
-
Copy full SHA for c46b187 - Browse repository at this point
Copy the full SHA c46b187View commit details -
SparseReshape Op (apache#7477)
* SparseReshape Inital Code * Done * Format * Add empty tests * Formatting * SanityCheck * formatting documentation * Documentation * Only Enable CPU * Add support for CUDA * Stuff * Add Dynamic Support * Parallelize GPU Impl * Documentation * Documentation * Import * Import * Remove unnecessary code * PR Comments * Schedules * Tests * Dtypes * Black * Parallelize CPU * CI error Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 56ac892 - Browse repository at this point
Copy the full SHA 56ac892View commit details -
[BUG_FIX][TOPI] Allow topi resize to accept more options (apache#7532)
* Make topi more permissive * Remove testing stuff * lint * Downsampling tests
Configuration menu - View commit details
-
Copy full SHA for 5c5aea6 - Browse repository at this point
Copy the full SHA 5c5aea6View commit details -
[ONNX]fix datatype on Reciprocal op (apache#7519)
* fix datatype on Reciprocal op * clean up test case
Matthew Brookhart authoredFeb 26, 2021 Configuration menu - View commit details
-
Copy full SHA for 2d57470 - Browse repository at this point
Copy the full SHA 2d57470View commit details -
[CI] Move ci-cpu to use llvm-11 (apache#7541)
* [CI] Move ci-cpu to use llvm-11 * Fix the testcase of x86 codegen by relax the register names.
Configuration menu - View commit details
-
Copy full SHA for 74ca8f0 - Browse repository at this point
Copy the full SHA 74ca8f0View commit details
Commits on Feb 27, 2021
-
Add create_local_debug_runtime to micro exports (apache#7528)
* Add create_local_debug_runtime to micro exports. * retrigger CI
Configuration menu - View commit details
-
Copy full SHA for 0758337 - Browse repository at this point
Copy the full SHA 0758337View commit details -
Don't run non-tvm_op GraphRuntime nodes in Debug Runtime over RPC. (a…
…pache#7512) * Don't run non-tvm_op GraphRuntime nodes in Debug Runtime over RPC. * These are filtered out in SetupOpExecs for normal debug runtime operation. * retrigger CI * retrigger CI * address tkonolige comment
Configuration menu - View commit details
-
Copy full SHA for f6d0fee - Browse repository at this point
Copy the full SHA f6d0feeView commit details -
Configuration menu - View commit details
-
Copy full SHA for e7f0a11 - Browse repository at this point
Copy the full SHA e7f0a11View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0bbc205 - Browse repository at this point
Copy the full SHA 0bbc205View commit details -
Configuration menu - View commit details
-
Copy full SHA for 22ba2c4 - Browse repository at this point
Copy the full SHA 22ba2c4View commit details -
Configuration menu - View commit details
-
Copy full SHA for b1b3823 - Browse repository at this point
Copy the full SHA b1b3823View commit details -
Fix typo in relay.vm.Executable (apache#7543)
Co-authored-by: Yanming Wang <yanmwang@amazon.com>
Configuration menu - View commit details
-
Copy full SHA for 485dfd6 - Browse repository at this point
Copy the full SHA 485dfd6View commit details
Commits on Feb 28, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 2673309 - Browse repository at this point
Copy the full SHA 2673309View commit details
Commits on Mar 1, 2021
-
Fix foldconstant involving dropout (apache#7550)
Co-authored-by: masa <masa@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 51dc332 - Browse repository at this point
Copy the full SHA 51dc332View commit details -
[TensorIR] introduce Block and BlockRealize (apache#312) (apache#7553)
Co-authored-by: Bohan Hou <32121147+spectrometerHBH@users.noreply.github.com> Co-authored-by: Junru Shao <junrushao1994@gmail.com> Co-authored-by: Tianqi Chen <tqchen@users.noreply.github.com> Co-authored-by: Ruihang Lai <lairuihangdongdong@qq.com> Co-authored-by: Hongyi Jin <3231950289@qq.com> Co-authored-by: Wuwei Lin <wuwei@apache.org> Co-authored-by: Bohan Hou <32121147+spectrometerHBH@users.noreply.github.com> Co-authored-by: Junru Shao <junrushao1994@gmail.com> Co-authored-by: Tianqi Chen <tqchen@users.noreply.github.com> Co-authored-by: Ruihang Lai <lairuihangdongdong@qq.com> Co-authored-by: Hongyi Jin <3231950289@qq.com> Co-authored-by: Wuwei Lin <wuwei@apache.org>
Configuration menu - View commit details
-
Copy full SHA for 057a673 - Browse repository at this point
Copy the full SHA 057a673View commit details
Commits on Mar 2, 2021
-
[Autoscheduler][VM] Autoscheduler layout rewrite pass to VM (apache#7516
Configuration menu - View commit details
-
Copy full SHA for a1d43c1 - Browse repository at this point
Copy the full SHA a1d43c1View commit details -
fuse constant padding into conv kernels (apache#7515)
* fuse constant padding into conv kernels * change the kernel to support other layouts * add channel-last test * add a comment about bailing early
Matthew Brookhart authoredMar 2, 2021 Configuration menu - View commit details
-
Copy full SHA for 633ee11 - Browse repository at this point
Copy the full SHA 633ee11View commit details -
[Codegen][CUDA] Fix: cuda codegen vectorize cast (apache#7561)
* fix: cuda codegen vectorize cast * style: fix python coding style * fix: missing break * refactor: directly split by factor Co-authored-by: jiangchengquan <jiangchengquan@bytedance.com>
Configuration menu - View commit details
-
Copy full SHA for 5d354e4 - Browse repository at this point
Copy the full SHA 5d354e4View commit details -
[Torch] Fix converting torch slice op with dynamic slice length (apac…
…he#7549) * Fix converting torch slice op with dynamic slice length * use isinstance Co-authored-by: masa <masa@pop-os.localdomain>
Configuration menu - View commit details
-
Copy full SHA for 08ea961 - Browse repository at this point
Copy the full SHA 08ea961View commit details
Commits on Mar 3, 2021
-
[Pass] Profiling TVM compiler passes (apache#7500)
* basic pass profiler prototype * allow enable/disable of pass profiling * lint * add example pass profiler usage as test * render pass profiles to String instead of stdout
Configuration menu - View commit details
-
Copy full SHA for 3a02e0b - Browse repository at this point
Copy the full SHA 3a02e0bView commit details -
[TIR] Add TIR While node (apache#7425)
* add while node * update visitors * binary search lowering works * llvm codegen working * cuda codegen working * nms updated to use while loop * add missing upper bound check too * add mandelbrot test * add gpu mandel commit ee2363b Author: Masahiro Masuda <masahi129@gmail.com> Date: Fri Jan 29 11:44:02 2021 +0900 enable extern lib offload for nvptx * rename test * run black * add doc * add collatz test * add while + vectorize test * simplify bin search * Add special case visit method to storage_access.cc * disallow while loop inside vectorized loop * disallow trivial condition since we do not have break * error out in CoprocSync for now * error out LiftAttrScope for now * add placeholder to inject_vpthread * refactor to use MakeAttach * handle WhileNode in InplaceOpVerifier * error out in InjectVirtualThread * try handle WhileNode in StoragePlanRewriter * remove WhileNode visitor from storage rewrite * add while loop storage rewrite test * update tests * move test_vectorize_while_fail to test_tir_transform_vectorize.py
Configuration menu - View commit details
-
Copy full SHA for cf36aa6 - Browse repository at this point
Copy the full SHA cf36aa6View commit details -
[RELAY] Modify some passes to not stack overflow on many lets. (apach…
…e#7558) * [RELAY] Modify some passes to not stack overflow on many lets. Passes modified: - inline primitives - dead code - lambda lift * one fix * small fix * .at -> [] * fix
Configuration menu - View commit details
-
Copy full SHA for 91b6b3f - Browse repository at this point
Copy the full SHA 91b6b3fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 67bba90 - Browse repository at this point
Copy the full SHA 67bba90View commit details -
[Tensorize] Support conds depend on outer loop vars inside tensorize …
…scope (apache#7497) * [Tensorize] Support conds depend on outer loop vars inside tensorize scope * Reformat
Configuration menu - View commit details
-
Copy full SHA for 37053e1 - Browse repository at this point
Copy the full SHA 37053e1View commit details -
[CI][VitisAI] Update CI Vitis AI PyXIR version (apache#7575)
* Update Vitis AI CI PyXIR version to v0.1.6 * Add --depth 1 to PyXIR clone command
Configuration menu - View commit details
-
Copy full SHA for 296700e - Browse repository at this point
Copy the full SHA 296700eView commit details -
[SPIR-V] Add SPIR-V lowering for While node (apache#7574)
* Add SPIR-V lowering for WhileNode * test vulkan in while loop tests
Configuration menu - View commit details
-
Copy full SHA for dccc1ab - Browse repository at this point
Copy the full SHA dccc1abView commit details -
Configuration menu - View commit details
-
Copy full SHA for 25bf449 - Browse repository at this point
Copy the full SHA 25bf449View commit details -
[Relay][bugfix][error reporting] BiasAddRel does not check for a nega…
…tive index being out of bounds (apache#7554)
Configuration menu - View commit details
-
Copy full SHA for 1d5f334 - Browse repository at this point
Copy the full SHA 1d5f334View commit details -
Configuration menu - View commit details
-
Copy full SHA for 980cf80 - Browse repository at this point
Copy the full SHA 980cf80View commit details -
[RUNTIME] Move Map into runtime (apache#7570)
* [RUNTIME] Move Map into runtime This allows us to use Map to store parameters needed at runtime. * node.{Array|Map} -> runtime.{Array|Map} * missed some renames
Configuration menu - View commit details
-
Copy full SHA for 73a0b96 - Browse repository at this point
Copy the full SHA 73a0b96View commit details
Commits on Mar 4, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 41c0591 - Browse repository at this point
Copy the full SHA 41c0591View commit details -
[AutoScheduler] Querying and sampling in task extraction (apache#7571)
* [AutoScheduler] Query in task extraction * trigger ci
Configuration menu - View commit details
-
Copy full SHA for 3f5f84d - Browse repository at this point
Copy the full SHA 3f5f84dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 66f9139 - Browse repository at this point
Copy the full SHA 66f9139View commit details -
[ETHOSN] Add support for 20.11 Ethos-N driver stack release (apache#7506
Configuration menu - View commit details
-
Copy full SHA for 02a6483 - Browse repository at this point
Copy the full SHA 02a6483View commit details -
Fixes for using Python APIs from Rust. (apache#7085)
* Rewrite the Rust Module API and change some imports causing crashes. This commit also updates the docs to remove outdated information. * Renable Python test and remove warnings * Python test still flaky * Fix broken module test * Fix broken test * Reset test file
Configuration menu - View commit details
-
Copy full SHA for 6aae48b - Browse repository at this point
Copy the full SHA 6aae48bView commit details -
Add segment sum Op to relay and 7 corresponding TF Ops , fix scatter_…
…add dynamic bug (apache#7562) * Add segment sum Op * Remove unnecessary * Documentation * Black * Add GPU * Uncomment * Add documentation * Add dynamic tests * Add TF Op * Add Sparse Segment Sum * Add test coverage * PR Comments * Int64 tests * Add SparseSegmentSqrtN * Add SparseSegmentSqrtNOp * Deduplicate code * Add SparseSegmentMean * Parametrize Tests * Remove * Modularize * Black * Modularize Code * Pylint * PR Comments * Add scatter add tests * Remove Test Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 83ab234 - Browse repository at this point
Copy the full SHA 83ab234View commit details -
[BYOC][TensorRT] Make TRT runtime robust to empty or weird subgraphs (a…
…pache#7581) * Prevent TRT runtime crash for duplicate inputs and outputs * Add empty subgraph unit test
Trevor Morris authoredMar 4, 2021 Configuration menu - View commit details
-
Copy full SHA for 3fbb0a3 - Browse repository at this point
Copy the full SHA 3fbb0a3View commit details
Commits on Mar 5, 2021
-
Configuration menu - View commit details
-
Copy full SHA for d7f5753 - Browse repository at this point
Copy the full SHA d7f5753View commit details -
Configuration menu - View commit details
-
Copy full SHA for d5cb3cb - Browse repository at this point
Copy the full SHA d5cb3cbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 61e799c - Browse repository at this point
Copy the full SHA 61e799cView commit details -
Configuration menu - View commit details
-
Copy full SHA for fb06fd8 - Browse repository at this point
Copy the full SHA fb06fd8View commit details -
[Frontend][MXNet] Fix default value for is_ascend in topk (apache#7568)
* Use correct default value of False for is_ascend * Add unit test for default topk is_ascend value
Trevor Morris authoredMar 5, 2021 Configuration menu - View commit details
-
Copy full SHA for b9adce2 - Browse repository at this point
Copy the full SHA b9adce2View commit details -
[BYOC][TRT]Fix groups cannot divide output channel count error for de…
…conv when groups>1 (apache#7595) * trt num_outputs * asdf * fix lint Co-authored-by: Leyuan Wang <leyuan.wang@bytedance.com>
Configuration menu - View commit details
-
Copy full SHA for c5f608f - Browse repository at this point
Copy the full SHA c5f608fView commit details -
Support negative axis for gather (apache#7600)
* Fix negative axis in gather * Clang Format * Black * Empty Commit Co-authored-by: Ubuntu <ubuntu@ip-172-31-42-251.us-east-2.compute.internal>
Configuration menu - View commit details
-
Copy full SHA for 5d5bbfb - Browse repository at this point
Copy the full SHA 5d5bbfbView commit details -
[Vulkan] Support passing 64 bit scalar (apache#7572)
Co-authored-by: Wuwei Lin <wuwei@apache.org>
Configuration menu - View commit details
-
Copy full SHA for c0b9688 - Browse repository at this point
Copy the full SHA c0b9688View commit details -
Fix autotuning, broken in apache#7337 (apache#7566)
* Fix autotuning, broken in apache#7337 * retrigger CI, because I don't understand how it passed
Configuration menu - View commit details
-
Copy full SHA for 7344b66 - Browse repository at this point
Copy the full SHA 7344b66View commit details -
Configuration menu - View commit details
-
Copy full SHA for d6c0cea - Browse repository at this point
Copy the full SHA d6c0ceaView commit details -
[Relay][Pass] Avoid stack overflow when using PostOrderRewrite (apach…
…e#7588) * init * fix * fix
Configuration menu - View commit details
-
Copy full SHA for 1ae4697 - Browse repository at this point
Copy the full SHA 1ae4697View commit details -
[TOPI] disable test_shift with i8 datatype (apache#7597)
apache#7539 Co-authored-by: guoweijun <guoweijun@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 783be9d - Browse repository at this point
Copy the full SHA 783be9dView commit details
Commits on Mar 6, 2021
-
[AutoSchedule] Sparse dense tuning support with custom sketch rule (a…
…pache#7313) * Add sparse dense tuning tutorial * Add sparse input fusion * Update the dag to support output fusion * Update * Add task input to search_task * Update * Add search_inputs to measure * Lint fix * Lint fix * Update * Update * Update * Update * Add file save load support * Update * Update * Update * Remove add_task_inputs API * Update * Update * Update * Lint fix * Lint fix * Lint fix * Lint fix * Update * Add example ci_log * Update * retrigger ci * Update * Update * Update * Lint fix * Lint fix * Lint fix
Configuration menu - View commit details
-
Copy full SHA for 0b4f669 - Browse repository at this point
Copy the full SHA 0b4f669View commit details -
Move SimplifyConvPad to a new pass and don't enable it by default (ap…
…ache#7603) * Move SimplifyConvPad to a new pass and don't enable it by default * rename pass * move files * fix lint * adjust test tolerance
Matthew Brookhart authoredMar 6, 2021 Configuration menu - View commit details
-
Copy full SHA for 69c1c6d - Browse repository at this point
Copy the full SHA 69c1c6dView commit details -
[Executor][Bugfix] Properly return and unflatten outputs from GraphEx…
…ecutor (apache#7604) * properly return and unflatten outputs from GraphExecutor * lint * cleaner approach, not sure what I was thinking before * remove unused import * forgot copyto cpu * make solution even cleaner using iterator
Configuration menu - View commit details
-
Copy full SHA for 875f8ee - Browse repository at this point
Copy the full SHA 875f8eeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8aa2a7c - Browse repository at this point
Copy the full SHA 8aa2a7cView commit details
Commits on Mar 8, 2021
-
[Torch, QNN] Support quantized mobilenet v3 from torch 1.8 (apache#7606)
* [Torch] support hardsigmoid * qhswish first impl * add qhardsigmoid but the result is not correct * add qmv3 to test * comment fix
Configuration menu - View commit details
-
Copy full SHA for 760e9b2 - Browse repository at this point
Copy the full SHA 760e9b2View commit details -
[TE] Fix bug in AutoInlineElemWise and implement AutoInlineBroadcast (a…
…pache#7602) * [TE] Fix bug in AutoInlineElemWise and implement AutoInlineBroadcast * [TE] Add AutoInlineBroadcast API to schedule_pass.h
Configuration menu - View commit details
-
Copy full SHA for cc7f8dc - Browse repository at this point
Copy the full SHA cc7f8dcView commit details -
[Relay] add ShapeFunc for tanh (apache#6898)
* add ShapeFunc for tanh * _schedule_dense_small_batch turn autotvm off when dense's inner dim is unknown * fix CI pylint
Configuration menu - View commit details
-
Copy full SHA for ca303aa - Browse repository at this point
Copy the full SHA ca303aaView commit details -
[Relay] Fix relay op strategy for cuda dense int8 (apache#7586)
* [Relay] Fix relay op strategy for cuda dense int8 * Remove uint8 && Add autotvm task extraction test for relay graph that contains dense op (int8 * int8 -> int32) * Reformat the code of test case
Configuration menu - View commit details
-
Copy full SHA for 8d1f5b2 - Browse repository at this point
Copy the full SHA 8d1f5b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 717c5e0 - Browse repository at this point
Copy the full SHA 717c5e0View commit details -
[Relay] add ShapeFunc for one_hot op (apache#7490)
* [Relay] add ShapeFunc for one_hot op * fix pylint * add test for shapefunc of one_hot op
Configuration menu - View commit details
-
Copy full SHA for b827845 - Browse repository at this point
Copy the full SHA b827845View commit details
Commits on Mar 9, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 89bafd5 - Browse repository at this point
Copy the full SHA 89bafd5View commit details -
Configuration menu - View commit details
-
Copy full SHA for a8d1055 - Browse repository at this point
Copy the full SHA a8d1055View commit details -
Configuration menu - View commit details
-
Copy full SHA for d830f2c - Browse repository at this point
Copy the full SHA d830f2cView commit details -
Configuration menu - View commit details
-
Copy full SHA for dfc231c - Browse repository at this point
Copy the full SHA dfc231cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f9e5a4 - Browse repository at this point
Copy the full SHA 8f9e5a4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 12c3b3d - Browse repository at this point
Copy the full SHA 12c3b3dView commit details -
[SPIRV] Minor update to TIR sort to make it work on VK/SPIR-V (apache…
…#7607) * sort started to working * static size sort seems to be working * test sort on vulkan * add nvptx to sort test too
Configuration menu - View commit details
-
Copy full SHA for a0656f5 - Browse repository at this point
Copy the full SHA a0656f5View commit details -
Trevor Morris committed
Mar 9, 2021 Configuration menu - View commit details
-
Copy full SHA for 77d28ab - Browse repository at this point
Copy the full SHA 77d28abView commit details