TVM v0.5 Roadmap #1596

tqchen · 2018-08-13T18:56:05Z

yzhliu · 2018-08-13T20:57:23Z

Shall we add heterogeneous graph runtime? @zhiics is working on that.

anijain2305 · 2018-08-13T22:25:44Z

I am interested in implementing the Intel CPU support for INT8 quantization

siju-samuel · 2018-08-14T00:38:13Z

I'm interested in implementing the RUST runtime.

ehsanmok · 2018-08-14T02:35:09Z

@tqchen @siju-samuel My Rust runtime (dylib) support which follows the same generic API as Java for example (CPU, GPU, etc.) is 70%-ish done! I'll need to finish the callback support, add docs and cleanup. Any contributions is welcomed!

@nhynes Rust static support is in a good shape as well but is specific to CPU with custom allocator etc.

siju-samuel · 2018-08-14T03:31:26Z

@ehsanmok OK
Anyone doing "Support a c++ version of cross platform RPC"? If not, I'm interested in taking up this.

PariksheetPinjari909 · 2018-08-14T03:42:20Z

@tqchen I have started working 8 bit quantizer and its operator support for conv2d, dense and relu. To avoid duplicate work pls let me know if anyone else is doing this work.

nhynes · 2018-08-14T03:47:37Z

PR for static Rust runtime in #1597.

@ehsanmok I'm not sure what you mean by "custom allocator etc." It uses whatever GlobalAlloc you care to use.

ehsanmok · 2018-08-14T03:54:57Z

@nhynes I meant you've defined your own allocator, threading, parallel backend support for CPU usage only for staticlib compiling with xargo while I've taken different route relying on existing layeouts for example and seems working for GPU. Though I admit I've done the project for my own enrichment first.

tqchen · 2018-08-14T04:05:02Z

@PariksheetPinjari909 the UW SAML team is working on a generic n-bit quantizer and hopefully things will get RFCed and upstreamed in this release cycle

tqchen · 2018-08-14T04:07:28Z

Please feel free to open new issues to track the working items, @siju-samuel standalone RPC is tracked by #1496

tqchen · 2018-08-14T04:12:09Z

The first post contains an initial list of things based on the community feedback, please also feel free to propose new things and we will add it to the roadmap

nhynes · 2018-08-14T04:19:06Z

Will the new graph runtime make it into this release? I'd love to upstream some training codes, but they all depend on the semi-kluge FExpandCompute.

tqchen · 2018-08-14T04:23:28Z

@nhynes it belongs to the "high-level IR improvements"

PariksheetPinjari909 · 2018-08-14T04:45:33Z

@tqchen Ok. Let me know what support i can give in 8 bit quantization. I am interested to contribute here.

PariksheetPinjari909 · 2018-08-14T10:53:02Z

I would like to take up the control flow ops. Let me know if someone is working on that.

tqchen · 2018-08-14T16:36:48Z

@PariksheetPinjari909 We will make a major RFC to upgrade the IR system including control flow ops and type system, and after the first phase proposal is done, everyone is welcomed to contribute

kazum · 2018-08-16T15:58:58Z

Sorry for being late. I’d like to add preliminary support for HLS shecudler to allow compiling actual neural networks with AOCL and SDAccel backends.

tqchen · 2018-08-21T17:41:55Z

int8 cuda gemm recipe #1614

JammyZhou · 2018-08-22T13:46:17Z

@tqchen from TVM perspective, any comments on ONNXIFI? I'm thinking about how TVM stack can fit into it.

ajtulloch · 2018-08-24T22:03:21Z

Re microkernels/tensorization, I've been looking at that stuff the last few months or so. There's some WIP stuff in https://github.com/ajtulloch/tvm/tree/tvm-using-val/tensorize, notably well-tuned assembly versions of:

FP32 GEMM kernels (ARMv7, AVX2)
Int8 x Int8 -> Int32 GEMM kernels (AVX2, adding ARMv7 shortly)

My hypothesis is that we can get a pretty decent part of the way with just GEMM microkernels for a lot of these dense workloads, but it's to-be-tested currently.

Some examples of using them in GEMM-based convs and for the batch gemm of a minimal F(6x6, 3x3) Winograd (~2-3x faster than current trunk on most configurations for ARMv7) are in that dir as well. For folks interested in the "Micro-asm kernel exploration" and "8-bit network stuff" (esp on CPUs), it'd be good to collaborate :).

anijain2305 · 2018-08-24T22:15:10Z

@ajtulloch I am working on Intel 8-bit Conv implementation using Intel Skylake AVX512 instructions (with the long-term goal of using VNNI instructions). I am not using GEMM-based convolution though. I am starting from NCHWc format direct convolution present in current conv2d topi implementation. I should have some numbers for the conv operator by the next weekend and can share them.

merrymercy · 2018-08-27T20:43:41Z

@ajtulloch It will be great if you can send a tutorial or topi recipe

ajtulloch · 2018-08-27T20:52:55Z

@anijain2305 you might find https://github.com/ajtulloch/tvm/blob/tvm-using-val/tensorize/gemm__avx2.c#L424-L531 or a similar microkernel for AVX512 useful on Skylake (same as MKL-DNN's vpmaddubsw/vpmaddwd/vpaddd sequence on AVX2/AVX512 pre VNNI).

@merrymercy what would be useful to have documented/tutorialized or made into a recipe?

merrymercy · 2018-08-27T22:11:31Z

I think making a simple runnable conv2d example and showing its speedup will be very useful.

FrozenGene · 2018-08-29T00:54:07Z

+1 to one conv2d runnable example. Besides ARMv7 / AVX2, I think we should also add SSE too. For some embbeding platforms, which would use Intel ATOM processors. However, Intel ATOM processors only support SSE4.2 at most, not AVX2.

ZihengJiang · 2019-01-17T01:44:00Z

0.5 release note candidate is now up at #2448

ZihengJiang · 2019-02-19T05:16:37Z

v0.5 is now tagged, next cycle roadmap issue is available at #2623

tqchen added the type: roadmap label Aug 13, 2018

This was referenced Aug 13, 2018

TVM v0.4 Roadmap #1170

Closed

TVM 0.4 Release Note #1577

Closed

tqchen added this to the v0.5 milestone Aug 14, 2018

ZihengJiang closed this as completed Feb 19, 2019

apache locked as resolved and limited conversation to collaborators Feb 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TVM v0.5 Roadmap #1596

TVM v0.5 Roadmap #1596

tqchen commented Aug 13, 2018 •

edited by ZihengJiang

Loading

yzhliu commented Aug 13, 2018

anijain2305 commented Aug 13, 2018

siju-samuel commented Aug 14, 2018

ehsanmok commented Aug 14, 2018 •

edited

Loading

siju-samuel commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

nhynes commented Aug 14, 2018 •

edited

Loading

ehsanmok commented Aug 14, 2018 •

edited

Loading

tqchen commented Aug 14, 2018 •

edited

Loading

tqchen commented Aug 14, 2018

tqchen commented Aug 14, 2018 •

edited

Loading

nhynes commented Aug 14, 2018

tqchen commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

tqchen commented Aug 14, 2018

kazum commented Aug 16, 2018

tqchen commented Aug 21, 2018 •

edited

Loading

JammyZhou commented Aug 22, 2018

ajtulloch commented Aug 24, 2018 •

edited

Loading

anijain2305 commented Aug 24, 2018

merrymercy commented Aug 27, 2018

ajtulloch commented Aug 27, 2018

merrymercy commented Aug 27, 2018

FrozenGene commented Aug 29, 2018

ZihengJiang commented Jan 17, 2019

ZihengJiang commented Feb 19, 2019

TVM v0.5 Roadmap #1596

TVM v0.5 Roadmap #1596

Comments

tqchen commented Aug 13, 2018 • edited by ZihengJiang Loading

Features

yzhliu commented Aug 13, 2018

anijain2305 commented Aug 13, 2018

siju-samuel commented Aug 14, 2018

ehsanmok commented Aug 14, 2018 • edited Loading

siju-samuel commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

nhynes commented Aug 14, 2018 • edited Loading

ehsanmok commented Aug 14, 2018 • edited Loading

tqchen commented Aug 14, 2018 • edited Loading

tqchen commented Aug 14, 2018

tqchen commented Aug 14, 2018 • edited Loading

nhynes commented Aug 14, 2018

tqchen commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

PariksheetPinjari909 commented Aug 14, 2018

tqchen commented Aug 14, 2018

kazum commented Aug 16, 2018

tqchen commented Aug 21, 2018 • edited Loading

JammyZhou commented Aug 22, 2018

ajtulloch commented Aug 24, 2018 • edited Loading

anijain2305 commented Aug 24, 2018

merrymercy commented Aug 27, 2018

ajtulloch commented Aug 27, 2018

merrymercy commented Aug 27, 2018

FrozenGene commented Aug 29, 2018

ZihengJiang commented Jan 17, 2019

ZihengJiang commented Feb 19, 2019

tqchen commented Aug 13, 2018 •

edited by ZihengJiang

Loading

ehsanmok commented Aug 14, 2018 •

edited

Loading

nhynes commented Aug 14, 2018 •

edited

Loading

ehsanmok commented Aug 14, 2018 •

edited

Loading

tqchen commented Aug 14, 2018 •

edited

Loading

tqchen commented Aug 14, 2018 •

edited

Loading

tqchen commented Aug 21, 2018 •

edited

Loading

ajtulloch commented Aug 24, 2018 •

edited

Loading