[RFC] [Relay] Automatic Mixed Precision Pass #6

AndrewZhaoLuo · 2021-06-09T04:06:46Z

Relevant Links:

https://discuss.tvm.apache.org/t/rfc-relay-fp32-fp16-model-support/9994

Old discussion before the new RFC process was rolled out

apache/tvm#8069

Initial PR

cc @hogepodge @mbrookhart @anijain2305 @masahi

Link to tracking issue: apache/tvm#8296

comaniac · 2021-06-09T17:27:08Z

Thanks for the RFC. I have two questions:

How to mark/set the color (i.e., attribute) of every operator?
It seems to me that if we register a casting checker instead of just a label (color), then we can simplify the algorithm a lot. Taking the case A(green) - B(gray) - C(green) as an example, if we could register a casting rule of B as follows, then we just need one traverse to know if we need cast around B:
```
def amp_B(expr, args):
    a = args[0]
    if (a.dtype is float16):
      return fp16
    return fp32
```
After all, we only need the previous nodes to determine 1) whether to use FP16 implementation, and 2) whether to insert casts. It seems to me that this pass is similar to the layout conversion pass, which uses one traverse to finish everything, so it might be possible for AMP too.

AndrewZhaoLuo · 2021-06-09T17:43:49Z

Thanks for the RFC. I have two questions:
How to mark/set the color (i.e., attribute) of every operator?
It seems to me that if we register a casting checker instead of just a label (color), then we can simplify the algorithm a lot. Taking the case A(green) - B(gray) - C(green) as an example, if we could register a casting rule of B as follows, then we just need one traverse to know if we need cast around B:
def amp_B(expr, args):
    a = args[0]
    if (a.dtype is float16):
      return fp16
    return fp32
After all, we only need the previous nodes to determine 1) whether to use FP16 implementation, and 2) whether to insert casts. It seems to me that this pass is similar to the layout conversion pass, which uses one traverse to finish everything, so it might be possible for AMP too.

Yep that is correct it is very similar to the layout conversion pass. This RFC has an initial PR here: apache/tvm#8069.

To answer your questions:

src/relay/transforms/fp32_to_fp16.h -- DefaultFP16Colorer is the default way. But the only thing we need is a callable with type CallNode*(Color). So you could write your own colorer that does arbitrary stuff when only looking at a single node at a time.
This is functionally what is done in the PR I link. It's one pass.

comaniac · 2021-06-09T17:55:09Z

Thanks for the answers. I'll review the PR to get more implementation details.
One more question regarding the extensibility: can this be extended easily to support bfloat16?

AndrewZhaoLuo · 2021-06-09T18:02:27Z

Thanks for the answers. I'll review the PR to get more implementation details.
One more question regarding the extensibility: can this be extended easily to support bfloat16?

It should be trivial (hope I don't eat my words). I'm not 100% sure of the support for bfloat16 in current relay ops however.

AndrewZhaoLuo · 2021-06-09T18:10:00Z

I don't know Chris Sullivan's github handle so if someone could cc him too that would be great.

tmoreau89 · 2021-06-09T18:12:00Z

CCing @csullivan

comaniac · 2021-06-09T18:26:32Z

Thanks for the answers. I'll review the PR to get more implementation details.
One more question regarding the extensibility: can this be extended easily to support bfloat16?

It should be trivial (hope I don't eat my words). I'm not 100% sure of the support for bfloat16 in current relay ops however.

TVM has limited bfloat16 support now but it's on the way, so it would be better for this RFC to also consider this case, even the initial version may not cover it.

AndrewZhaoLuo · 2021-06-15T18:30:04Z

So the associated PR is getting closer to a mergeable state. Is this RFC ready for more comments?

tqchen · 2021-07-24T12:43:23Z

cc @comaniac would be great if you can help shepherd this RFC

comaniac

The concept and algorithm look good to me, but it would be better to provide more implementation/design details.

comaniac · 2021-07-27T16:32:04Z

rfcs/0001-AMP_pass.md

+We can support automatic mixed precision retraining though that is a much, much larger future goal. It's
+good to have this in the meantime.


The answer to this question should come with a discussion of existing mechanisms used by other frameworks, such as XLA and PyTorch.

Done. Please let me know if this is sufficient. Don't have the best background on some of this stuff.

rfcs/0001-AMP_pass.md

AndrewZhaoLuo · 2021-07-27T23:02:47Z

Thanks for driving this review @comaniac. I'll get to this later in the week.

AndrewZhaoLuo · 2021-08-04T22:03:00Z

Going to get to this tomorrow 😬. Promise 🤞

comaniac · 2021-08-04T22:09:45Z

btw, according to #17, please update the RFC number on the file name to align with this PR number.

comaniac · 2021-08-16T21:03:09Z

Took a quick pass to the updated RFC. I think it's almost ready to merge as long as the last 3 comments are resolved.

AndrewZhaoLuo · 2021-08-17T20:08:53Z

PTAL @comaniac

AndrewZhaoLuo · 2021-08-18T18:02:10Z

@comaniac, I'll be talking about this at the TVM community meeting tomorrow so put off merging until after.

comaniac

LGTM. Will merge after the community meeting if there's no objection.

AndrewZhaoLuo · 2021-08-19T19:44:19Z

If there is not other objections, this will be merged on monday.

comaniac · 2021-08-24T16:54:42Z

Thanks @AndrewZhaoLuo

MeJerry215 · 2021-12-30T07:28:07Z

@AndrewZhaoLuo will it remove cast weight to float16 from graph? and make weight as float16 when build lib.
in my opinion, it will reduce the bandwidth.

masahi · 2021-12-30T07:38:07Z

@MeJerry215 Yes, casting of weight to fp16 is done at compile time by FoldConstant pass, so weights will be in fp16 at deploy time.

uma-rfc: update to questions/comments added

draft v1

a8c2549

add sources

6698203

editor for spelling

2a209ef

AndrewZhaoLuo marked this pull request as ready for review June 9, 2021 18:08

AndrewZhaoLuo changed the title ~~Automatic Mixed Precision Pass RFC~~ [RFC] [Relay] Automatic Mixed Precision Pass Jun 9, 2021

AndrewZhaoLuo mentioned this pull request Jun 16, 2021

[Relay] [Pass] Add mixed precision (e.g. FP16) model conversion pass apache/tvm#8069

Merged

add plans for benchmarking + tutorial

97ddca9

AndrewZhaoLuo mentioned this pull request Jun 21, 2021

[RFC][Tracking Issue][AMP] Tracking Issue for Mixed Precision Pass apache/tvm#8296

Closed

18 tasks

tqchen assigned comaniac Jul 24, 2021

tqchen added the status: need review RFC needs review label Jul 24, 2021

comaniac requested changes Jul 27, 2021

View reviewed changes

comaniac added the status: need update RFC needs update based on feedback label Jul 27, 2021

AndrewZhaoLuo added 6 commits August 5, 2021 20:34

rename to rfc name

45133d5

add links to PRs and Issues

fcb5325

light edits

56831ab

flesh out interface for user

ed104ec

add example

87813af

clean up all sections except reference level explanation

e9777c1

AndrewZhaoLuo added 3 commits August 13, 2021 12:01

add example of how to convert to fp16

ed3acb7

correct example

1c4e595

pass implementation details

2bc896c

AndrewZhaoLuo added 6 commits August 17, 2021 10:25

flesh out final sections

2749f7c

add sentence

def9bc5

add sentence

6886d38

more touch ups

d45a661

talk about XLA and existing support

24c66c6

discussion on possible targets

63d1cb0

AndrewZhaoLuo added 2 commits August 18, 2021 10:58

address comments on PyTorch vs TF appraoches

def82a1

light edits for grammar

6f65e82

comaniac approved these changes Aug 18, 2021

View reviewed changes

comaniac added status: accepted RFC is accepted and removed status: need review RFC needs review status: need update RFC needs update based on feedback labels Aug 18, 2021

comaniac merged commit dd2e7a8 into apache:main Aug 24, 2021

MichaelJKlaiber added a commit to MichaelJKlaiber/tvm-rfcs that referenced this pull request Apr 6, 2022

Merge pull request apache#6 from MichaelJKlaiber/rfc_uma

fcc56ca

uma-rfc: update to questions/comments added

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] [Relay] Automatic Mixed Precision Pass #6

[RFC] [Relay] Automatic Mixed Precision Pass #6

AndrewZhaoLuo commented Jun 9, 2021 •

edited

Loading

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021 •

edited

Loading

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021

tmoreau89 commented Jun 9, 2021

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 15, 2021 •

edited

Loading

tqchen commented Jul 24, 2021

comaniac left a comment

comaniac Jul 27, 2021

AndrewZhaoLuo Aug 17, 2021

AndrewZhaoLuo commented Jul 27, 2021

AndrewZhaoLuo commented Aug 4, 2021

comaniac commented Aug 4, 2021

comaniac commented Aug 16, 2021

AndrewZhaoLuo commented Aug 17, 2021

AndrewZhaoLuo commented Aug 18, 2021

comaniac left a comment

AndrewZhaoLuo commented Aug 19, 2021

comaniac commented Aug 24, 2021

MeJerry215 commented Dec 30, 2021

masahi commented Dec 30, 2021

		We can support automatic mixed precision retraining though that is a much, much larger future goal. It's
		good to have this in the meantime.

[RFC] [Relay] Automatic Mixed Precision Pass #6

[RFC] [Relay] Automatic Mixed Precision Pass #6

Conversation

AndrewZhaoLuo commented Jun 9, 2021 • edited Loading

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021 • edited Loading

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021

AndrewZhaoLuo commented Jun 9, 2021

tmoreau89 commented Jun 9, 2021

comaniac commented Jun 9, 2021

AndrewZhaoLuo commented Jun 15, 2021 • edited Loading

tqchen commented Jul 24, 2021

comaniac left a comment

Choose a reason for hiding this comment

comaniac Jul 27, 2021

Choose a reason for hiding this comment

AndrewZhaoLuo Aug 17, 2021

Choose a reason for hiding this comment

AndrewZhaoLuo commented Jul 27, 2021

AndrewZhaoLuo commented Aug 4, 2021

comaniac commented Aug 4, 2021

comaniac commented Aug 16, 2021

AndrewZhaoLuo commented Aug 17, 2021

AndrewZhaoLuo commented Aug 18, 2021

comaniac left a comment

Choose a reason for hiding this comment

AndrewZhaoLuo commented Aug 19, 2021

comaniac commented Aug 24, 2021

MeJerry215 commented Dec 30, 2021

masahi commented Dec 30, 2021

AndrewZhaoLuo commented Jun 9, 2021 •

edited

Loading

AndrewZhaoLuo commented Jun 9, 2021 •

edited

Loading

AndrewZhaoLuo commented Jun 15, 2021 •

edited

Loading