[Arith] Merge surjective/non-surjective iter mapping detections #11287

wrongtest-intellif · 2022-05-12T12:18:41Z

Update a simplify rule when c2 is nonzero, original rule is covered with constant folding.
floormod(x * c1, c2) =>
floormod(x * (floordiv(c1, c2) * c2 + floormod(c1, c2)), c2) =>
floormod(x * floormod(c1, c2)), c2)

This is useful for certain non-perfect tiling case, where there are dynamic loop ranges which is actually constant wrt outer loop domain.

For example, floordiv(floormod(x * 360, 16) + 359, 16) with x in [0, 2) can finally reduce to constant 22, since the rule could eliminate the multiply factor 360 to 360 % 16, activating more available rules.

Unfortunately the working example on tiling encounter a region_cover related problem again.

wrongtest-intellif · 2022-05-12T12:25:28Z

where (or is it neccesary) to write testcase on analyzer.simplify()'s behavior ?

tqchen · 2022-05-12T13:59:24Z

@wrongtest yes we should cover simplifier's behavior, but the rewrite_simplifier testcase should be sufficient for now

wrongtest-intellif · 2022-05-13T16:51:15Z

The failed compute_at's region cover check possibly could get fixed by #11235 improvement on iteration analysis.

vinx13 · 2022-05-13T18:08:44Z

LGTM, let's have #11235 merged first

wrongtest-intellif · 2022-05-19T07:03:23Z

To enable region cover proof on such cases, we need to lift DetectIterMapPadded to standard implementation for DetectIterMap.

Hzfengsy · 2022-05-23T08:16:14Z

A gentle ping for @vinx13

vinx13 · 2022-05-23T18:43:47Z

@wrongtest Can you elaborate the usage of DetectIterMapPadded in our analysis? Do we need the padding information?

also cc @Lunderberg for DetectIterMap changes

wrongtest-intellif · 2022-05-23T19:33:29Z

usage of DetectIterMapPadded in our analysis

Try merge DetectIterMapPadded and DetectIterMap into the same interface, and replace option require_bijective with a new enum IterMapLevel with three alternatives:

Bijective
for original behavior on require_bijective=true
Surjective
for original behavior on require_bijective=false
Injective
for behavior of DetectIterMapPadded

The #11235 brings great way to analyze iteration form like (x + 7) // 16 with padding. The surjective checking of DetectIterMap is used many where (like region cover check after schedule step), however, it can not leverage this analysis now, it is checked to take no padding predicate.

I think actually, as an example, though (x + 7) // 16 is rewritten into a "padded" iteration form, we could still prove the mapping is surjective, since the left and right padding is no more than the largest divisor by how we pad it. If we extent CheckMapping rules carefully, we may be able to distinguish that

(x + 7) // 16 -> surjective
- this is the access index form in my original failed case
(x + 7) % 16 -> surjective [0, 16) if x's extent is larger than 16
((x + 7) // 16, (x + 7) % 16) -> non-surjective

So from my perspective it would be great if we have a uniform interface and share same padding based analysis. Ideally padding_predicate is not affected for IndexMap functionalities, and it should not introduce false positives in bijective/surjective checking. I'm still working to check more unittest cases and adapt padding analysis if surjective mapping is required.

Do we need the padding information

No, original usages of DetectIterMap do not require padding_predicate as before. But we need prove surjective-ness if padding is added for new iteration form supported by original DetectIterMapPadded .

Lunderberg

I focused on the DetectIterMap changes, and especially like the merging and de-duplication. Mostly just some nitpicks here and there.

include/tvm/arith/iter_affine_map.h

Lunderberg · 2022-05-23T19:54:50Z

src/arith/iter_affine_map.cc

  }

  // Step0.1: Check each index to determine required padding
-  bool allow_padding = !require_bijective;
+  bool allow_padding = check_level != IterMapLevel::Bijective;


This would enable padding for IterMapLevel::Surjective, which I don't think is correct. Since padding is any output value for which no input value exists, any introduction of padding wouldn't be surjective.

That is the claim~ I try to change padding to iter mark itself.

For example,(x + 7) x in [0, 8) => IterMark(IterSplit(IterSum({x}, 7), lower_factor=1, extent=16, scale=1), extent=16 with left_pad=7, right_pad=1

Then (x + 7) // 8 is mapped to range [0, extent//2) == [0, 2), though we have padding into iter mark, the IterSplit's range can be achieved when we only iterate x in it's original domain: (0 + 7) // 8 = 0, (7 + 7) // 8 = 1

Good point, and that does maintain surjectivity for a single index. I'm not entirely sure for the case of two indices, though. For the same x ∈ [0,8), the indices [(x+7)//8, (x+7)%8] would have the same padding left_pad=7 and right_pad=1. Even though each individual index can take any value in the output ((x+7)//8 ∈[0,2) and (x+7)%8 ∈ [0,8)), there are some coordinate pairs that cannot be generated for any value of x (e.g. [0,0] and [1,7]).

I agree! This is where we should be careful. In CheckMapping with surjective mode when padding exists, we check padded // LCM and padded % LCM(or it's sub-splits) must not both exists. The case below depict this check:

sum = 80 + y dom_map = var_dom([(y, 176)]) # (80 + y) // 32 itself could be surjective assert_iter_sum_pattern( {fld(sum, 32): (6, 2, 1)}, dom_map, ) # (80 + y) % 2, ((80 + y) // 2) % 16) could be surjective, # since they can be seen as sub-splits of (80 + y) % 32 assert_iter_sum_pattern( {flm(fld(sum, 2), 16): (16, 0, 1), flm(sum, 2): (2, 0, 1)}, dom_map, ) # but (80 + y) // 32, (80 + y) % 32 are not surjective assert_iter_sum_failure({fld(sum, 32), flm(sum, 32)}, dom_map)

Other kinds of negatives like (80 + y) // 32, (80 + y) // 4 would be banned by existing checking rule.

Lunderberg · 2022-05-23T19:59:00Z

src/arith/iter_affine_map.cc

+    requires_padding_ = requires_padding_ || (left_padding_introduced || right_padding_introduced);
+    padding_predicate_ = padding_predicate_ || (left_padding_predicate || right_padding_predicate);
+  }
+  // ICHECK(CanProveDivisible(info.padded->extent, split->lower_factor));


Should these // ICHECK lines be either uncommented or removed?

Would like to check the padding factor is divisible by split->lower_factor, then the commented check can be ensured from context. I found it may fail unfortunetely due to simplifier's ability limitation when the padded extent contain complex flm/fld expressions.

Got it. I noticed that there were also some simplification steps that needed to increase the number of iterations performed. Is the failure to prove divisibility related, since CanProveDivisible only uses the default of 2 steps?

(I'm also wondering if the default for Analyzer::Simplify should be to iterate until it the simplification converges, rather than using a fixed number of steps.)

python/tvm/arith/iter_affine_map.py

junrushao · 2022-05-24T07:24:36Z

Quick note: #11235 is merged

src/arith/iter_affine_map.cc

vinx13 · 2022-05-27T22:11:46Z

src/arith/iter_affine_map.cc

@@ -1659,7 +1676,7 @@ bool IterMapRewriter::CanProveDivisible(const PrimExpr& lhs, const PrimExpr& rhs
  PrimExpr divisor = normalizer.Convert(rhs);

  return analyzer_->CanProveEqual(dividend, divisor) ||
-         analyzer_->CanProve(floormod(dividend, divisor) == 0);
+         analyzer_->CanProve(analyzer_->Simplify(floormod(dividend, divisor), 8) == 0);


it would be great to have some explanations here that it need more simplification steps

Sorry, that is something forget to revert. There is some cases the division could not be proved like
floormod(0 + -x * 8, x) == 0, floormod(8*c1*c2, c1) == 0, even we increate iteration num. They get work-around here and there, for example,

if (CanProveDivisible(right_edge, divisor)) { right_pad = 0; } else { right_pad = analyzer_->Simplify(floormod(-right_edge, divisor)); }

@Lunderberg suggest Simplify could be optimized to iterate until reaching fix point. But now it is suffice to work on existing tests.

vinx13 · 2022-05-31T03:38:46Z

Could you also update this line https://github.com/apache/tvm/blob/main/src/tir/schedule/primitive/layout_transformation.cc#L395? There are some conflict that CI didn't catch because of concurrent merge

- determine case like x % 16, x in [0, 5) to be non-surjective, since usages may treat the region extent as 16 by mistake. - skip second round of rewrite when there is no padding - fix some typo in comments

junrushao · 2022-05-31T23:54:36Z

One bug from my side is magically fixed by this PR!!

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from bf9c28d to d4d439d Compare May 13, 2022 07:27

tqchen requested a review from vinx13 May 13, 2022 15:17

tqchen assigned vinx13 May 13, 2022

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from d4d439d to 6795cb0 Compare May 23, 2022 13:03

wrongtest-intellif changed the title ~~[Arith][Simplify] Extend simplify rule for floormod(x * c1 + y, c2)~~ [Arith] Merge surjective/non-surjective iter mapping detections May 23, 2022

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from 6795cb0 to 700b702 Compare May 23, 2022 18:21

Lunderberg reviewed May 23, 2022

View reviewed changes

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch 2 times, most recently from a1a2086 to 1c15f4d Compare May 25, 2022 09:45

wrongtest-intellif commented May 25, 2022

View reviewed changes

src/arith/iter_affine_map.cc Show resolved Hide resolved

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch 2 times, most recently from f4280f0 to 001ed50 Compare May 25, 2022 20:36

vinx13 reviewed May 27, 2022

View reviewed changes

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from 001ed50 to f24db1d Compare May 28, 2022 06:33

wrongtest-intellif added 5 commits May 31, 2022 13:06

simplify (x * 96) % 64 to (x * 32) % 64

4c36d01

adapt merge mulmod opt for OffsetOf computation

8ce069d

merge DetectIterMap and DetectIterMapPadded

e028443

adjust related interfaces for IterMapLevel

4344b68

- check incompatible left paddings

8d46bb5

- determine case like x % 16, x in [0, 5) to be non-surjective, since usages may treat the region extent as 16 by mistake. - skip second round of rewrite when there is no padding - fix some typo in comments

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from f24db1d to 48a16f1 Compare May 31, 2022 05:28

rebase upstream

4d1239a

wrongtest-intellif force-pushed the simplify_floormod_after_multiply branch from 48a16f1 to 4d1239a Compare May 31, 2022 05:52

vinx13 approved these changes May 31, 2022

View reviewed changes

vinx13 merged commit c1b22ee into apache:main May 31, 2022

driazati mentioned this pull request Jul 14, 2022

TVM v0.9.0.rc0 Release Candidate Notes #12102

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Arith] Merge surjective/non-surjective iter mapping detections #11287

[Arith] Merge surjective/non-surjective iter mapping detections #11287

wrongtest-intellif commented May 12, 2022 •

edited

Loading

wrongtest-intellif commented May 12, 2022

tqchen commented May 12, 2022

wrongtest-intellif commented May 13, 2022

vinx13 commented May 13, 2022

wrongtest-intellif commented May 19, 2022

Hzfengsy commented May 23, 2022

vinx13 commented May 23, 2022

wrongtest-intellif commented May 23, 2022 •

edited

Loading

Lunderberg left a comment

Lunderberg May 23, 2022

wrongtest-intellif May 24, 2022

Lunderberg May 24, 2022

wrongtest-intellif May 25, 2022 •

edited

Loading

Lunderberg May 23, 2022

wrongtest-intellif May 24, 2022

Lunderberg May 24, 2022

junrushao commented May 24, 2022

vinx13 May 27, 2022

wrongtest-intellif May 28, 2022

vinx13 commented May 31, 2022

junrushao commented May 31, 2022

[Arith] Merge surjective/non-surjective iter mapping detections #11287

[Arith] Merge surjective/non-surjective iter mapping detections #11287

Conversation

wrongtest-intellif commented May 12, 2022 • edited Loading

wrongtest-intellif commented May 12, 2022

tqchen commented May 12, 2022

wrongtest-intellif commented May 13, 2022

vinx13 commented May 13, 2022

wrongtest-intellif commented May 19, 2022

Hzfengsy commented May 23, 2022

vinx13 commented May 23, 2022

wrongtest-intellif commented May 23, 2022 • edited Loading

Lunderberg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wrongtest-intellif May 25, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

junrushao commented May 24, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vinx13 commented May 31, 2022

junrushao commented May 31, 2022

wrongtest-intellif commented May 12, 2022 •

edited

Loading

wrongtest-intellif commented May 23, 2022 •

edited

Loading

wrongtest-intellif May 25, 2022 •

edited

Loading