test that instances for Eq and Ord agree with going via toAscList #670

jwaldmann · 2019-07-22T17:01:09Z

add test case, see #470

treeowl · 2019-07-22T17:53:17Z

containers-tests/tests/intset-properties.hs

+prop_instanceEqIntSet x y = (x == y) == (toAscList x == toAscList y)
+
+prop_instanceOrdIntSet :: IntSet -> IntSet -> Bool
+prop_instanceOrdIntSet x y = (compare x y) == (compare (toAscList x) (toAscList y))


We already have prop_ord for this. I don't see a similar prop_eq though.

not exactly "this" (prop_ord uses toList, I think it should use toAscList #470 (comment) )

Ah, yes, toAscList is a bit more correct. That said, I would be astonished if we made toList do something different. While it's true that it's not documented as producing keys in order, it's also not explicitly documented as potentially producing keys in some other order. There are almost certainly people whose code will break in horrible ways if we change that.

that avoids toAscList and walks the tree directly. See haskell#470

containers/src/Data/IntSet/Internal.hs

jwaldmann · 2019-07-23T10:35:04Z

How worried should I be about travis failing? I am using <> and foldMap in the benchmark, this trips up ghc-8.2.2. (How come it works with 7.6.3?)

jwaldmann · 2019-07-23T13:59:27Z

It's better now - travis breaks only for ghc-7.8.4, which does not have foldMap. Do I need to fix this? Do you know how?

treeowl · 2019-07-23T14:21:37Z

Travis has to work, yeah. But you don't have to run the detailed test on 7.8.

treeowl · 2019-07-23T14:25:45Z

Can you try to give the helper functions somewhat more informative names, and explain what they do in comments?

jwaldmann · 2019-07-23T15:50:54Z

Sorry for 7.8.4 I have difficulty testing locally: for me it's either this:

PATH=/opt/ghc/ghc-7.8.4/bin:$PATH  stack --resolver=lts-2.22  bench containers-tests:intset-benchmarks  
Stack no longer supports Cabal versions below 1.19.2,
but version 1.18.1.5 was found.
...
Error: While constructing the build plan, the following exceptions were encountered:

In the dependencies for gauge-0.2.4:
    basement is a library dependency, but the package provides no library
needed due to containers-tests-0 -> gauge-0.2.4

or that:

PATH=/opt/ghc/ghc-7.8.4/bin:$PATH cabal bench
...
[33 of 36] Compiling Data.Tree        ( src/Data/Tree.hs, dist/build/Data/Tree.o )

src/Data/Tree.hs:103:23: parse error on input ‘-- ^ @since 0.5.8’

treeowl · 2019-07-23T16:35:01Z

Can't you test using cabal test directly? Why the heck would there be a parse error on a comment?

containers/src/Data/IntSet/Internal.hs

treeowl · 2019-07-23T17:11:23Z

containers/src/Data/IntSet/Internal.hs

+relateTop t1@(Tip p1 bm1) t2@(Bin p2 m2 l2 r2)
+  | mixed t2 = combine_right (relate t1 r2)
+  | otherwise = relate t1 t2
+relateTop t1@(Tip _ _) t2@(Tip _ _) = relateTipTip t1 t2


I haven't tried to work through it, but mixed strikes me as an odd way to organize this. Would this work better?

splitSign :: IntSet -> (IntSet,IntSet) splitSign t@(Tip kx _) | kx >= 0 = (Nil, t) | otherwise = (t, Nil) splitSign t@(Bin p m l r) -- m < 0 is the usual way to find out if we have positives and negatives (see findMax) | m < 0 = (r, l) | p < 0 = (t, Nil) | otherwise = (Nil, t) splitSign Nil = (Nil, Nil)

that splits an IntSet into the negative and non-negative elements. Then

compare xs ys | (xsNeg, xsNonNeg) <- splitSign xs , (ysNeg, ysNonNeg) <- splitSign ys = case relate xsNeg ysNeg of Less -> LT Prefix -> if null xsNonNeg then LT else GT Equals -> orderingOf (relate xsNonNeg ysNonNeg) FlipPrefix -> if null ysNonNeg then GT else LT Greater -> GT

OK I can do this - but I was hoping that if relateTop and orderingOf get inlined, it'll result in the same code.

Well, the thing I'm really going for amounts to structuring relateTop more clearly. If you want to maintain the separation you describe, you can definitely do that with this structure as well. That would be totally fine by me.

I am now (81bae9c) using the code that you proposed.

treeowl · 2019-07-23T17:26:05Z

containers/src/Data/IntSet/Internal.hs

+-- | precondition: each argument is non-Nil and non-mixed
+relate :: IntSet -> IntSet -> Relation
+relate t1@(Tip p1 bm1) t2@(Tip p2 bm2) = relateTipTip t1 t2
+relate t1@(Bin p1 m1 l1 r1) t2@(Bin p2 m2 l2 r2)


Can't we play any tricks here? I have to think there's some way to use the prefixes and masks. For example, if we have

t1 = Bin 0100000 0000100 l1 r1 t2 = Bin 0010000 0001000 l2 r2

I think we can reach a conclusion immediately.

In a bit more detail.... It seems to me that if p1 < p2, then t1 ≤ t2. Is that right? If so, then can we use the masks to determine (sometimes) that t1 < t2?

I hope that this happens automatically since the compiler should inline orderingOf $ relateTipTip ..

No, I mean pulling tricks in the Bin/Bin case. You can surely do the same thing you do with Tip: get lower and upper bounds for the trees, and if they don't overlap you have an answer.

prefixes are tricky: assume wordSize is one (else multiply everthing by some large enough power of two), then fromList [2,3] has prefix 10 (binary) but fromList[3,4] has smaller prefix 0 (?)

Okay, but can't you use your lower bound/upper bound calculation anyway? If the upper bound of one tree is less than the lower bound of the other, I think that's it.

lower/upper-bound: yes this should be correct for any Tip/Bin combo:

| succUpperbound t1 <= lowerbound t2 = Less | lowerbound t1 >= succUpperbound t2 = Greater

For now, this is inside the guards of some of the pattern matches but this could also be inverted (compare bounds first, and pattern match only if needed). This needs benchmarking (not today).

I think that computing "lowerbound" inside a branch of a pattern match is better: the inliner then should be able to remove the pattern match that is in the implementation of "lowerbound" (does it really?)

I inserted the lower-upper test in the Bin/Bin case as helps to avoid recursion.

Bodigrim found new `Ord` instance in haskell#783. Put the old one one back.

Bodigrim found new `Ord` instance in haskell#783. Put the old one one back. Fixes haskell#783.

Bodigrim found a bug in the new `Ord` instance in haskell#783. Put the old one one back. Fixes haskell#783.

Bodigrim found a bug in the new `Ord` instance in #783. Put the old one one back. Fixes #783.

treeowl · 2021-06-28T07:59:33Z

The implementation changes for IntSet have been reverted to fix #783. It would be nice to reinstate them correctly.

test that instances for Eq and Ord agree with going via toAscList

45d19b1

treeowl reviewed Jul 22, 2019

View reviewed changes

jwaldmann added 2 commits July 22, 2019 20:20

add benchmark for "instance Ord IntSet", using "Set IntSet"

9e87998

improved implementation of "instance Ord IntSet"

c0fb190

that avoids toAscList and walks the tree directly. See haskell#470

treeowl reviewed Jul 22, 2019

View reviewed changes

containers/src/Data/IntSet/Internal.hs Outdated Show resolved Hide resolved

relate: handle full trees in relateTop, simplify recursive case

dee87a5

for compilation with older ghcs (who don't have <>)

3eb7c58

jwaldmann added 3 commits July 23, 2019 17:18

add import for ghc-7.8.4 and below

0606cfd

add explanation for helper functions

392bf60

do not use 'length' since it was not generic in base < 4.8

b9d9608

treeowl reviewed Jul 23, 2019

View reviewed changes

jwaldmann added 2 commits July 27, 2019 20:28

replace mixed/relateTop by splitSign/relate

81bae9c

avoid recursion by detecting that ranges are disjoint

9454e35

jwaldmann mentioned this pull request Jul 28, 2019

IntSet: reverse bitmap for faster comparison? #674

Open

treeowl merged commit 7aff529 into haskell:master Dec 22, 2019

sjakobi mentioned this pull request Jun 23, 2021

instance Ord IntSet is broken since #670 #783

Closed

treeowl added a commit to treeowl/containers that referenced this pull request Jun 28, 2021

Partially revert haskell#670

d57aa62

Bodigrim found new `Ord` instance in haskell#783. Put the old one one back.

treeowl added a commit to treeowl/containers that referenced this pull request Jun 28, 2021

Partially revert haskell#670

557d057

Bodigrim found new `Ord` instance in haskell#783. Put the old one one back. Fixes haskell#783.

treeowl added a commit to treeowl/containers that referenced this pull request Jun 28, 2021

Partially revert haskell#670

bd6bcce

Bodigrim found a bug in the new `Ord` instance in haskell#783. Put the old one one back. Fixes haskell#783.

treeowl added a commit that referenced this pull request Jun 28, 2021

Partially revert #670

adfee37

Bodigrim found a bug in the new `Ord` instance in #783. Put the old one one back. Fixes #783.

treeowl mentioned this pull request Jun 28, 2021

Improve Ord IntSet instance #787

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test that instances for Eq and Ord agree with going via toAscList #670

test that instances for Eq and Ord agree with going via toAscList #670

jwaldmann commented Jul 22, 2019

treeowl Jul 22, 2019

jwaldmann Jul 22, 2019

treeowl Jul 22, 2019

jwaldmann commented Jul 23, 2019 •

edited

Loading

jwaldmann commented Jul 23, 2019

treeowl commented Jul 23, 2019

treeowl commented Jul 23, 2019

jwaldmann commented Jul 23, 2019

treeowl commented Jul 23, 2019

treeowl Jul 23, 2019

jwaldmann Jul 27, 2019

treeowl Jul 27, 2019

jwaldmann Jul 27, 2019

treeowl Jul 23, 2019

treeowl Jul 23, 2019

jwaldmann Jul 27, 2019 •

edited

Loading

treeowl Jul 27, 2019

jwaldmann Jul 27, 2019

treeowl Jul 27, 2019

jwaldmann Jul 27, 2019

jwaldmann Jul 28, 2019

treeowl commented Jun 28, 2021

test that instances for Eq and Ord agree with going via toAscList #670

test that instances for Eq and Ord agree with going via toAscList #670

Conversation

jwaldmann commented Jul 22, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jwaldmann commented Jul 23, 2019 • edited Loading

jwaldmann commented Jul 23, 2019

treeowl commented Jul 23, 2019

treeowl commented Jul 23, 2019

jwaldmann commented Jul 23, 2019

treeowl commented Jul 23, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jwaldmann Jul 27, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

treeowl commented Jun 28, 2021

jwaldmann commented Jul 23, 2019 •

edited

Loading

jwaldmann Jul 27, 2019 •

edited

Loading