Improve speed of sum_of_* for CombinatorialFreeModule #33267

tscrim · 2022-02-01T02:29:06Z

We avoid using transient elements as much as possible (and using Cythonization) to speed up these methods.

Depends on #33257

CC: @fchapoton @orlitzky

Component: performance

Author: Travis Scrimshaw

Branch/Commit: 33a7e77

Reviewer: Michael Orlitzky

Issue created by migration from https://trac.sagemath.org/ticket/33267

The text was updated successfully, but these errors were encountered:

tscrim · 2022-02-01T02:49:50Z

Commit: 6d1dbb2

tscrim · 2022-02-01T02:49:50Z

Branch: public/performance/optimize_sum_of_in_cfm-33267

tscrim · 2022-02-01T02:49:50Z

comment:1

sage: F = CombinatorialFreeModule(QQ, ['a', 'b', 'c'])
sage: %timeit F._sum_of_monomials(['a','b','b'])
2.12 µs ± 16 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
sage: %timeit F.sum_of_terms([('a',2), ('c',3)])
1.38 µs ± 26.9 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

versus before

sage: %timeit F._sum_of_monomials(['a','b','b'])
8.11 µs ± 21.6 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
sage: %timeit F.sum_of_terms([('a',2), ('c',3)])
5.1 µs ± 20.5 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

So we get ~4x speedup on these small examples. Likely this will improve the speed across a number of methods as these two methods are used somewhat frequently.

New commits:

`6d1dbb2`	`Speedup of sum_of_* methods by using dictionaries directly.`

orlitzky · 2022-02-01T13:56:15Z

comment:2

This bit,

    for index, coeff in index_coeff_pairs:
        if index in result:
            result[index] += coeff
        else:
            result[index] = coeff
    return remove_zeros(result)

makes several passes through result looking for index. Given that we're going to remove the zeros at the end anyway, would it be any faster to initialize the result with zeros, so that we can add unconditionally? Or to try result[index] += coeff and only do result[index] = coeff if a KeyError is thrown?

tscrim · 2022-02-02T00:33:29Z

comment:3

Initializing zeros would only be better in cases that are highly dense, which is fairly rare IMO. Suppose we are working in the exterior algebra of rank n, which has dimension 2ⁿ. If we simply want to work with 20 terms for a computation (not an unlikely scenario in rank 10). Then we have to fill all 1024 possible entries of this dict, which we then afterwards have to check for 0 (iterating over everything, which is not so good for a dict) and filter most of those out.

Now I did think about catching the KeyError, but I am assuming the most likely scenario is most of the terms are unique and index is not in the `dict. In small scale testing:

sage: def t1():
....:     d = {}
....:     try:
....:         d[5] += 1
....:     except KeyError:
....:         d[5] = 1
....:         
sage: def t2():
....:     d = {}
....:     if 5 in d:
....:         d[5] += 1
....:     else:
....:         d[5] = 1
....:         
sage: %timeit t1()
729 ns ± 0.982 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
sage: %timeit t2()
478 ns ± 20.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

If I instead replace 5 with ind = (1,2,3), I get

sage: %timeit t1()
929 ns ± 7.51 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)
sage: %timeit t2()
656 ns ± 1.84 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

This is why I settled on this.

Something I have thought a bit about is having a sparse and dense version of CFM (and bring its implementation much closer FreeModule). However, that would likely be a major project. Perhaps I should propose that as a GSoC project... So the dense case would become useful for those that want it (or "small" dimensional algebras). That's for later though.

sagetrac-git · 2022-02-02T00:40:57Z

Branch pushed to git repo; I updated commit sha1. New commits:

`7229a19`	`some details about shuffle of words and multizetas`
`ec60e55`	`Merge branch 'u/chapoton/33102' in 9.5`
`7ea8aea`	`fix a bug in multiple zeta values`
`c4415d7`	`Merge branch 'u/chapoton/33257' of git://trac.sagemath.org/sage into public/performance/optimize_sum_of_in_cfm-33267`
`fd50bb4`	`One additional optimization to multiple zetas.`

sagetrac-git · 2022-02-02T00:40:57Z

Changed commit from 6d1dbb2 to fd50bb4

tscrim · 2022-02-02T00:41:48Z

comment:5

I did one additional optimization I noticed while reviewing #33257.

tscrim · 2022-02-02T00:41:48Z

Dependencies: #33257

orlitzky · 2022-02-02T14:33:18Z

Reviewer: Michael Orlitzky

orlitzky · 2022-02-02T14:33:18Z

comment:6

Ok, it does what it says. I've been testing it on my own CFM code with no problems.

One more nitpick: in sum_of_terms, you mention that the argument can be any iterable, but

cpdef dict sum_of_monomials(monomials, scalar):
    r"""
    Return the pointwise addition of ``monomials``.

    INPUT:

    - ``monomials`` -- a list of indices representing the monomials

only mentions a list. I think an iterable would work there too? Not a big deal.

I also spent some time trying to figure out how to remove the double-loop from remove_zeros(). The best I could come up with is to use a dict comprehension like { index: D[index] for index in D if D[index] }, but that creates a new dict so it's not guaranteed to be any faster.

sagetrac-git · 2022-02-02T23:49:36Z

Branch pushed to git repo; I updated commit sha1 and set ticket back to needs_review. New commits:

`33a7e77`	`Update doc of sum_of_monomials() to include iterables.`

sagetrac-git · 2022-02-02T23:49:36Z

Changed commit from fd50bb4 to 33a7e77

tscrim · 2022-02-03T00:01:48Z

comment:8

Replying to @orlitzky:

Ok, it does what it says. I've been testing it on my own CFM code with no problems.

Thank you for the review.

One more nitpick: in sum_of_terms, you mention that the argument can be any iterable, but
cpdef dict sum_of_monomials(monomials, scalar):
    r"""
    Return the pointwise addition of ``monomials``.

    INPUT:

    - ``monomials`` -- a list of indices representing the monomials
only mentions a list. I think an iterable would work there too? Not a big deal.

I fixed it. Since it is a trivial change, I am allowing myself to set this back to a positive review. Feel free to revert if you disagree.

I also spent some time trying to figure out how to remove the double-loop from remove_zeros(). The best I could come up with is to use a dict comprehension like { index: D[index] for index in D if D[index] }, but that creates a new dict so it's not guaranteed to be any faster.

That would be bad when there are very few zeros, but say the dict is really big. I feel that is a more common scenario than having a lot of zeros, and a list is cheaper to create I believe. There will always be a scenario that behaves badly for whichever implementation unfortunately. So IMO we just have to chose the one which seems least likely to occur.

vbraun · 2022-02-20T13:27:39Z

Changed branch from public/performance/optimize_sum_of_in_cfm-33267 to 33a7e77

tscrim added this to the sage-9.6 milestone Feb 1, 2022

tscrim added p: major / 3 labels Feb 1, 2022

tscrim added s: needs review labels Feb 1, 2022

orlitzky added s: positive review and removed s: needs review labels Feb 2, 2022

sagetrac-git mannequin added s: needs review and removed s: positive review labels Feb 2, 2022

tscrim added s: positive review and removed s: needs review labels Feb 3, 2022

vbraun removed the s: positive review label Feb 20, 2022

vbraun closed this as completed in ffa81bb Feb 20, 2022

tscrim mentioned this issue Feb 9, 2022

schur functions construct elements with coefficients in the wrong base ring #33313

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve speed of sum_of_* for CombinatorialFreeModule #33267

Improve speed of sum_of_* for CombinatorialFreeModule #33267

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

orlitzky commented Feb 1, 2022

tscrim commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

tscrim commented Feb 2, 2022

tscrim commented Feb 2, 2022

orlitzky commented Feb 2, 2022

orlitzky commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

tscrim commented Feb 3, 2022

vbraun commented Feb 20, 2022

Improve speed of sum_of_* for CombinatorialFreeModule #33267

Improve speed of sum_of_* for CombinatorialFreeModule #33267

Comments

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

tscrim commented Feb 1, 2022

orlitzky commented Feb 1, 2022

tscrim commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

tscrim commented Feb 2, 2022

tscrim commented Feb 2, 2022

orlitzky commented Feb 2, 2022

orlitzky commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

sagetrac-git mannequin commented Feb 2, 2022

tscrim commented Feb 3, 2022

vbraun commented Feb 20, 2022