Update random(QQ) to use standard uniform distribution #3481

d-torrance · 2024-09-17T23:53:24Z

This is split off from #3478. I think it makes sense to keep them separate since that deals with the Probability package and this deals with how rationals are randomly generated in the engine. The big change to the commits that were removed from that PR is the proposed probability distribution (no longer normal).

Rather than using the quotient of two discrete random variables with the uniform distribution on {1, ..., h}, we use the continuous uniform distribution on [0, 1], but rounded to the nearest rational number with denominator bounded by the number given by the Height option (default 10). For example:

i1 : random(QQ^3, QQ^3, Height => 3)

o1 = | 2/3 2/3 1/3 |
     | 1   1   2/3 |
     | 1   2/3 1/2 |

              3       3
o1 : Matrix QQ  <-- QQ

i2 : random(QQ^3, QQ^3)

o2 = | 1/8 1   7/8  |
     | 1/3 5/6 9/10 |
     | 1   3/4 1/7  |

              3       3
o2 : Matrix QQ  <-- QQ

i3 : random(QQ^3, QQ^3, Height => 100)

o3 = | 61/65 33/59 3/16  |
     | 15/26 7/26  31/39 |
     | 3/26  7/48  62/95 |

              3       3
o3 : Matrix QQ  <-- QQ

M2/Macaulay2/packages/Macaulay2Doc/functions/random-doc.m2

Have it call rawSetRandomQQ to de-duplicate code

This gives us the closest rational number to a given real number with bounded denominator. It's available at top level using the unexported rawFareyApproximation.

d-torrance · 2024-09-18T18:55:47Z

I think the build failures are coming from examples that use random in some way and that have some nonzero probability of failing. This PR is changing things so that we're only calling the pseudorandom number generator once instead of twice for each call to rawRandomQQ, we're getting different random numbers and getting some unexpected results.

In particular, CoincidentRootLoci::realrank sometimes calls QEPCAD, which isn't available in the macOS builds, and GraphicslModelsMLE::solverMLE sometimes results in division by zero.

I was able to reproduce both of these pretty quickly using Macaulay2 1.24.05 on Debian unstable, i.e., before the proposed changes:

i1 : needsPackage "CoincidentRootLoci";

i2 : realrank randomBinaryForm(6, 4, 4)

o2 = 4

i3 : realrank randomBinaryForm(6, 4, 4)

o3 = 4

i4 : realrank randomBinaryForm(6, 4, 4)
stdio:4:10:(3):[1]: error: the ideal is not apolar

i5 : realrank randomBinaryForm(6, 4, 4)

o5 = 4

i6 : realrank randomBinaryForm(6, 4, 4)

o6 = 4

i7 : realrank randomBinaryForm(6, 4, 4)

o7 = 4

i8 : realrank randomBinaryForm(6, 4, 4)
/bin/sh: 1: qepcad: not found
stdio:8:1:(3): error: error occurred while executing QEPCAD. Please make sure that it is installed and configured correctly.

i1 : needsPackage "GraphicalModelsMLE";
-- storing configuration for package Graphs in /root/.Macaulay2/init-Graphs.m2
-- storing configuration for package NumericalAlgebraicGeometry in /root/.Macaulay2/init-NumericalAlgebraicGeometry.m2
-- storing configuration for package Bertini in /root/.Macaulay2/init-Bertini.m2

i2 : G = mixedGraph(digraph {{1,2},{1,3},{2,3},{3,4}},bigraph {{3,4}});

i3 : U = matrix{{6.2849049, 10.292875, 1.038475, 1.1845757}, {3.1938475, 3.2573, 1.13847, 1}, {4/5, 3/2, 9/8, 3/10}, {10/7, 2/3,1, 8/3}};

                4         4
o3 : Matrix RR    <-- RR
              53        53

i4 : solverMLE(G,U,RealPrecision=>10)

o4 = (1.83311, | 4.52904   7.81123   -.0250185 .309951   |, 2)
               | 7.81123   14.3732   -.0382332 .473665   |
               | -.0250185 -.0382332 .00337164 -.0418254 |
               | .309951   .473665   -.0418254 .742627   |

o4 : Sequence

i5 : solverMLE(G,U,RealPrecision=>10)
stdio:5:1:(3): error: atttempt to divide by zero

I'm thinking of replacing the failing examples with canned ones for the time being and also opening issues for both of these examples. Does that sound reasonable?

mahrud · 2024-09-18T20:13:25Z

Instead of canning the examples, try setting the random seed to something that prevents failure.

d-torrance · 2024-09-19T17:16:12Z

For now, I've just added a setRandomSeed line to the offending examples, but I wonder if that might be confusing to a user reading the documentation?

Maybe at some point we should add some way of setting the random seed for a particular example (or test)?

M2/Macaulay2/m2/galois.m2

mahrud · 2024-09-20T17:32:12Z

For now, I've just added a setRandomSeed line to the offending examples, but I wonder if that might be confusing to a user reading the documentation?

Maybe at some point we should add some way of setting the random seed for a particular example (or test)?

This kind of situation is fairly rare, so I think it's fine. Also, I think it's helpful for users to see that they can produce the results identical to the documentation.

mahrud · 2024-09-20T17:33:31Z

I'm happy with this PR, but given that it's changing the behavior of random rather significantly, I think it would be good to have a second opinion from @mikestillman or @antonleykin.

d-torrance · 2024-09-20T17:37:21Z

I'm happy with this PR, but given that it's changing the behavior of random rather significantly, I think it would be good to have a second opinion from @mikestillman or @antonleykin.

Yes, I agree!

mahrud · 2024-09-20T17:43:33Z

While we're here, could you also add something like random RR for QQ? e.g.

random QQ := QQ => opts -> x -> x * random(QQ, opts)

A problem with this method is that the height is then skewed (e.g. 6 * random(QQ, Height => 3) always gives integers), so maybe we want random x + random(QQ, opts) instead? I'm not sure what's the best approach.

d-torrance · 2024-09-20T22:34:36Z

I pushed a commit adding random(QQ) by first calling random(RR) and then rounding it. That seems to work pretty well:

i1 : apply(20, i -> random(10/1))

      19  11  59  25  7  5  17  89  4  8  33  31  31  11  53     17  64  31  53
o1 = {--, --, --, --, -, -, --, --, -, -, --, --, --, --, --, 4, --, --, --, --}
      10   5   6   8  6  2   3  10  7  5   4  10   4   3   9      6   7   6   6

o1 : List

d-torrance · 2024-09-20T23:49:59Z

I went ahead and added more random methods, so now pretty much any Number object or pair of Number objects should do something sensible.

For the complex numbers, if we give it $a + bi$, it will give us something in the rectangle $[0, a]\times [0, b]$ on the complex plane, and if we give it $a + bi$ and $c + di$, it will give us something in $[a,c]\times[b, d]$. (The order on the endpoints may be reversed if necessary.)

i1 : random(1, pi)

o1 = 1.47764022754653

o1 : RR (of precision 53)

i2 : random(1/2, ii)

o2 = .398515040066406+.230213567526911*ii

o2 : CC (of precision 53)

mahrud · 2024-09-21T15:57:49Z

I went ahead and added more random methods, so now pretty much any Number object or pair of Number objects should do something sensible.

I don't like the "pair of numbers" methods and would even advocate for deprecating random(ZZ, ZZ). They overcrowd the methods on random for very little gain, since they are simply for non-integers random(min, max) = min + max * random class max or min + random max for integers. The reason I wanted random QQ is that otherwise getting the height correct is nontrivial.

d-torrance · 2024-09-21T20:47:26Z

Fair enough -- I'll remove them. It was redundant with random uniformDistribution(a, b) from the Probability package anyway

d-torrance · 2024-09-21T20:50:28Z

I'm wondering if maybe the support of random(QQ) should be [0, opts.Height] instead of [0, 1]? That would keep it consistent with the current behavior, and might cut down on the random seed failures since I think now there's a higher probability of singular random rational matrices, etc, with the smaller support

mahrud · 2024-09-21T23:19:55Z

Yeah, maybe. Height of a rational number a/b is usually defined as max(a,b) so you're right that that would make more sense. I guess I'm just annoyed that Height has different effects for ZZ and QQ and seemingly no effect for RR or CC. Can we make these consistent somehow and document them all in the same page?

Also, should random CC give a random complex number on the unit circle instead of a random complex number in the unit square? Not sure which is preferable.

We then round the result to the nearest rational number with denominator bounded by the Height option using Farey approximation

It's defined in the interpreter, which we don't link against here, and we need it for rawSetRandomQQ.

Previously misspelled the word "attempt". We use the same message as the same error in the interpreter.

For consistency w/ rawRandomRRNormal

This way, calling GF(p^n) 100 times won't give us 100 different GaloisField objects. This fixes a strange example in the "random(Ring)" docs where we called "tally for i to 100 list random GF 11" and got a Tally object with 100 different key-value pairs.

Otherwise, we end up calling QEPCAD, which will fail if it's not available (e.g., on the macOS GitHub builds).

d-torrance · 2024-09-22T12:11:40Z

I've updated the support so that random QQ returns a random rational number in [0, opts.Height]. At least locally, this fixes it so that only one of the tests/examples that previously needed a different random seed still needs one.

So for RR and CC, are you suggesting that we adjust it so that the support is [0, opts.Height] and [0, opts.Height] x [0, opts.Height], respectively (or [0, opts.Height] x [0, 2$\pi$] in polar coords if we switch to a circle in the complex plane)?.

d-torrance · 2024-09-22T13:07:37Z

Just pushed another commit fixing a bug I found working on this:

Before

i1 : random(QQ, Height => 0)
Floating point exception (core dumped)

After

i1 : random(QQ, Height => 0)
stdio:1:6:(3): error: expected a positive height

mahrud · 2024-09-22T14:23:41Z

M2/Macaulay2/d/interface.dd

@@ -66,8 +66,7 @@ export rawFareyApproximation(e:Expr):Expr := (
 setupfun("rawFareyApproximation", rawFareyApproximation);
 export rawRandomQQ(e:Expr):Expr := (
     when e
-     is Nothing do toExpr(Ccode(QQ, "rawRandomQQ(", "0)"))
-     is ht:ZZcell do toExpr(Ccode(QQ, "rawRandomQQ(", ht.v, ")"))
+     is ht:ZZcell do toExpr(Ccode(QQorNull, "rawRandomQQ(", ht.v, ")"))


I think it might be simpler to just assert that Height is positive in the top-level, no?

My concern was that we also want to catch this when making random matrices, which is a different call to the engine that isn't specific to the coefficient ring. Is height always positive for all the various meanings of height we might want for random? Or could it be zero/negative in other contexts?

d-torrance · 2024-09-22T21:29:58Z

I keep re-thinking what the support of random QQ should be lol...

Considering that the height of a rational number is the max of the absolute values of its numerator and denominator, and the proposed behavior would allow larger numerators, then maybe the current behavior is basically correct, or at least has the correct support. But instead maybe we should have a uniform distribution on that support, rather than the current distribution where 1 is more likely than anything else. So in other words, instead of the distribution being $\frac{X}{Y}$ where $X,Y\in\{1,\ldots,h\}$, we have $\frac{X}{Y}$ where $X,Y\in\{1,\ldots,h\}$ and $\gcd(X,Y)=1$.

So rawRandomQQ could be something like (pseudocode):

while (
    x = random_int(1, h)
    y = random_int(1, h)
    gcd(x, y) != 1) do nothing
return x/y

d-torrance · 2024-09-26T17:27:15Z

So where did we land on the support? [0, 1] or [0, height]?

mahrud · 2024-09-26T18:11:36Z

I think new option Support with default value [-1,1]? And perhaps this?

random QQ := o -> r -> random(QQ, o, Support => [0,r])

mahrud · 2024-11-06T23:29:04Z

If it makes sense to you, I think we should merge this and work on the optional arguments in follow ups.

d-torrance · 2024-11-07T02:32:50Z

I'm not sure if the current proposed behavior is what we want, though, since the numerator can exceed the height.

mahrud · 2024-11-19T00:54:08Z

Related: #999 and #2089.

d-torrance requested review from mikestillman and mahrud September 17, 2024 23:53

d-torrance force-pushed the random-qq branch from 23db5d8 to 2dcfd43 Compare September 18, 2024 00:44

mahrud approved these changes Sep 18, 2024

View reviewed changes

M2/Macaulay2/packages/Macaulay2Doc/functions/random-doc.m2 Outdated Show resolved Hide resolved

d-torrance added 2 commits September 18, 2024 07:30

Simplify rawSetRandomQQ

dc4c1c0

Have it call rawSetRandomQQ to de-duplicate code

Add routine for computing Farey approximations to engine

59cf054

This gives us the closest rational number to a given real number with bounded denominator. It's available at top level using the unexported rawFareyApproximation.

d-torrance force-pushed the random-qq branch from 2dcfd43 to c976fa1 Compare September 18, 2024 11:34

d-torrance marked this pull request as draft September 19, 2024 03:32

d-torrance force-pushed the random-qq branch from d454379 to 6d494b3 Compare September 19, 2024 17:11

d-torrance marked this pull request as ready for review September 20, 2024 01:17

mahrud approved these changes Sep 20, 2024

View reviewed changes

M2/Macaulay2/m2/galois.m2 Show resolved Hide resolved

mahrud mentioned this pull request Sep 20, 2024

memoize and options #3491

Open

d-torrance force-pushed the random-qq branch 2 times, most recently from 95e64e3 to 3265493 Compare September 20, 2024 23:44

d-torrance force-pushed the random-qq branch from 2f2ee70 to 428b1ea Compare September 21, 2024 22:04

d-torrance added 2 commits September 22, 2024 00:33

Update random(QQ) to use uniform distribution on [0, height]

6fcb951

We then round the result to the nearest rational number with denominator bounded by the Height option using Farey approximation

Add unit tests for Farey approximation

7a1f5b6

d-torrance added 8 commits September 22, 2024 00:33

Define gmp_defaultPrecision for engine unit tests

0808832

It's defined in the interpreter, which we don't link against here, and we need it for rawSetRandomQQ.

Reword division by zero error message in engine

de2eb75

Previously misspelled the word "attempt". We use the same message as the same error in the interpreter.

Rename rawRandomRR -> rawRandomRRUniform

51f614b

For consistency w/ rawRandomRRNormal

Memoize GF(ZZ,ZZ)

0471672

This way, calling GF(p^n) 100 times won't give us 100 different GaloisField objects. This fixes a strange example in the "random(Ring)" docs where we called "tally for i to 100 list random GF 11" and got a Tally object with 100 different key-value pairs.

Add random(QQ)

09f9748

Document random(QQ)

97fd04e

Update random matrix Core test for new random(QQ) behavior

a09c8fa

Update ComputationsBook test for new random(QQ) behavior

25efd13

d-torrance force-pushed the random-qq branch from 428b1ea to 25efd13 Compare September 22, 2024 04:48

Use a different random seed for randomBinaryForm example

82aabfe

Otherwise, we end up calling QEPCAD, which will fail if it's not available (e.g., on the macOS GitHub builds).

Raise an error in random(QQ) if height is nonpositive

7fabcd3

mahrud reviewed Sep 22, 2024

View reviewed changes

d-torrance marked this pull request as draft September 22, 2024 21:30

mahrud added this to the version 1.24.11 milestone Oct 7, 2024

d-torrance modified the milestones: version 1.24.11, version 1.25.05 Oct 25, 2024

mahrud linked an issue Nov 19, 2024 that may be closed by this pull request

random QQ #999

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update random(QQ) to use standard uniform distribution #3481

Update random(QQ) to use standard uniform distribution #3481

d-torrance commented Sep 17, 2024

d-torrance commented Sep 18, 2024

mahrud commented Sep 18, 2024

d-torrance commented Sep 19, 2024

mahrud commented Sep 20, 2024

mahrud commented Sep 20, 2024

d-torrance commented Sep 20, 2024

mahrud commented Sep 20, 2024

d-torrance commented Sep 20, 2024

d-torrance commented Sep 20, 2024 •

edited

Loading

mahrud commented Sep 21, 2024

d-torrance commented Sep 21, 2024

d-torrance commented Sep 21, 2024

mahrud commented Sep 21, 2024

d-torrance commented Sep 22, 2024

d-torrance commented Sep 22, 2024

mahrud Sep 22, 2024 •

edited

Loading

d-torrance Sep 22, 2024

d-torrance commented Sep 22, 2024

d-torrance commented Sep 26, 2024

mahrud commented Sep 26, 2024 •

edited

Loading

mahrud commented Nov 6, 2024

d-torrance commented Nov 7, 2024

mahrud commented Nov 19, 2024

Update random(QQ) to use standard uniform distribution #3481

Are you sure you want to change the base?

Update random(QQ) to use standard uniform distribution #3481

Conversation

d-torrance commented Sep 17, 2024

d-torrance commented Sep 18, 2024

mahrud commented Sep 18, 2024

d-torrance commented Sep 19, 2024

mahrud commented Sep 20, 2024

mahrud commented Sep 20, 2024

d-torrance commented Sep 20, 2024

mahrud commented Sep 20, 2024

d-torrance commented Sep 20, 2024

d-torrance commented Sep 20, 2024 • edited Loading

mahrud commented Sep 21, 2024

d-torrance commented Sep 21, 2024

d-torrance commented Sep 21, 2024

mahrud commented Sep 21, 2024

d-torrance commented Sep 22, 2024

d-torrance commented Sep 22, 2024

Before

After

mahrud Sep 22, 2024 • edited Loading

Choose a reason for hiding this comment

d-torrance Sep 22, 2024

Choose a reason for hiding this comment

d-torrance commented Sep 22, 2024

d-torrance commented Sep 26, 2024

mahrud commented Sep 26, 2024 • edited Loading

mahrud commented Nov 6, 2024

d-torrance commented Nov 7, 2024

mahrud commented Nov 19, 2024

d-torrance commented Sep 20, 2024 •

edited

Loading

mahrud Sep 22, 2024 •

edited

Loading

mahrud commented Sep 26, 2024 •

edited

Loading