Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[resubmit] Fix bug of FastReducer used in BigInteger.ModPow #55122

Closed
wants to merge 11 commits into from

Conversation

key-moon
Copy link
Contributor

@key-moon key-moon commented Jul 3, 2021

I accidentally deleted the forked repository, so I will resubmit the PR #53945 . Please refer to the previous discussion there. I apologize for the inconvenience.

This pull request fixes two bugs that existed in FastReducer.

1.

There is a bug in the current implementation of FastReducer that causes a runtime error when certain conditions described below are met.

  • Modulus is less than or equal to base^{modulus.Length-1}.
  • Length of divided value must be sufficiently large when call reduce (this length depending on the modulus).

When these conditions are met, the size of the buffers q1 and q2 used in the algorithm becomes insufficient, causing a assertion error on BigIntegerCalculator.FastReducer.cs#L77 in debug build, or IndexOutOfRangeException on BigIntegerCalculator.PowMod.cs#L401 in Release build.

FastReducer is used only in BigInteger.ModPow, but the specific condition cause above error. Following code is one of the example.

BigInteger.ModPow(BigInteger.One << 1008, 2, BigInteger.One << 992);

This PR fixes this bug by changing the allocated size of the buffer. The validity of the change is explained below.

Edit: After submitting this PR, I found a bug that is related but has a different cause, so I will fix that as well. See the below comments for details.

Explanation

Unless otherwise stated, all variables are non-negative and division is defined as truncated division.

Before we begin, we will define a few constants and functions.

               base := 2^32
      Length(value) := Length of the current array representing the value
ActualLength(value) := Minimum array length that needs to represent value

First, we will look at the algorithm of the constructor.

given m: uint array notation of modulus

let k := Length(m)
let r := base^(2k)

let mu := r / m
let muLength := ActualLength(mu)

Note: The variable k is implicitly defined and does not appear in the actual code.

The above corresponds to the following code.

public FastReducer(uint[] modulus)
{
Debug.Assert(modulus != null);
// Let r = (2^32)^(2k), with (2^32)^k > m
uint[] r = new uint[modulus.Length * 2 + 1];
r[r.Length - 1] = 1;
// Let mu = r / m
_mu = Divide(r, modulus);
_modulus = modulus;
// Allocate memory for quotients once
_q1 = new uint[modulus.Length * 2 + 2];
_q2 = new uint[modulus.Length * 2 + 1];
_muLength = ActualLength(_mu);
}

Next, we will look at the algorithm of the Reduce function.

given V: value that ActualLength(V) is less than or equal 2k
      ⇔ V is less than base^(2k)

let q1 := V / base^(k-1) * mu
let q2 := q1 / base^(k+1) * m

The above corresponds to the following code.

public int Reduce(uint[] value, int length)
{
Debug.Assert(value != null);
Debug.Assert(length <= value.Length);
Debug.Assert(value.Length <= _modulus.Length * 2);
// Trivial: value is shorter
if (length < _modulus.Length)
return length;
// Let q1 = v/(2^32)^(k-1) * mu
int l1 = DivMul(value, length, _mu, _muLength,
_q1, _modulus.Length - 1);
// Let q2 = q1/(2^32)^(k+1) * m
int l2 = DivMul(_q1, l1, _modulus, _modulus.Length,
_q2, _modulus.Length + 1);
// Let v = (v - q2) % (2^32)^(k+1) - i*m
return SubMod(value, length, _q2, l2,
_modulus, _modulus.Length + 1);
}

Here, we will think about upper bound of ActualLength(q1).
Since upper bound of V is base^(2k), upper bound of V / base^(k-1) is base^(k+1).
Since modulus should not be 0, r and mu can't be 0. So mu is less than base^(muLength).
It is easy to show that an upper bound on the values of both V and mu can be achieved.
From these facts, we can see that the upper bound of q1 is base^(muLength + k + 1).

Same as q1, we can see that upper bound of q2 is base^(muLength + k)

Now we have shown the upper bound of q1 and q2. From these, we can see that the upper bound of length of the buffer required for q1 and q2 is: ActualLength(q1) ≦ muLength + k + 1, ActualLength(q2) ≦ muLength + k.

2.

There is one additional bug in the current FastReducer implementation. This bug occurs when the lower half bit of the value is less than value%modulus.
This happens when you call BigInteger.ModPow using the FastReducer as follows:

BigInteger.ModPow(BigInteger.One << 1008, 2, (BigInteger.One << 992) + 1);

The above commit adds a test that covers this case and fixes the bug. Below is an explanation of the cause and fix for this bug.

Explanation

In this explanation, the symbols used in the above explanation are reused.
Barett Reduction is an algorithm that takes advantage of the fact that V mod m = V - ⌊v/m⌋m holds. In this algorithm, value of q2 is ⌊v/m⌋m. Now we will look at the algorithm of the last subtraction.

v := (v - q2) % base^(k+1)
while m ≦ v:
  v := v - m

The above corresponds to the following code.

// Let v = (v - q2) % (2^32)^(k+1) - i*m
return SubMod(value, length, _q2, l2,
_modulus, _modulus.Length + 1);

private static unsafe int SubMod(uint[] left, int leftLength,
uint[] right, int rightLength,
uint[] modulus, int k)
{
Debug.Assert(left != null);
Debug.Assert(left.Length >= leftLength);
Debug.Assert(right != null);
Debug.Assert(right.Length >= rightLength);
// Executes the subtraction algorithm for left and right,
// but considers only the first k limbs, which is equivalent to
// preceding reduction by 2^(32*k). Furthermore, if left is
// still greater than modulus, further subtractions are used.
if (leftLength > k)
leftLength = k;
if (rightLength > k)
rightLength = k;
fixed (uint* l = left, r = right, m = modulus)
{
SubtractSelf(l, leftLength, r, rightLength);
leftLength = ActualLength(left, leftLength);
while (Compare(l, leftLength, m, modulus.Length) >= 0)
{
SubtractSelf(l, leftLength, m, modulus.Length);
leftLength = ActualLength(left, leftLength);
}
}
Array.Clear(left, leftLength, left.Length - leftLength);
return leftLength;
}

The reason we divide by base^(k+1) first is that we are trying to speed up the process by using the fact that mod is greater than base^(k+1) by the definition of k. This algorithm itself is reasonable and there is no problem.
However, the actual implementation process is as follows:

given left, right, modulus, k

left := left % base^k
right := right % base^k
left := left - right
while modulus ≦ left:
  left := left - modulus

The SubtractSelf function, which is performing the subtraction in line 133, does not allow the subtrahend to be larger than minuend. Therefore, this code may result in an assertion error.
This can be avoided by using the SubtractSelf function which allows overflow. The reason this works is that the overflow in leftLength=k acts as operation on Z/(base^k)Z. Note that the overflow occurs only when left≧base^k, since left≧right holds.

Furthermore, since v%m<m<base^k holds, we know that the number to divide v can be base^k, not base^(k+1).

@ghost
Copy link

ghost commented Jul 3, 2021

Tagging subscribers to this area: @dotnet/area-system-numerics
See info in area-owners.md if you want to be subscribed.

Issue Details

I accidentally deleted the forked repository, so I will resubmit the PR #53945 . Please refer to the previous discussion there. I apologize for the inconvenience.

This pull request fixes two bugs that existed in FastReducer.

1.

There is a bug in the current implementation of FastReducer that causes a runtime error when certain conditions described below are met.

  • Modulus is less than or equal to base^{modulus.Length-1}.
  • Length of divided value must be sufficiently large when call reduce (this length depending on the modulus).

When these conditions are met, the size of the buffers q1 and q2 used in the algorithm becomes insufficient, causing a assertion error on BigIntegerCalculator.FastReducer.cs#L77 in debug build, or IndexOutOfRangeException on BigIntegerCalculator.PowMod.cs#L401 in Release build.

FastReducer is used only in BigInteger.ModPow, but the specific condition cause above error. Following code is one of the example.

BigInteger.ModPow(BigInteger.One << 1008, 2, BigInteger.One << 992);

This PR fixes this bug by changing the allocated size of the buffer. The validity of the change is explained below.

Edit: After submitting this PR, I found a bug that is related but has a different cause, so I will fix that as well. See the below comments for details.

Explanation

Unless otherwise stated, all variables are non-negative and division is defined as truncated division.

Before we begin, we will define a few constants and functions.

               base := 2^32
      Length(value) := Length of the current array representing the value
ActualLength(value) := Minimum array length that needs to represent value

First, we will look at the algorithm of the constructor.

given m: uint array notation of modulus

let k := Length(m)
let r := base^(2k)

let mu := r / m
let muLength := ActualLength(mu)

Note: The variable k is implicitly defined and does not appear in the actual code.

The above corresponds to the following code.

public FastReducer(uint[] modulus)
{
Debug.Assert(modulus != null);
// Let r = (2^32)^(2k), with (2^32)^k > m
uint[] r = new uint[modulus.Length * 2 + 1];
r[r.Length - 1] = 1;
// Let mu = r / m
_mu = Divide(r, modulus);
_modulus = modulus;
// Allocate memory for quotients once
_q1 = new uint[modulus.Length * 2 + 2];
_q2 = new uint[modulus.Length * 2 + 1];
_muLength = ActualLength(_mu);
}

Next, we will look at the algorithm of the Reduce function.

given V: value that ActualLength(V) is less than or equal 2k
      ⇔ V is less than base^(2k)

let q1 := V / base^(k-1) * mu
let q2 := q1 / base^(k+1) * m

The above corresponds to the following code.

public int Reduce(uint[] value, int length)
{
Debug.Assert(value != null);
Debug.Assert(length <= value.Length);
Debug.Assert(value.Length <= _modulus.Length * 2);
// Trivial: value is shorter
if (length < _modulus.Length)
return length;
// Let q1 = v/(2^32)^(k-1) * mu
int l1 = DivMul(value, length, _mu, _muLength,
_q1, _modulus.Length - 1);
// Let q2 = q1/(2^32)^(k+1) * m
int l2 = DivMul(_q1, l1, _modulus, _modulus.Length,
_q2, _modulus.Length + 1);
// Let v = (v - q2) % (2^32)^(k+1) - i*m
return SubMod(value, length, _q2, l2,
_modulus, _modulus.Length + 1);
}

Here, we will think about upper bound of ActualLength(q1).
Since upper bound of V is base^(2k), upper bound of V / base^(k-1) is base^(k+1).
Since modulus should not be 0, r and mu can't be 0. So mu is less than base^(muLength).
It is easy to show that an upper bound on the values of both V and mu can be achieved.
From these facts, we can see that the upper bound of q1 is base^(muLength + k + 1).

Same as q1, we can see that upper bound of q2 is base^(muLength + k)

Now we have shown the upper bound of q1 and q2. From these, we can see that the upper bound of length of the buffer required for q1 and q2 is: ActualLength(q1) ≦ muLength + k + 1, ActualLength(q2) ≦ muLength + k.

2.

There is one additional bug in the current FastReducer implementation. This bug occurs when the lower half bit of the value is less than value%modulus.
This happens when you call BigInteger.ModPow using the FastReducer as follows:

BigInteger.ModPow(BigInteger.One << 1008, 2, (BigInteger.One << 992) + 1);

The above commit adds a test that covers this case and fixes the bug. Below is an explanation of the cause and fix for this bug.

Explanation

In this explanation, the symbols used in the above explanation are reused.
Barett Reduction is an algorithm that takes advantage of the fact that V mod m = V - ⌊v/m⌋m holds. In this algorithm, value of q2 is ⌊v/m⌋m. Now we will look at the algorithm of the last subtraction.

v := (v - q2) % base^(k+1)
while m ≦ v:
  v := v - m

The above corresponds to the following code.

// Let v = (v - q2) % (2^32)^(k+1) - i*m
return SubMod(value, length, _q2, l2,
_modulus, _modulus.Length + 1);

private static unsafe int SubMod(uint[] left, int leftLength,
uint[] right, int rightLength,
uint[] modulus, int k)
{
Debug.Assert(left != null);
Debug.Assert(left.Length >= leftLength);
Debug.Assert(right != null);
Debug.Assert(right.Length >= rightLength);
// Executes the subtraction algorithm for left and right,
// but considers only the first k limbs, which is equivalent to
// preceding reduction by 2^(32*k). Furthermore, if left is
// still greater than modulus, further subtractions are used.
if (leftLength > k)
leftLength = k;
if (rightLength > k)
rightLength = k;
fixed (uint* l = left, r = right, m = modulus)
{
SubtractSelf(l, leftLength, r, rightLength);
leftLength = ActualLength(left, leftLength);
while (Compare(l, leftLength, m, modulus.Length) >= 0)
{
SubtractSelf(l, leftLength, m, modulus.Length);
leftLength = ActualLength(left, leftLength);
}
}
Array.Clear(left, leftLength, left.Length - leftLength);
return leftLength;
}

The reason we divide by base^(k+1) first is that we are trying to speed up the process by using the fact that mod is greater than base^(k+1) by the definition of k. This algorithm itself is reasonable and there is no problem.
However, the actual implementation process is as follows:

given left, right, modulus, k

left := left % base^k
right := right % base^k
left := left - right
while modulus ≦ left:
  left := left - modulus

The SubtractSelf function, which is performing the subtraction in line 133, does not allow the subtrahend to be larger than minuend. Therefore, this code may result in an assertion error.
This can be avoided by using the SubtractSelf function which allows overflow. The reason this works is that the overflow in leftLength=k acts as operation on Z/(base^k)Z. Note that the overflow occurs only when left≧base^k, since left≧right holds.

Furthermore, since v%m<m<base^k holds, we know that the number to divide v can be base^k, not base^(k+1).

Author: key-moon
Assignees: -
Labels:

area-System.Numerics

Milestone: -

@key-moon key-moon changed the title Fix bug of FastReducer used in BigInteger.ModPow [resubmit] Fix bug of FastReducer used in BigInteger.ModPow Jul 3, 2021
@terrajobst terrajobst added the community-contribution Indicates that the PR has been added by a community member label Jul 19, 2021
@jeffhandley
Copy link
Member

@pgovind This PR is assigned to you for follow-up/decision before the RC1 snap.

@pgovind
Copy link

pgovind commented Aug 2, 2021

I haven't had time to go through the changes here yet. I will have some time later this/next week to play around with the changes here. However, because we're pretty late in the .NET 6 cycle, this will likely miss .NET 6, but we can get it into .NET 7 for sure. .NET 7 will begin accepting changes in ~3 weeks

@pgovind pgovind modified the milestones: Future, 7.0.0 Aug 2, 2021
@jeffhandley
Copy link
Member

@key-moon I wanted to let you know we're still working on some .NET 6.0 RC2 items, so we're a bit delayed on reviewing this. It'll likely be a couple more weeks. Thanks for your contributions and patience!

Copy link

@pgovind pgovind left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great PR and explanation. This LGTM!

@@ -27,19 +27,19 @@ public FastReducer(uint[] modulus)
{
Debug.Assert(modulus != null);

// Let r = 4^k, with 2^k > m
// Let r = (2^32)^(2k), with (2^32)^k > m
Copy link
Member

@tannergooding tannergooding Oct 4, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you explain why base is 2^32, which I'm interpreting as 4294967296?

Copy link
Contributor Author

@key-moon key-moon Oct 7, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The formula in the comment changed because the following formula is not correct. It should be v/2^(k-32) * mu.

// Let q1 = v/2^(k-1) * mu
int l1 = DivMul(value, length, _mu, _muLength,
_q1, _modulus.Length - 1);

Instead of change (k-1) to (k-32), I choose to change base to 2^32 from 2 because the base of the number notation in this code is 2^32. If you write as 2^(k-32), I think it is more harder to understand because the reader doesn't sure where 32 came from.

But I can understand why you prefer base 2. So, what about //Let r = 2^(32*2*k), with 2^(32*2*k) > m instead of 2^(2k) or (2^32)^(2k)?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd prefer to keep this inline with the paper and just use 2^(k-32). We can leave a comment that its 32 because we operate on 32-bits at a time

@tannergooding
Copy link
Member

The logic and math LGTM minus the use of 2^32 as the base, which is confusing me here.

The reduction algorithm is Barrett reduction (https://en.wikipedia.org/wiki/Barrett_reduction) and the base number system is 2

@jeffhandley
Copy link
Member

@key-moon Because of another PR that got merged, there are now some conflicts. Let us know if you have any trouble resolving the conflicts. Thanks!

@key-moon
Copy link
Contributor Author

@jeffhandley Should I resolve the conflict by merging the main branch into this branch?

@jeffhandley
Copy link
Member

@jeffhandley Should I resolve the conflict by merging the main branch into this branch?

Yes. If you perform the following steps, it'll set you up to resolve the conflict in your environment:

  1. git switch main
  2. git pull main
  3. git switch fix-fastreducer
  4. git merge main

You'll then see the conflicts in BigIntegerCalculator.FastReducer.cs and you'll need to reconcile the changes made in #35565 here with your own changes.

Sorry for the inconvenience with the conflicts. Several folks have been contributing to BigInteger recently.

@jeffhandley jeffhandley assigned tannergooding and unassigned pgovind Oct 21, 2021
@tannergooding
Copy link
Member

Please let us know if you need assistance with fixing the merge conflict. We're happy to do so if needed.

@key-moon
Copy link
Contributor Author

I apologize for the delay in resolving the conflicts.

Because the size of q1 and q2 changes by calculation in the constructor, stack allocation could not be realized only with a read-only struct. To solve this problem, I add a helper structure.

I have aligned the definition of k in the comments with the one in Wikipedia and added notes where k is used in similar applications but other definitions.

@tannergooding
Copy link
Member

Closing and reopening to get CI to reintegrate with main and rerun.

If CI is passing, then I'll give this a final review and hopefully get it merged.

@tannergooding
Copy link
Member

Repeating the process as there are crypto failures that I believe are unrelated.

@tannergooding
Copy link
Member

@key-moon looks like the crypto tests are still failing.

Could you push a merge commit that pulls in latest dotnet:main or investigate to ensure that this isn't hitting some BigInteger path in the code/test? -- It isn't failing anywhere else at this time.

@key-moon
Copy link
Contributor Author

ModPow is an operation that is also used in the Cryptographic library, so it could be related. In the Spannify changes we merged, I found the trivial information leakage of a public key in DSAKeyFormatHelper, that is caused by using BigInteger.ModPow in ReadDsaPrivateKey

Since uncleared data does not contain any information of private key, it can't be abusable and therefore does not meet the criteria for information leakage in security reports. So I didn't report to MSRC.

This problem existed in the PR of Spannify and the test was failed before the merge, so I thought it was irrelevant. But I would like to note it for your information.

@key-moon
Copy link
Contributor Author

Update: This failure is reproduced in my local environment. And not occurred on main branch. So this failure is related to this PR. I'll investigate it.

@tannergooding tannergooding added the needs-author-action An issue or pull request that requires more info or actions from the author. label Feb 7, 2022
@ghost ghost added the no-recent-activity label Feb 21, 2022
@ghost
Copy link

ghost commented Feb 21, 2022

This pull request has been automatically marked no-recent-activity because it has not had any activity for 14 days. It will be closed if no further activity occurs within 14 more days. Any new comment (by anyone, not necessarily the author) will remove no-recent-activity.

@ghost
Copy link

ghost commented Mar 8, 2022

This pull request will now be closed since it had been marked no-recent-activity but received no further activity in the past 14 days. It is still possible to reopen or comment on the pull request, but please note that it will be locked if it remains inactive for another 30 days.

@ghost ghost closed this Mar 8, 2022
@ghost ghost locked as resolved and limited conversation to collaborators Apr 7, 2022
This pull request was closed.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area-System.Numerics community-contribution Indicates that the PR has been added by a community member needs-author-action An issue or pull request that requires more info or actions from the author. no-recent-activity
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants