Diagnostics tool for ill-posed constraints #1454
base: main
Conversation
Codecov Report

@@ Coverage Diff @@
## main #1454 +/- ##
==========================================
+ Coverage 76.84% 76.86% +0.01%
==========================================
Files 376 376
Lines 61091 61258 +167
Branches 13505 13549 +44
==========================================
+ Hits 46948 47084 +136
- Misses 11762 11784 +22
- Partials 2381 2390 +9
@Robbybp, is there any possibility of combining a tool like this with block triangularization? My intuition is that a constraint in which catastrophic cancellation occurs (taking the difference between two large numbers to get a small number) might show up differently in the block structure than one in which it does not (adding a small number to a large number to get another large number), even though the individual constraints might be identical.
@dallan-keylogic My only initial thought is that a "catastrophic cancellation" has different implications depending on where it appears in the block triangular decomposition. If it is in a diagonal block, it could be a problem (as we're relying on that term for nonsingularity). Otherwise, it is fine, as the entry could be zero and the matrix's singularity would not change.
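A minimal numeric sketch of the distinction being discussed (the helper function and values below are illustrative, not from the PR): comparing the magnitude of a sum against its largest term separates the two cases.

```python
# Hypothetical helper, not part of the PR: ratio of |sum| to the largest
# |term| in a sum. A tiny ratio suggests catastrophic cancellation.
def cancellation_ratio(terms):
    largest = max(abs(t) for t in terms)
    return abs(sum(terms)) / largest

# Two large terms nearly cancelling: ratio is tiny (potentially a problem).
print(cancellation_ratio([1.0e6, -999999.9]))
# A small term added to a large one: ratio is near 1 (benign).
print(cancellation_ratio([1.0e6, 0.1]))
```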
@dallan-keylogic I would say that scaling
@Robbybp Whether or not we should scale by
Overall looks pretty reasonable. Some questions and comments (that should probably be addressed)
idaes/core/util/model_diagnostics.py
Outdated
```python
for i in mm:
    mismatch.append(f"{c.name}: {i}")
for i in cc:
    cancellation.append(f"{c.name}: {i}")
if k:
    constant.append(c.name)
```
I would be happier if the "collect" routine didn't do formatting (conversion to a string).
This one would take some more work and require changing other `collect` methods as well. I think that might be better as a separate issue/PR.
```python
if (
    hasattr(node, "is_named_expression_type")
    and node.is_named_expression_type()
):
```
I have stumbled across a syntax that is a bit more concise. Not sure if you want to use it, though:

```python
if getattr(node, "is_named_expression_type", bool)():
```
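For reference, a short demonstration of why the one-liner behaves like the `hasattr` check: when the attribute is missing, `getattr` returns the `bool` type as the default, and calling `bool()` with no arguments yields `False` (the class names below are illustrative).

```python
class Named:
    def is_named_expression_type(self):
        return True

class Plain:
    pass

# With the attribute present, the bound method is called as usual.
print(getattr(Named(), "is_named_expression_type", bool)())  # True
# Without it, bool() is called instead, which returns False.
print(getattr(Plain(), "is_named_expression_type", bool)())  # False
```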
I think I'll keep the current form as it is a little easier to understand what it is doing.
idaes/core/util/model_diagnostics.py
Outdated
```python
# We will check for cancelling terms here, rather than the sum itself,
# to handle special cases.
# We want to look for cases where a sum term results in a value much smaller
# than the terms of the sum.
sums = self._sum_combinations(d[0])
if any(i <= self._sum_tol * max(d[0]) for i in sums):
    cancelling.append(str(node))
```
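A self-contained sketch of this check, under the assumption that `_sum_combinations` enumerates sums over every subset of two or more terms and `_sum_tol` is a relative tolerance (the names, tolerance value, and use of absolute values here are stand-ins, not the PR's exact implementation):

```python
from itertools import chain, combinations

SUM_TOL = 1e-4  # stand-in for self._sum_tol

def sum_combinations(values):
    # Absolute sums over every subset of 2..n terms.
    return [
        abs(sum(combo))
        for combo in chain.from_iterable(
            combinations(values, r) for r in range(2, len(values) + 1)
        )
    ]

def has_cancelling_terms(values):
    largest = max(abs(v) for v in values)
    return any(s <= SUM_TOL * largest for s in sum_combinations(values))

print(has_cancelling_terms([1.0, -1.0000001, 5.0]))  # True: first two cancel
print(has_cancelling_terms([1.0, 2.0, 3.0]))         # False
```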
I think I understand what you are doing here, but wouldn't this complain loudly about the objective for every parameter estimation problem (MSE) if the problem actually solved?
One of my main concerns with this tool is its inability to distinguish between true cases of catastrophic cancellation and benign situations, where, for example, we have a heat of adsorption term in a column which will be close to except near the breakthrough point.
Is there an easy way to test for that however? Note that this tool is only used for Cautions in the main toolbox, with the implication that these might be issues (but not guaranteeing that).
I.e., this is intended to be a simple check to find potential issues, but the user will have to look into them all further to decide if they are critical or not.
I'm not sure there's an easy way to do it without assuming a well-scaled system. In a well-scaled system, it would show up in the SVD with extremely large or extremely small singular values.
However, once you get to a high enough noise/signal ratio, you start to question whether including the tool in `report_numerical_issues` is worthwhile or whether it is distracting the user from more fruitful avenues of diagnostics. I suppose we can just pull it from `report_numerical_issues` without a deprecation cycle if it proves not to be useful, so we can try it out and see how users find it.
@jsiirola This walker was written specifically with Constraints in mind, so I had not considered that. The check for mismatched terms would make sense for an Objective however, so this could be extended to handle them as well. However, how would the expression walker know if it was dealing with an Objective or a Constraint (the input argument is the expression, not the component)?
idaes/core/util/model_diagnostics.py
Outdated
```python
# (In)equality expressions are a special case of sum expressions.
# We can start by just calling the method to check the sum expression.
vals, mismatch, cancelling, const = self._check_sum_expression(node, child_data)
```
Don't you need to negate the values for the second argument in the relational expression before treating it like a sum?
I had forgotten to make that correction - thank you.
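The correction amounts to negating the right-hand-side term values so that `lhs == rhs` is analysed as the single sum `lhs - rhs == 0`; a minimal numeric sketch (the values are illustrative):

```python
# For a constraint lhs == rhs, negate the RHS term values before running
# the sum-cancellation check, so the whole expression is one sum.
lhs_terms = [100.0, 2.5]
rhs_terms = [102.5]
combined = lhs_terms + [-v for v in rhs_terms]
print(sum(combined))  # 0.0 at a point satisfying the constraint
```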
idaes/core/util/model_diagnostics.py
Outdated
```python
node_type_method_map = {
    EXPR.EqualityExpression: _check_equality_expression,
    EXPR.InequalityExpression: _check_equality_expression,
    EXPR.RangedExpression: _check_sum_expression,
```
Don't you need special handling for Ranged (like Equality / Inequality)? I would probably define a

```python
def _check_ranged(self, node, child_data):
    lhs_vals, lhs_mismatch, lhs_cancelling, lhs_const = self._check_equality(node, child_data[:2])
    rhs_vals, rhs_mismatch, rhs_cancelling, rhs_const = self._check_equality(node, child_data[1:])
    # Then merge the results and return
```
I think I've fixed this - however, it would be good if you could double-check my logic.
idaes/core/util/model_diagnostics.py
Outdated
```python
def _check_product(self, node, child_data):
    mismatch, cancelling, const = self._perform_checks(node, child_data)

    val = self._get_value_for_sum_subexpression(
        child_data[0]
    ) * self._get_value_for_sum_subexpression(child_data[1])

    return [val], mismatch, cancelling, const
```
Almost all of your nonlinear handlers could be replaced by a single callback:

```python
def _check_general_expr(self, node, child_data):
    mismatch, cancelling, const = self._perform_checks(node, child_data)
    val = node._apply_operation(list(map(self._get_value_for_sum_subexpression, child_data)))
    return [val], mismatch, cancelling, const
```
Thank you - I did not know I could do that.
I did not review the test cases yet because I do have some suggestions for changes in the main file that may require changes in the test cases.
idaes/core/util/model_diagnostics.py
Outdated
```python
def _sum_combinations(self, values_list):
    sums = []
    for i in chain.from_iterable(
        combinations(values_list, r) for r in range(2, len(values_list) + 1)
```
2 is a magic number! Why 2?
We are looking for any combination of terms which cancel, thus the minimum number of terms to consider is 2 (a single term cannot cancel with itself). I can add a comment.
What this block of code does is go through combinations of terms in an expression, first in groups of 2, then in groups of 3, all the way up to groups of `len(values_list)`. So if we have the sum expression `a + b + c + d`, we first iterate through `(a, b), (a, c), (a, d), (b, c), (b, d), (c, d)`, then through `(a, b, c), (a, b, d), (a, c, d), (b, c, d)`, then `(a, b, c, d)`.
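The enumeration order described above can be reproduced directly with `itertools` (the variable names here are illustrative); the count of groups for `m` terms is all subsets minus the `m` singletons and the empty set:

```python
from itertools import chain, combinations

values = ["a", "b", "c", "d"]
groups = list(
    chain.from_iterable(combinations(values, r) for r in range(2, len(values) + 1))
)
print(groups[0], groups[-1])  # ('a', 'b') ('a', 'b', 'c', 'd')
# All subsets of size >= 2: 2**m, minus m singletons, minus the empty set.
m = len(values)
print(len(groups), 2**m - m - 1)  # 11 11
```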
However, the number of combinations you're checking grows exponentially with expression size. In particular, if `len(values_list) == m`, you'll be checking `2 ** m - m - 1` combinations. I expect that this sort of check will take an extremely long time on any model with `Expression`s of any significant length, much less an extreme chonker like eNRTL.
We can probably make this more efficient by:
- Stripping any term with a value of 0
- Breaking out of the loop at the first failure; we do not count how many cancellations there are, just whether there is at least one.
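Both suggestions can be sketched together: drop zero-valued terms up front and return as soon as one cancelling combination is found (a sketch under the same assumptions as the check above; the function name and tolerance are stand-ins):

```python
from itertools import chain, combinations

def has_cancellation(values, tol=1e-4):
    # Strip zero terms first; they cannot change any subset sum.
    values = [v for v in values if v != 0]
    if len(values) < 2:
        return False
    cutoff = tol * max(abs(v) for v in values)
    for combo in chain.from_iterable(
        combinations(values, r) for r in range(2, len(values) + 1)
    ):
        if abs(sum(combo)) <= cutoff:
            return True  # stop at the first cancellation; we only need one
    return False

print(has_cancellation([0.0, 3.0, -3.0, 7.0]))  # True
print(has_cancellation([1.0, 2.0]))             # False
```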
@mrmundt I think I have addressed all of your comments. If you have time to do another review, it would be appreciated.
@mrmundt Thank you - I fixed the leftover print statements.
I only have issues with one thing, which is "magic numbers" - otherwise it looks good!
idaes/core/util/model_diagnostics.py
Outdated
```python
for j in child_data[0][0]:
    vals.append(-j)
mdata.append((vals, child_data[0][1]))
mdata.append(child_data[1])

# Next, call the method to check the sum expression
vals, const, _ = self._check_sum_expression(node, mdata)

# Next, we need to check for cancelling terms.
# In this case, we can safely ignore expressions of the form constant = sum()
# We can also ignore any constraint that is already flagged as mismatched
# We will also ignore any constraints where a == b and neither a nor b are sum expressions
if not child_data[0][2] and not child_data[1][2]:
```
I am... mildly irked about all of the magic numbers here. Perhaps what is inside `child_data` is better known to others than to me, but as someone who doesn't know, what is in `[0][0]` and why does it need to be negated?
Seeing as I had to remind myself what these meant, the point is well taken. I have added some comments that hopefully explain what is happening.
I am generally content with this now!
Waiting on Pyomo/pyomo#3376
Summary/Motivation:
As part of working on the new scaling tools, I realised there are some simple checks we can do for detecting poorly-posed constraints that could cause scaling issues. This PR adds a new expression walker that looks for the following signs of poor scaling in constraints:
- expressions of the form `constant == sum()`

Changes proposed in this PR:
Legal Acknowledgement
By contributing to this software project, I agree to the following terms and conditions for my contribution: