Use functions returning NaN on branch cuts instead of abs (issue #109) #123

johanbluecreek · 2022-09-07T07:28:32Z

This PR is intended to resolve issue #109 . It replaces logarithmic _abs, acosh_abs, and sqrt_abs with corresponding functions returning NaN on their corresponding branch cuts. This also removes all regularizing terms in the arguments of the previous functions, e.g. the 1//100000000.

The corresponding functions for generic types are now mapped to non-regularized functions as well. As I understand the need for these function definitions is to map them to symbolic expressions for the package SymbolicUtils.jl. The symbolic expression the user gets may have branch cuts, but I think it should be up to the user to analyse those properties of the final expression instead of having the functions mapped to regularized versions of the same functions. But let me know if I'm missing some reason why the symbolic expressions should be regularized. There appear to be no unit-tests for this, let me know if some should be added.

The old tests have also been updated: the test_tree_construction tests are for the new functions only tested in their valid domain, and test_operators test points inside and outside the domain of the new functions.

To note, since pow_abs is marked with @deprecate I did not bother to change this operation. Let me know if there is something missing that should be fixed in relation to power expressions.

All tests pass on my computer, and I also did some searches that finished without any apparent errors.

…ts (issue MilesCranmer#109) This commit also removes the regularization terms present in the logs. This is a first step in resolving issue MilesCranmer#109.

The new tests does not only test a point within the domain, but also outside of the domain making sure it returns a NaN.

The test is slightly modified to test the new operators within their domain.

This is to avoid printing e.g. `log_nan` when `log` is sufficient.

…ng functions

This also removes the regularizing '1' that was present in the old definition. Instead the test domain is shifted.

…lesCranmer#109

MilesCranmer

Thanks, this is looking good - left some comments. Let me know what you think.

src/Operators.jl

src/Core.jl

src/Equation.jl

src/Operators.jl

…anch cuts

This node is to compensate that the new power-operation does not take the absolute value of the argument. This new pow_nan made the old "tree_bad"run into NaN value producing infinite losses.

There is now a function that uses a dictionary to rename the internal names of the operators to more user friendly ones. This is a bit more strict, and will not rename user-defined functions.

…perators There are now tests for the new safe operators that verifies that the "safe_" part of the name is removed when printed to the user. The tests are not only for the unary operators but the binary "safe_pow" as well.

johanbluecreek · 2022-09-08T15:57:51Z

This last batch of commits should hopefully be enough for the review comments (do I press the "Resolve Conversation" buttons for the ones I think I have solved?).

That is, pow_abs is now replaced by a safe_pow, all operators are renamed "safe_" instead of "_nan", and for printing only the safe_ operators that are defined internally will have their names changed and there are unit-tests that verifies those name changes.

I only left the first review question unresolved, and I don't think there is a particular useful way of resolving it. Let me reiterate my issue there: If the resulting expression found is, say, safe_log(x1), the general-type definition of safe_log is what is called on the SymbolicsUtils.Sym that is x1. I don't get when this should return NaN instead of a SymbolicUtils.Term that is log(x1). The NaN is only useful during the search to exclude expressions with domains not compatible with the input data, I don't see the use of having the NaN there for manipulating the results using SymbolicsUtils (which the docs says the general-type function definitions are needed for). Does that make sense?

MilesCranmer · 2022-09-09T18:37:38Z

Thanks, will take a look! Yes - you hit "Resolve Conversation" when a suggestion is addressed. I already clicked them though.

src/Operators.jl

test/test_operators.jl

src/Operators.jl

MilesCranmer · 2022-09-10T00:20:13Z

src/Options.jl

+- `binary_operators`: Tuple of binary operators to use. Each operator should
+    be defined for two input scalars, and one output scalar. All operators
+    need to be defined over the entire real line (excluding infinity - these
+    are stopped before they are input), or return `NaN` where not defined.


Nice, thanks for adding this!

MilesCranmer · 2022-09-10T00:21:06Z

test/test_print.jl

@@ -31,3 +31,19 @@ EquationSearch(
 s = repr(tree)
 true_s = "((sin(cos(sin(cos(v1) * v3) * 3.0) * -0.5) + 2.0) * 5.0)"
 @test s == true_s
+
+for unaop in [safe_log, safe_log2, safe_log10, safe_log1p, safe_sqrt, safe_acosh]


Nice, thanks for adding!

MilesCranmer

Looks great, thanks for the contribution!

johanbluecreek added 7 commits September 6, 2022 13:34

Replacing the logs using abs with logs returning NaN at branch cu…

3872a82

…ts (issue MilesCranmer#109) This commit also removes the regularization terms present in the logs. This is a first step in resolving issue MilesCranmer#109.

Updating the tests of log_abs ops to use log_nan

1d93b1f

The new tests does not only test a point within the domain, but also outside of the domain making sure it returns a NaN.

Updating tree construction test to use the new log_nan

b6f7d15

The test is slightly modified to test the new operators within their domain.

Print function with "_nan" using only the basename

41f1c6e

This is to avoid printing e.g. `log_nan` when `log` is sufficient.

Updating the documentation on binary operators to mention NaN-returni…

3836617

…ng functions

Switching acosh_abs to acosh_nan as was done for the logs

325d681

This also removes the regularizing '1' that was present in the old definition. Instead the test domain is shifted.

Changing sqrt_abs in favour of sqrt_nan as part of resolving issue Mi…

17566e6

…lesCranmer#109

MilesCranmer requested changes Sep 7, 2022

View reviewed changes

src/Operators.jl Outdated Show resolved Hide resolved

src/Core.jl Outdated Show resolved Hide resolved

src/Equation.jl Outdated Show resolved Hide resolved

src/Operators.jl Outdated Show resolved Hide resolved

johanbluecreek added 6 commits September 8, 2022 14:24

Adding a pow_nan operation that returns NaN on singular points and br…

cbd10b1

…anch cuts

Adding tests for the pow_nan operation

b4f1f35

Adding an abs node to the test tree

eae352e

This node is to compensate that the new power-operation does not take the absolute value of the argument. This new pow_nan made the old "tree_bad"run into NaN value producing infinite losses.

Renaming the NaN-compatible operators

1a22922

A more stable method for renaming the safe_ operations

333ce04

There is now a function that uses a dictionary to rename the internal names of the operators to more user friendly ones. This is a bit more strict, and will not rename user-defined functions.

johanbluecreek requested a review from MilesCranmer September 9, 2022 08:49

Make OP_NAMES mapping a constant

b3f5ec6

MilesCranmer reviewed Sep 9, 2022

View reviewed changes

src/Operators.jl Show resolved Hide resolved

Update Operators.jl

92dda60

MilesCranmer reviewed Sep 9, 2022

View reviewed changes

test/test_operators.jl Show resolved Hide resolved

MilesCranmer reviewed Sep 9, 2022

View reviewed changes

src/Operators.jl Show resolved Hide resolved

MilesCranmer added 2 commits September 9, 2022 16:02

Map gamma output=Inf to NaN

3e99d91

Additional NaN tests for operators

d4ce109

MilesCranmer reviewed Sep 10, 2022

View reviewed changes

MilesCranmer approved these changes Sep 10, 2022

View reviewed changes

MilesCranmer merged commit 645d135 into MilesCranmer:master Sep 10, 2022

MilesCranmer mentioned this pull request Sep 10, 2022

Update backend: x^y instead of abs(x)^y, and high-precision constants MilesCranmer/PySR#191

Merged

MilesCranmer mentioned this pull request Sep 24, 2022

[Feature] Use NaNMath instead of abs #109

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use functions returning NaN on branch cuts instead of abs (issue #109) #123

Use functions returning NaN on branch cuts instead of abs (issue #109) #123

johanbluecreek commented Sep 7, 2022

MilesCranmer left a comment

johanbluecreek commented Sep 8, 2022

MilesCranmer commented Sep 9, 2022

MilesCranmer Sep 10, 2022

MilesCranmer Sep 10, 2022

MilesCranmer left a comment

Use functions returning NaN on branch cuts instead of abs (issue #109) #123

Use functions returning NaN on branch cuts instead of abs (issue #109) #123

Conversation

johanbluecreek commented Sep 7, 2022

MilesCranmer left a comment

Choose a reason for hiding this comment

johanbluecreek commented Sep 8, 2022

MilesCranmer commented Sep 9, 2022

MilesCranmer Sep 10, 2022

Choose a reason for hiding this comment

MilesCranmer Sep 10, 2022

Choose a reason for hiding this comment

MilesCranmer left a comment

Choose a reason for hiding this comment