Accuracy issues of sum() specialization for floats/complexes #122234

skirpichev · 2024-07-24T14:07:07Z

Bug report

Bug description:

Unfortunately, #121176 was merged with a bug:

Lines 2749 to 2755 in e968121

    
           if (PyFloat_Check(item)) { 
        
               double value = PyFloat_AS_DOUBLE(item); 
        
               re_sum.hi += value; 
        
               im_sum.hi += 0.0; 
        
               _Py_DECREF_SPECIALIZED(item, _PyFloat_ExactDealloc); 
        
               continue; 
        
           }

L2751 lacks cs_add(). Sorry for that. Reproducer: sum([2j, 1., 10E100, 1., -10E100]) (should be 2+2j). I'll provide a patch.

But maybe cases for integer arguments also should use compensated summation? E.g.:

cpython/Python/bltinmodule.c

Lines 2689 to 2698 in e968121

    
           if (PyLong_Check(item)) { 
        
               long value; 
        
               int overflow; 
        
               value = PyLong_AsLongAndOverflow(item, &overflow); 
        
               if (!overflow) { 
        
                   re_sum.hi += (double)value; 
        
                   Py_DECREF(item); 
        
                   continue; 
        
               } 
        
           }

on L2694 (and use PyLong_AsDouble()). An example:

>>> sum([1.0, 10E100, 1.0, -10E100])
2.0
>>> sum([1.0, 10**100, 1.0, -10**100])  # huh?
0.0

I would guess, that integer values in this case are treated as exact and they are allowed to smash floating-point result to garbage. But... This looks as a bug for me. fsum() also chooses 2.0:

>>> math.fsum([1.0, 10**100, 1.0, -10**100])
2.0

CPython versions tested on:

CPython main branch

Operating systems tested on:

No response

Linked PRs

The text was updated successfully, but these errors were encountered:

skirpichev · 2024-07-24T14:10:17Z

CC @rhettinger for the second part (integer values). Is this an issue or a feature?

picnixz · 2024-07-24T14:21:40Z

Actually, even without the sum, we have:

>>> x = 10 ** 100
>>> 1.0 + x + 1.0 - x
0.0

skirpichev · 2024-07-24T14:33:40Z

@picnixz, that's a feature.

Ok, it seems that in the second case PyLong_AsLongAndOverflow() just overflows and we fallback to the generic sum. That's something copied from the specialization for integers (in 8ce8a78) and it wasn't changed in #100426. Maybe it should?

skirpichev · 2024-07-24T14:42:21Z

Hmm, with PyLong_AsDouble() + compensated summation it's even faster!

./python -m timeit -r11 -s 'xs=[1.0, 10**100, 1.0, -10**100]' 'sum(xs)'
500000 loops, best of 11: 634 nsec per loop

while in the main:

$ ./python -m timeit -r11 -s 'xs=[1.0, 10**100, 1.0, -10**100]' 'sum(xs)'
500000 loops, best of 11: 801 nsec per loop

* Use compensated summation for complex sums with floating-point items. This amends python#121176. * sum() specializations for floats and complexes now use PyLong_AsDouble() instead of PyLong_AsLongAndOverflow() and compensated summation as well.

skirpichev · 2024-07-24T16:19:12Z

PR is ready for review: #122236

It combines fixes for floats and integers. If second case requires more discussion or is "a feature", I can quickly revert that part.

* Use compensated summation for complex sums with floating-point items. This amends #121176. * sum() specializations for floats and complexes now use PyLong_AsDouble() instead of PyLong_AsLongAndOverflow() and compensated summation as well.

encukou · 2024-07-29T15:44:41Z

The added error returns leak references. This was caught by the noGIL refleak buildbot (which runs more often than refleaks for regular-builds): https://buildbot.python.org/#/builders/1226/builds/2352/steps/6/logs/stdio

Eclips4 · 2024-07-29T15:45:14Z

The added error returns leak references. This was caught by the noGIL refleak buildbot (which runs more often than refleaks for regular-builds): https://buildbot.python.org/#/builders/1226/builds/2352/steps/6/logs/stdio

PR is ready: #122405

encukou · 2024-07-29T15:47:23Z

My PR #122406 looks exactly the same :)
IMO, it's better to associate the PR with this issue, so the fix-up PR is kept together with the original.

Co-Authored-By: Kirill Podoprigora <kirill.bast9@mail.ru>

picnixz · 2024-07-29T16:40:02Z

Closing since the refleak is now fixed.

skirpichev added the type-bug An unexpected behavior, bug, or error label Jul 24, 2024

bedevere-app bot mentioned this issue Jul 24, 2024

gh-122234: fix accuracy issues for sum() #122236

Merged

Eclips4 added the interpreter-core (Objects, Python, Grammar, and Parser dirs) label Jul 24, 2024

vstinner closed this as completed Jul 29, 2024

encukou added a commit to encukou/cpython that referenced this issue Jul 29, 2024

pythongh-122234: Add DECREFs to error paths

a51d251

encukou reopened this Jul 29, 2024

bedevere-app bot mentioned this issue Jul 29, 2024

gh-122234: Add DECREFs to error paths #122406

Merged

This was referenced Jul 29, 2024

gh-122404: Fix reference leak in sum implementation #122405

Closed

test_builtin leaks references #122404

Closed

encukou added a commit that referenced this issue Jul 29, 2024

gh-122234: Add DECREFs to error paths (#122406)

89fa05f

Co-Authored-By: Kirill Podoprigora <kirill.bast9@mail.ru>

picnixz closed this as completed Jul 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accuracy issues of sum() specialization for floats/complexes #122234

Accuracy issues of sum() specialization for floats/complexes #122234

skirpichev commented Jul 24, 2024 •

edited by bedevere-app bot

Loading

skirpichev commented Jul 24, 2024

picnixz commented Jul 24, 2024 •

edited

Loading

skirpichev commented Jul 24, 2024 •

edited

Loading

skirpichev commented Jul 24, 2024

skirpichev commented Jul 24, 2024

encukou commented Jul 29, 2024

Eclips4 commented Jul 29, 2024

encukou commented Jul 29, 2024

picnixz commented Jul 29, 2024

Accuracy issues of sum() specialization for floats/complexes #122234

Accuracy issues of sum() specialization for floats/complexes #122234

Comments

skirpichev commented Jul 24, 2024 • edited by bedevere-app bot Loading

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

skirpichev commented Jul 24, 2024

picnixz commented Jul 24, 2024 • edited Loading

skirpichev commented Jul 24, 2024 • edited Loading

skirpichev commented Jul 24, 2024

skirpichev commented Jul 24, 2024

encukou commented Jul 29, 2024

Eclips4 commented Jul 29, 2024

encukou commented Jul 29, 2024

picnixz commented Jul 29, 2024

skirpichev commented Jul 24, 2024 •

edited by bedevere-app bot

Loading

picnixz commented Jul 24, 2024 •

edited

Loading

skirpichev commented Jul 24, 2024 •

edited

Loading