Tracking issue: 32bit x86 targets lose float NaN payload in return values #115567

RalfJung · 2023-09-05T12:42:07Z

On x86 (32bit) targets, returning a floating-point number from a "C" ABI function by-value can change the bits of their NaN payloads. (Specifically, the quiet bit gets set.) This is caused by using x87 registers to pass return values, as mandated by the ABI for this target.

This is a known and long-standing problem, and very hard to fix. The purpose of this issue mostly is to document its existence and to give it a URL that can be referenced.

A proper fix would require patching LLVM to use different code to load float return values into an x87 register -- specifically, it has to be done in a way that NaN payloads are fully preserved.

Related issues:

x86-32: floating-point return values undergo implicit format conversion llvm/llvm-project#66803: the same bug on the LLVM side.
Tracking issue: 32bit x86 targets without SSE2 have unsound floating point behavior #114479: that's a different problem, also affecting x86-32 but concerning the actual FP computations, not the NaN bits of return values.

Prior issues:

Diggsey · 2023-10-14T14:52:13Z

It looks like the 80-bit float load/store operations don't modify the bits, so in principle one could fix this by performing the conversion from 32-bit or 64-bit float to 80-bit float (and back) in software, although I expect that would be prohibitively expensive.

RalfJung · 2023-10-14T15:13:26Z

Could we do a quick check whether the float is a NaN or not, and only do the software conversion for NaNs? That should be a lot faster I hope.

Diggsey · 2023-10-15T02:41:51Z

Came up with this code for software conversion:

typedef union {
    long double ld;
    struct {
        unsigned long long ull;
        unsigned short us;
    };
} ld_t;

typedef union {
    double d;
    unsigned long long ull;
} d_t;

long double identity(double x) {
    d_t y;
    y.d = x;
    ld_t z;
    z.ull = (y.ull << 11) & 0x7FFFFFFFFFFFFFFFull | 0x8000000000000000ull;
    unsigned short e = (y.ull >> 52) & 0x7FF;
    z.us = ((e == 0x7FF) ? 0x7FFF : e + 0x3C00) | ((y.ull >> 48) & 0x8000);
    return z.ld;
}

identity:
        sub     esp, 28
        movq    xmm1, QWORD PTR [esp+32]
        movdqa  xmm0, xmm1
        psllq   xmm1, 11
        psrlq   xmm0, 32
        movd    eax, xmm0
        movdqa  xmm0, XMMWORD PTR .LC0
        mov     edx, eax
        shr     edx, 20
        por     xmm1, xmm0
        and     dx, 2047
        movq    QWORD PTR [esp], xmm1
        cmp     dx, 2047
        lea     ecx, [edx+15360]
        mov     edx, 32767
        cmovne  edx, ecx
        shr     eax, 16
        and     ax, -32768
        or      eax, edx
        mov     WORD PTR [esp+8], ax
        fld     TBYTE PTR [esp]
        add     esp, 28
        ret
.LC0:
        .long   0
        .long   -2147483648
        .long   0
        .long   0

Or, special casing NaNs:

#include "math.h"

typedef union {
    long double ld;
    struct {
        unsigned long long ull;
        unsigned short us;
    };
} ld_t;

typedef union {
    double d;
    unsigned long long ull;
} d_t;

long double identity(double x) {
    if (!isnan(x)) {
        return x;
    }
    d_t y;
    y.d = x;
    ld_t z;
    z.ull = (y.ull << 11) & 0x7FFFFFFFFFFFFFFFull | 0x8000000000000000ull;
    z.us = (y.ull >> 48) | 0x7FFF;
    return z.ld;
}

identity:
        sub     esp, 44
        fld     QWORD PTR [esp+48]
        fst     QWORD PTR [esp+8]
        fucomi  st, st(0)
        jp      .L2
        add     esp, 44
        ret
.L2:
        fstp    st(0)
        mov     eax, DWORD PTR [esp+8]
        mov     edx, DWORD PTR [esp+12]
        movd    xmm0, eax
        movd    xmm1, edx
        mov     eax, edx
        punpckldq       xmm0, xmm1
        shr     eax, 16
        movdqa  xmm1, XMMWORD PTR .LC0
        psllq   xmm0, 11
        or      ax, 32767
        por     xmm0, xmm1
        mov     WORD PTR [esp+24], ax
        movq    QWORD PTR [esp+16], xmm0
        fld     TBYTE PTR [esp+16]
        add     esp, 44
        ret
.LC0:
        .long   0
        .long   -2147483648
        .long   0
        .long   0```

RalfJung · 2023-10-15T14:59:52Z

Nice. :)

So, I wonder -- what would it take to actually use this when returning floats or doubles? Do we have to patch LLVM or can the codegen backend do this?

RalfJung · 2023-10-16T05:13:55Z

Btw, can someone explain why this only affects return values? How are float arguments passed?

Diggsey · 2023-10-16T08:35:03Z

So, I wonder -- what would it take to actually use this when returning floats or doubles? Do we have to patch LLVM or can the codegen backend do this?

No idea :)

Btw, can someone explain why this only affects return values? How are float arguments passed?

Arguments on x86 (with the C calling convention) are always passed on the stack regardless of type. So the problem still exists as soon as you fld the value, but it's not inherent to the calling convention.

comex · 2023-10-19T18:37:27Z

By the same token, for non-extern "C" functions, floating point returns could be changed to go on the stack, even without SSE.

RalfJung · 2023-12-14T19:41:06Z

Here there is a claim that this issue only affects signaling NaN return values. Is that true? The example given here does seem to be a signaling NaN.

beetrees · 2024-06-21T11:40:46Z

For reference: the only change made to f32/f64 NaN payloads by x87 load/store is to quieten signalling NaN payloads by setting the quiet bit. Additionally, only f32 and f64 are affected by this issue as neither f16 nor f128 are passed in x87 registers.

Ensure floats are returned losslessly by the Rust ABI on 32-bit x86 Solves rust-lang#115567 for the (default) `"Rust"` ABI. When compiling for 32-bit x86, this PR changes the `"Rust"` ABI to return floats indirectly instead of in x87 registers (with the exception of single `f32`s, which this PR returns in general purpose registers as they are small enough to fit in one). No change is made to the `"C"` ABI as that ABI requires x87 register usage and therefore will need a different solution.

…workingjubilee Ensure floats are returned losslessly by the Rust ABI on 32-bit x86 Solves rust-lang#115567 for the (default) `"Rust"` ABI. When compiling for 32-bit x86, this PR changes the `"Rust"` ABI to return floats indirectly instead of in x87 registers (with the exception of single `f32`s, which this PR returns in general purpose registers as they are small enough to fit in one). No change is made to the `"C"` ABI as that ABI requires x87 register usage and therefore will need a different solution.

RalfJung added O-x86 C-tracking-issue Category: An issue tracking the progress of sth. like the implementation of an RFC A-floating-point Area: Floating point numbers and arithmetic A-ABI Area: Concerning the application binary interface (ABI) labels Sep 5, 2023

RalfJung mentioned this issue Sep 17, 2023

Some num tests fail for {i586, i686, x86_64}-unknown-linux-gnu #46948

Closed

GuentherVIII mentioned this issue Sep 18, 2023

Return f32 and f64 in XMM0 instead of FP0 on i686 #115919

Closed

jyknight mentioned this issue Sep 19, 2023

x86-32: floating-point return values undergo implicit format conversion llvm/llvm-project#66803

Open

RalfJung mentioned this issue Oct 3, 2023

add notes about non-compliant FP behavior on 32bit x86 targets #113053

Merged

RalfJung changed the title ~~Tracking issue: 32bit x86 targets lose NaN payload in return values~~ Tracking issue: 32bit x86 targets lose float NaN payload in return values Oct 8, 2023

Noratrieb added O-x86_32 Target: x86 processors, 32 bit (like i686-*) and removed O-x86-all labels Oct 25, 2023

RalfJung mentioned this issue Nov 5, 2023

Remove ability to disable some target features #116584

Closed

beetrees mentioned this issue Apr 2, 2024

Ensure floats are returned losslessly by the Rust ABI on 32-bit x86 #123351

Merged

RalfJung mentioned this issue Sep 2, 2024

enable const-float-classify test, and test_next_up/down on 32bit x86 #129835

Merged

beetrees mentioned this issue Oct 28, 2024

Test more targets against a custom-built musl libm rust-lang/libm#300

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking issue: 32bit x86 targets lose float NaN payload in return values #115567

Tracking issue: 32bit x86 targets lose float NaN payload in return values #115567

RalfJung commented Sep 5, 2023 •

edited

Loading

Diggsey commented Oct 14, 2023

RalfJung commented Oct 14, 2023

Diggsey commented Oct 15, 2023

RalfJung commented Oct 15, 2023

RalfJung commented Oct 16, 2023

Diggsey commented Oct 16, 2023

comex commented Oct 19, 2023

RalfJung commented Dec 14, 2023

beetrees commented Jun 21, 2024

Tracking issue: 32bit x86 targets lose float NaN payload in return values #115567

Tracking issue: 32bit x86 targets lose float NaN payload in return values #115567

Comments

RalfJung commented Sep 5, 2023 • edited Loading

Diggsey commented Oct 14, 2023

RalfJung commented Oct 14, 2023

Diggsey commented Oct 15, 2023

RalfJung commented Oct 15, 2023

RalfJung commented Oct 16, 2023

Diggsey commented Oct 16, 2023

comex commented Oct 19, 2023

RalfJung commented Dec 14, 2023

beetrees commented Jun 21, 2024

RalfJung commented Sep 5, 2023 •

edited

Loading