Float64 to Float16 conversion is slow #41161

JeffBezanson · 2021-06-09T22:11:30Z

julia> @btime Float16(rand())
  15.878 ns (0 allocations: 0 bytes)

julia> @btime Float16(Float32(rand()))
  5.928 ns (0 allocations: 0 bytes)

julia> @btime Float16(rand(Float32))
  4.282 ns (0 allocations: 0 bytes)

I believe we are calling compiler-rt for this. Of course this can't be implemented by converting via Float32 since that rounds twice, but it's frustrating that that method is so much faster. Would be nice to have a better implementation of this. See also #40315.

The text was updated successfully, but these errors were encountered:

oscardssmith · 2021-06-09T23:19:03Z

Yeah. This is one of those functions that should be really easy to implement, but is surprisingly hard to get correct and fast. It's been on my list for a while.

vchuravy · 2021-06-10T14:23:56Z

I believe we are calling compiler-rt for this.

julia> @code_native Float16(rand())
	.text
; ┌ @ float.jl:180 within `Float16'
	pushq	%rax
	movabsq	$__truncdfhf2, %rax
	callq	*%rax
	popq	%rcx
	retq
	nop
; └

Which is the compiler-rt name for it, but it should end up in

julia/src/intrinsics.cpp

Line 1490 in 15c19c8

extern "C" JL_DLLEXPORT uint16_t __truncdfhf2(double param)

Looking at it closely it internally converts to float and then uses our implementation of float_to_half. Whereas doing the conversion on the Julia level will use the x86 intrinsic to go from Float32->Float16. We might want to try compiler-rt (especially since nowadays OrcV2 let's you add static libraries to an ExecutionSession instead of having to turn the compiler-rt archive into a shared library as did in https://github.com/JuliaLang/julia/pull/17344/files#diff-c9f616510e5e877240287257026b05d8fb29270feead033f6a87ccf6213dd66bR566)

fingolfin · 2023-06-28T09:24:27Z

Interestingly this is already fast on M1 macs (so ARM), with Julia 1.9.1

julia> @btime Float16(rand())
  3.083 ns (0 allocations: 0 bytes)
Float16(0.952)

julia> @btime Float16(Float32(rand()))
  3.083 ns (0 allocations: 0 bytes)
Float16(0.798)

julia> @btime Float16(rand(Float32))
  3.083 ns (0 allocations: 0 bytes)
Float16(2.164e-5)

It is still slow on an x86_64 machine (also using Julia 1.9.1):

julia> @btime Float16(rand())
  18.348 ns (0 allocations: 0 bytes)
Float16(0.906)

julia> @btime Float16(Float32(rand()))
  5.037 ns (0 allocations: 0 bytes)
Float16(0.1693)

julia> @btime Float16(rand(Float32))
  4.571 ns (0 allocations: 0 bytes)
Float16(0.6675)

gbaraldi · 2023-06-28T11:29:12Z

@oscardssmith should we do the double conversion, define Float16(Float64) as Float16(Float32(Float64)) or is the double rounding wrong?

oscardssmith · 2023-06-28T11:42:38Z

double rounding is wrong

timholy · 2023-06-28T19:46:28Z

Demo: round 0.499 to 2 digits: you get 0.50. Now round to 1 digit: you get 1 (with "round up"). But round 0.499 to 1 digit immediately: you get 0, even with round up.

JeffBezanson added performance Must go faster float16 labels Jun 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Float64 to Float16 conversion is slow #41161

Float64 to Float16 conversion is slow #41161

JeffBezanson commented Jun 9, 2021

oscardssmith commented Jun 9, 2021

vchuravy commented Jun 10, 2021

fingolfin commented Jun 28, 2023

gbaraldi commented Jun 28, 2023

oscardssmith commented Jun 28, 2023

timholy commented Jun 28, 2023

Float64 to Float16 conversion is slow #41161

Float64 to Float16 conversion is slow #41161

Comments

JeffBezanson commented Jun 9, 2021

oscardssmith commented Jun 9, 2021

vchuravy commented Jun 10, 2021

fingolfin commented Jun 28, 2023

gbaraldi commented Jun 28, 2023

oscardssmith commented Jun 28, 2023

timholy commented Jun 28, 2023