Switch Float16 LLVM representation from `i16` to `half` #26381

vchuravy · 2018-03-09T00:29:25Z

This is a first step towards implementing the parts of #18927 and #18734
in the laziest most straight-forward possible.

Use LLVM's half type for Float16
Use LLVM intrinsics for basic operations on Float16
Provide fallbacks for methods missing in our current libc.

This foregoes the completnes aspect of #18927, while providing
a way forward to potentially extend this to Float128, and it doesn't add
additional dependencies like #18734.

Follow-up PR's could extend RTLIB to be more generic or to not use Base
(thus allowing it to be it's own shared library). There are other places
where Float16 currently are eagerly converted to Float32 so this is
not an attempt at completness.

All the code paths are tested on x86 Linux and I am currently testing on ARM.
(There is a known PPC codegen issue, but I guess nobody but me cares about that).

@vtjnash is there anything I would need to do for anticodegen/LLVM free builds?

Notes:

Upstream LLVM PPC backend bug: https://bugs.llvm.org/show_bug.cgi?id=39865

Some LLVM intrinsics can't be lowered to instructions on target platforms. Some of these are therefore lowered to libc libcalls and glibc does not implement all of them. This commit uses `extern_c` to implement these fallback methods in native Julia. Similar projects to RTLIB are glibc and compiler-rt. For now we only implement conversion between `Float16` and `Float32`, we also cheat by going through `Float32` for the conversion between `Float64` and `Float16`.

LLVM intrinsics either map to instructions or to functions in compiler-rt. Since we provide our own implementation we can just look them up in sys.so and resolve to the function there. On Darwin we have to use a unmangled version of the function name.

vchuravy · 2018-03-09T00:33:21Z

base/rtlib/RTLIB.jl

+
+# We would like to use `@ccallable` here,
+# but building the sysimage fails, so we use a bootstrapped version.
+function register(f, rtype, argt, name)


@vtjnash I was trying to use @ccallable here but I was getting failures during sysimage building.

vchuravy · 2018-03-09T00:54:54Z

@vtjnash On AArch64 I am running into:

Wrong types for attribute: byval inalloca nest noalias nocapture nonnull readnone readonly signext sret zeroext dereferenceable(1) dereferenceable_or_null(1)
float (half)* @jlcapi_extendhfsf2_3064
LLVM ERROR: Broken function found, compilation aborted!

vchuravy · 2018-03-09T02:27:58Z

Further update on PPC situation. The backend can't select FP16 operations (even on master), so I will first need to work with upstream on adding support for that.
That means that this PR is currently blocked until I get around to that :(, except if somebody else has a clever idea.

I wouldn't want to simply disable this on PPC since that would inhibit GPU code on PPC from using Float16.

vchuravy · 2018-03-09T04:09:50Z

Today does not seem to be my day. It only worked beautifully on my tests because I have the F16C instruction set extension.
After fighting with recursive definitions I am now stuck on a seqfault that is probably related to my usage of jl_extern_c.

stevengj · 2018-03-09T16:25:41Z

The title of this PR refers to switching the representation. Does this actually change the Float16 format, or does it only change the algorithms for working with Float16?

yuyichao · 2018-03-09T16:36:17Z

I believe it changes the representation in LLVM but it shouldn't change the bit pattern.

philtomson · 2018-09-17T15:45:47Z

What's the status of this PR? (conflicts, of course, but is anyone working on it?)

vchuravy · 2018-09-17T18:36:42Z

Still blocked today on #26381 (comment), but I would welcome another set of eyes to look at this, I would like to see this happening, but I can't dedicate time to it.

jekbradbury · 2018-09-17T20:21:06Z

Is there any way to disable it on PPC host but allow it to stay on for the NVPTX backend used by LLVM.jl for GPU code?

vchuravy · 2018-09-17T21:27:18Z

No I don't think that is feasible we have CodegenParameters but that would require a fair amount of refactoring to get it to all places it needs to be, and it is unclear how we would handle it in pure Julia... It might be feasible to use i16 on PPC for host and GPU (e.g. turn float16 off completely).

ViralBShah · 2019-05-05T23:46:59Z

Close this?

vchuravy · 2019-05-05T23:51:33Z

While this PR is outdated, it is still an issue that needs fixing. So I would say leave it open.

…

On Sun, May 5, 2019, 19:47 Viral B. Shah ***@***.***> wrote: Close this? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#26381 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABDO2WJGP3YM4A2TYXMTEDPT5WXLANCNFSM4EUOIRBA> .

vtjnash

@vchuravy says this PR is outdated

vchuravy · 2020-09-10T13:11:48Z

Closed in favour of #37510

vchuravy added 4 commits March 8, 2018 19:11

resolve libcalls to sys.so

6a556f3

LLVM intrinsics either map to instructions or to functions in compiler-rt. Since we provide our own implementation we can just look them up in sys.so and resolve to the function there. On Darwin we have to use a unmangled version of the function name.

switch Float16 from i16 to half

eebad3e

switch Float16 to LLVM intrinsics

8a9abe7

This was referenced Mar 9, 2018

Basic support for builtins from compiler-rt #18734

Closed

[WIP/RFC/RFH] Implement compiler-rt libcalls in Julia #18927

Closed

vchuravy commented Mar 9, 2018

View reviewed changes

vchuravy requested a review from vtjnash March 9, 2018 00:33

fixup! [RTLIB] start implementing soft float16 support.

10d3f58

vchuravy added 2 commits March 8, 2018 21:47

fixup! fixup! [RTLIB] start implementing soft float16 support.

236aa09

fixup! [RTLIB] start implementing soft float16 support.

c94b693

vchuravy changed the title ~~Switch Float16 representation to half~~ Switch Float16 LLVM representation from i16 to half Mar 9, 2018

vchuravy mentioned this pull request Mar 13, 2018

Remove openlibm #26434

Open

17 tasks

KristofferC mentioned this pull request Nov 1, 2018

Float16+Integer does extra conversions #29889

Closed

KristofferC mentioned this pull request Jan 8, 2019

Mixed precision training. FluxML/Flux.jl#543

Closed

vtjnash removed their request for review August 5, 2019 13:39

vtjnash requested changes Aug 5, 2019

View reviewed changes

maleadt mentioned this pull request Nov 13, 2019

Implement wrappers for WMMA LLVM intrinsics JuliaGPU/CUDAnative.jl#494

Merged

vchuravy mentioned this pull request Nov 28, 2019

Change llvmcall ABI representation for Float16 #33970

Merged

maleadt mentioned this pull request Aug 25, 2020

Tracker: Float16 support JuliaGPU/CUDA.jl#391

Open

maleadt mentioned this pull request Sep 10, 2020

Switch Float16 to LLVM's half #37510

Merged

vchuravy closed this Sep 10, 2020

DilumAluthge deleted the vc/float16 branch March 25, 2021 22:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch Float16 LLVM representation from `i16` to `half` #26381

Switch Float16 LLVM representation from `i16` to `half` #26381

vchuravy commented Mar 9, 2018 •

edited

Loading

vchuravy Mar 9, 2018

vchuravy commented Mar 9, 2018

vchuravy commented Mar 9, 2018

vchuravy commented Mar 9, 2018

stevengj commented Mar 9, 2018

yuyichao commented Mar 9, 2018

philtomson commented Sep 17, 2018

vchuravy commented Sep 17, 2018 •

edited

Loading

jekbradbury commented Sep 17, 2018

vchuravy commented Sep 17, 2018

ViralBShah commented May 5, 2019

vchuravy commented May 5, 2019 via email

vtjnash left a comment •

edited by ViralBShah

Loading

vchuravy commented Sep 10, 2020

Switch Float16 LLVM representation from i16 to half #26381

Switch Float16 LLVM representation from i16 to half #26381

Conversation

vchuravy commented Mar 9, 2018 • edited Loading

Notes:

vchuravy Mar 9, 2018

Choose a reason for hiding this comment

vchuravy commented Mar 9, 2018

vchuravy commented Mar 9, 2018

vchuravy commented Mar 9, 2018

stevengj commented Mar 9, 2018

yuyichao commented Mar 9, 2018

philtomson commented Sep 17, 2018

vchuravy commented Sep 17, 2018 • edited Loading

jekbradbury commented Sep 17, 2018

vchuravy commented Sep 17, 2018

ViralBShah commented May 5, 2019

vchuravy commented May 5, 2019 via email

vtjnash left a comment • edited by ViralBShah Loading

Choose a reason for hiding this comment

vchuravy commented Sep 10, 2020

Switch Float16 LLVM representation from `i16` to `half` #26381

Switch Float16 LLVM representation from `i16` to `half` #26381

vchuravy commented Mar 9, 2018 •

edited

Loading

vchuravy commented Sep 17, 2018 •

edited

Loading

vtjnash left a comment •

edited by ViralBShah

Loading