Add BFloat16 runtime intrinsics. #51790
Merged
After switching to LLVM for BFloat16 in #51470 (i.e., relying on `Intrinsics.sub_float` etc. instead of hand-rolling bit-twiddling implementations), we also need to provide fallback runtime implementations for these intrinsics. This is unfortunate; I had hoped to keep as much BFloat16-related functionality as possible in BFloat16s.jl. It also required modifying the unary-operator preprocessor macros in order to differentiate between Float16 and BFloat16; I didn't generalize that to all intrinsics, as the code is hairy enough already (and it's currently only useful for `fptrunc`/`fpext`). A sketch of what such a fallback looks like is below.
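For illustration, here is a minimal sketch (not the PR's actual code) of how such a runtime fallback can be written in C: widen the BFloat16 to Float32, perform the operation there, and narrow back with round-to-nearest-even. The names `bfloat16_t` and `jl_bfloat16_sub` are hypothetical; the real runtime presumably stamps out one such function per intrinsic via the preprocessor macros mentioned above.

```c
#include <stdint.h>
#include <string.h>

typedef uint16_t bfloat16_t; /* raw BFloat16 bit pattern */

/* fpext fallback: BFloat16 is the high 16 bits of an IEEE Float32. */
static float bfloat16_to_float(bfloat16_t x)
{
    uint32_t bits = (uint32_t)x << 16;
    float f;
    memcpy(&f, &bits, sizeof(f));
    return f;
}

/* fptrunc fallback: narrow to BFloat16 with round-to-nearest-even. */
static bfloat16_t float_to_bfloat16(float f)
{
    uint32_t bits;
    memcpy(&bits, &f, sizeof(bits));
    if ((bits & 0x7fffffff) > 0x7f800000) /* NaN: quiet it, keep the sign */
        return (bfloat16_t)((bits >> 16) | 0x0040);
    /* bias of 0x7fff, plus 1 if the bit we keep is odd (ties-to-even) */
    uint32_t rounding_bias = 0x00007fff + ((bits >> 16) & 1);
    return (bfloat16_t)((bits + rounding_bias) >> 16);
}

/* Fallback for e.g. sub_float on BFloat16: compute in Float32 and narrow. */
bfloat16_t jl_bfloat16_sub(bfloat16_t a, bfloat16_t b)
{
    return float_to_bfloat16(bfloat16_to_float(a) - bfloat16_to_float(b));
}
```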
@vtjnash @Keno Any suggestions for an alternative approach that keeps more of BFloat16 out of Base? Ideally we'd implement these runtime fallbacks in Julia, as part of BFloat16s.jl (in fact, most of them already have an implementation over there), but that seems hard. Alternatively, we could require codegen for these intrinsics, i.e., not provide runtime fallbacks at all.