Special-case switching on error union capture #18173

dweiller · 2023-12-01T05:14:19Z

Closes #11957, as well as the related issues #16770 and #11812.

There are a few things left to do before merging:

- implement AstGen (and any Sema changes needed) for the `if (x) { ... } else |err| switch (err) { ... }` pattern
- check that compile error messages make sense (e.g. source locations) and consider adding compile error test cases
- clean up commit history (drop the commit adding the -Donly-ast-check build flag?)
- look for ways to share more logic between normal switch, condbr and error union switch (both in AstGen.zig and Sema.zig)
- mention the special-cased if and catch patterns in langref (contrast with using a block around the switch)

dweiller · 2023-12-02T04:52:47Z

Not sure what the errors in the CI are - I can reproduce the segfault in bootstrap locally, but don't know what would be causing it. Perhaps there is some incompatibility with the x86_64 backend.

Edit: The segfault referenced above has been fixed, but there is a new/different segfault happening in the CI that I cannot reproduce.

matu3ba · 2023-12-15T08:24:33Z

Looks like stage2 is broken, probably due to the C backend emitting broken code in readFromPackedMemory and utf8ToUtf16Le. Since this is Linux Release, I'd guess that readFromPackedMemory is broken.
Did you try running the command inside valgrind? You can also play around with setting -Wstringop-overflow=X to see what overflows occur. -Wformat-overflow and -Wformat-truncation also enable those, and there's also -Wformat=2 and -Wformat-truncation. See also https://interrupt.memfault.com/blog/best-and-worst-gcc-clang-compiler-flags.

I see 3 ways forward, since I suspect you checked that the miscompilation is not detectable with standard debugging tools (valgrind, memsanitizer etc.):

1. try to add workarounds so that the C backend does not break readFromPackedMemory, using a different usage pattern
2. try to find a minimal reproduction that gets the C backend to emit broken code for readFromPackedMemory, and fix it
3. add a workaround in the C backend after understanding the broken C code

cc -o zig2 zig2.c compiler_rt.c -std=c99 -O2 -fno-stack-protector -Istage1 -Wl,-z,stack-size=0x10000000 -pthread
zig2.c: In function ‘unicode_utf8ToUtf16Le__91520’:
zig2.c:3263751: warning: assignment to ‘const uint16_t (*)[16]’ {aka ‘const short unsigned int (*)[16]’} from incompatible pointer type ‘const uint16_t *’ {aka ‘const short unsigned int *’} [-Wincompatible-pointer-types]
3263751 |    t26 = t25.ptr;
        | 
In function ‘mem_readPackedIntLittle__anon_210076__210076’,
    inlined from ‘mem_readPackedInt__anon_187698__187698’ at zig2.c:2210564:0,
    inlined from ‘value_Value_readFromPackedMemory__22759’ at zig2.c:1798160:0:
zig2.c:2633608: warning: ‘memcpy’ reading 16 bytes from a region of size 10 [-Wstringop-overread]
2633608 |  memcpy(&t11, &t10, sizeof(zig_u128));
        | 
zig2.c: In function ‘value_Value_readFromPackedMemory__22759’:
zig2.c:2633585: note: source object ‘t10’ of size 10
2633585 |  uint8_t t10[10];
        | 
In function ‘mem_readPackedIntBig__anon_210077__210077’,
    inlined from ‘mem_readPackedInt__anon_187698__187698’ at zig2.c:2210568:0,
    inlined from ‘value_Value_readFromPackedMemory__22759’ at zig2.c:1798160:0:
zig2.c:2633682: warning: ‘memcpy’ reading 16 bytes from a region of size 10 [-Wstringop-overread]
2633682 |  memcpy(&t13, &t12, sizeof(zig_u128));
        | 
zig2.c: In function ‘value_Value_readFromPackedMemory__22759’:
zig2.c:2633644: note: source object ‘t12’ of size 10
2633644 |  uint8_t t12[10];
        | 
zig2.c: In function ‘value_Value_readFromMemory__22758’:
zig2.c:1395844: warning: ‘memcpy’ reading 16 bytes from a region of size 10 [-Wstringop-overread]
1395844 |      memcpy(&t47, &t46, sizeof(zig_u128));
        | 
zig2.c:1395703: note: source object ‘t46’ of size 10
1395703 |  uint8_t t46[10];
        | 
+ ./zig2 build -Dno-lib

Please let me know what you think.

rootbeer · 2023-12-15T17:25:53Z

Aren't the quoted warnings from cc about reading-from-the-wrong-size all just warnings? I feel like I see those all the time, and have successfully ignored them (they do seem scary though). You can see the same warnings in the successful debug build too: https://github.com/ziglang/zig/actions/runs/7217377559/job/19665091928?pr=18173

The failure in this case seems to be after those:

+ ./zig2 build -Dno-lib
Segmentation fault
Error: Process completed with exit code 139.

matu3ba · 2023-12-15T18:59:13Z

"all just warnings"

Buffer over-reads and overwrites are undefined behavior, and a compiler that conforms to the C standard is free to optimize on the assumption that they never happen, which can miscompile your program.
I cannot tell you whether that definitely happened here, but this UB is a candidate cause to consider.

dweiller · 2024-01-09T05:00:09Z

"Please let me know what you think."

I think you were probably right. I started investigating and it looked like a bug when readPackedIntLittle was called for a u80 (by readFromMemory for an f80): for some reason the bitcast was emitting a memcpy sized for a zig_u128 (the destination type) rather than for the 10-byte buffer of the u80 source. The issue seems to have been resolved by Andrew's build module rework in #18160, though I have no idea why, as that PR did not touch src/codegen/c.zig and the logic for the bitcast AIR instruction in there looked like it would already use the correct size...

Now that I've rebased to include those changes, hopefully the CI will pass on all the x86_64-linux targets.
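
To make the u80 case above concrete, here is a minimal C sketch of the difference between copying the destination size and copying the source size. It is not the output of src/codegen/c.zig; `u128_t`, `U80_BYTES` and `widen_u80` are hypothetical names chosen for illustration, and `u128_t` is only a stand-in for zig_u128.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 16-byte stand-in for zig_u128 (an assumption for this sketch). */
typedef struct {
    uint64_t lo;
    uint64_t hi;
} u128_t;

/* Size in bytes of the u80 source buffer, as in `uint8_t t10[10]`. */
enum { U80_BYTES = 10 };

/*
 * The pattern flagged by -Wstringop-overread looks like
 *
 *     uint8_t t10[10];
 *     memcpy(&t11, &t10, sizeof(zig_u128));   // reads 16 bytes from 10
 *
 * The variant below copies only as many bytes as the source holds and
 * zeroes the rest of the destination first, so no byte is read out of
 * bounds and no destination byte is left indeterminate.
 */
static u128_t widen_u80(const uint8_t *src)
{
    u128_t dst;
    memset(&dst, 0, sizeof dst);   /* define the 6 bytes the source does not cover */
    memcpy(&dst, src, U80_BYTES);  /* copy the source size, not sizeof(dst) */
    return dst;
}
```

Calling `widen_u80` on a 10-byte buffer yields a 16-byte object whose first 10 bytes equal the source and whose remaining 6 bytes are zero. How those bytes map onto `lo` and `hi` depends on the host's byte order.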

Vexu · 2024-01-09T15:28:16Z

doc/langref.html.in

+    // The non-error and error cases are only peers if the error case is just a switch expression;
+    // the pattern `if (x) {...} else |err| blk: { switch (err) {...} }` does not consider the
+    // non-error and error case to be peers.


I don't like how this turns a limitation of the compiler into a feature of the language.

I was just trying to be clear about what the behaviour is so as to not cause confusion, but I can see that this may be the source of future problems if peer type resolution changes for these cases. Should I remove the sentence, or add a note saying something along the lines of 'this is the current behaviour and may change'?

andrewrk

Thank you @dweiller - nice work. The amount of additions and copy-pasted code makes me a bit uneasy, but you made up for it with robust test coverage.

I have a couple review comments but they are insignificant and so I will proceed with the merge regardless. Feel free to ignore them, they can be cleaned up later along with a future enhancement or bug fix.

andrewrk · 2024-01-09T21:36:42Z

src/Zir.zig

+        pub const Bits = packed struct(u32) {
+            /// If true, one or more prongs have multiple items.
+            has_multi_cases: bool,
+            /// If true, there is an else prong. This is mutually exclusive with `has_under`.


this comment looks like it was copy pasted and no longer applies since there is no has_under field.

andrewrk · 2024-01-09T21:46:33Z

src/Sema.zig

+        sema.inst_map.putAssumeCapacity(err_capture_inst, spa.operand);
+    }
+    defer if (extra.data.bits.any_uses_err_capture) assert(sema.inst_map.remove(err_capture_inst));
+    _ = try sema.analyzeSwitchRuntimeBlock(


it looks like this return value is never used by any caller.

andrewrk · 2024-01-09T21:51:41Z

doc/langref.html.in

+    // The non-error and error cases are only peers if the error case is just a switch expression;
+    // the pattern `if (x) {...} else |err| blk: { switch (err) {...} }` does not consider the
+    // non-error and error case to be peers.


I think it's good enough for now. I need to go over the langref anyway; it's collected quite a few different contributors' conflicting ideas about what a langref should be and it needs to be reworked by one person with a vision.

dweiller · 2024-01-10T02:15:28Z

"Thank you @dweiller - nice work. The amount of additions and copy-pasted code makes me a bit uneasy, but you made up for it with robust test coverage."

I wouldn't be surprised if there is a way to unify the new logic more with the regular switch logic to reduce the amount of duplication. However, there are a number of subtle differences that made it tricky to do without interleaving a bunch of if (this_is_an_error_union_switch) checks, which I think won't make it clearer.

dweiller force-pushed the switch-err-union branch 3 times, most recently from 0f08626 to 0436b3b on December 1, 2023 06:07

dweiller force-pushed the switch-err-union branch 3 times, most recently from ad85f90 to 815b4d4 on December 8, 2023 11:45

dweiller force-pushed the switch-err-union branch from 815b4d4 to 905abb9 on December 14, 2023 07:11

dweiller marked this pull request as ready for review December 14, 2023 07:35

dweiller force-pushed the switch-err-union branch from 905abb9 to 599ca96 on December 15, 2023 02:57

dweiller force-pushed the switch-err-union branch from 599ca96 to 3d26d8c on December 30, 2023 01:58

dweiller added 16 commits January 9, 2024 14:42

zir: remove unused zir as instruction

063d55c

zir: add switch_block_err_union

4136097

sema: refactor error set switch logic

b784f64

astgen: use switch_block_err_union

2cf648f

sema: implement switch_block_err_union on comptime operands

ae19f69

sema: extract runtime switch AIR generation to function

6bf319e

sema: implement runtime switch_block_err_union

a175a64

sema: allow maybeErrorUnwrap to handle err_union_code

2fa69cc

sema: fix err union switch with inferred empty error sets

adcaad6

fix x86_64 crashes for switch_block_err_union

b7eb59f

This change only emits the unwrap_errunion_err instruction if the error capture is actually used in a branch.

astgen/sema: use switch_block_err_union for if-else-switch

6a18cee

langref: mention error union switch peer resolution

8695bc7

astgen/sema: fix source locations for switch_block_err_union

fc6dc79

test: add tests for switch_block_err_union

69ab687

sema: inherit block want_safety for err switch union

ec5b751

fixup! astgen: use switch_block_err_union

67d7d7b

dweiller force-pushed the switch-err-union branch from 3d26d8c to 67d7d7b on January 9, 2024 05:00

Vexu reviewed Jan 9, 2024

andrewrk approved these changes Jan 9, 2024

andrewrk merged commit acca16c into ziglang:master Jan 9, 2024
10 checks passed

dweiller deleted the switch-err-union branch January 10, 2024 01:57

travisstaloch mentioned this pull request Jan 16, 2024

regression: ast gen fails to catch incorrect by ref error captures #18583

Closed

melonedo mentioned this pull request Jan 16, 2024

Spurious traceback with enum switch #18574

Closed

This was referenced Jan 16, 2024

switch on error preserves error return trace #18591

Closed

switch on error doesn't validate operand type #18592

Closed

melonedo mentioned this pull request Jan 22, 2024

Traceback is misleading for http.Client #18650

Closed

xdBronch mentioned this pull request May 7, 2024

incorrect peer type resolution taking the address of catch |err| switch (err) #19881

Closed
