add syntax to destructure array initialization lists #498

andrewrk · 2017-09-26T05:09:29Z

This proposal is an alternative to the rejected multiple expression values proposal (#83). It affects inline assembly improvements (#215). It depends on or at least is related to my comment in #346.

Add ability for functions to have multiple return values.

fn div(numerator: i32, denominator: i32) -> i32, i32 {
    return numerator / denominator, numerator % denominator;
}

If you want an error, it's recommended to use a struct:

error DivByZero;
const DivResult = struct {quotient: i32, remainder: i32 };
fn div(numerator: i32, denominator: i32) -> %DivResult {
    if (denominator == 0) return error.DivByZero;
    return DivResult {
        .quotient = numerator / denominator,
        .remainder = numerator % denominator,
    };
}

return statements can have multiple return values:

fn foo(condition: bool) {
    const x, const y = div(3, 1);
    const a, const b = if (condition) {
        return :this false, 1234;
    } else {
        return :this true, 5678;
    };
}

This is not general-purpose tuples. This is multiple assignment and multiple return values.

Real Actual Use Case: https://github.com/zig-lang/zig/blob/cba4a9ad4a149766c650e3f3d71435cef14867a3/std/os/child_process.zig#L237-L246

The text was updated successfully, but these errors were encountered:

PavelVozenilek · 2017-09-26T13:27:13Z

It would make sense to have named return values:

fn foo() -> int_val : i32, float_val : f32 {
   int_val = 0;
   ...
   if (...) return 10, 3.14;
   ...
   if (...) {
      float_val = 1.0;
      // implicitly converted to return int_val, float_val, compiler makes sure both were set
      return; 
   }
  ...
  if (...) {
     // compiler verifies float_val was set
     return 20, float_val;
  }
  int_val += 1;
  ...
  return int_val, float_val; // equivalent of return;
}

The more return values the more this would help.

Nim does this.

It is similar to proposal:
#83 (comment)

Will there be a chance to use undefined as "I do not care" return value?

fn foo() -> i32, f32 {
   if (...) return 1, undefined;
   ...
   return 1, 2.2;
}

If a structure is used as return type, it would be really handy to define it "inline". Otherwise people may place this type somewhere far away from the function definition, increasing confusion and potential for misuse.

fn my_func() -> const my_result_type = struct {foo: i32, bar: i32 }
    return my_result_type { ...  };
}
 
var x : my_func.my_result_type  = my_func();

Unnamed variant:

fn my_func() -> struct {foo: i32, bar: i32 }
    return { ...  }; // compiler knows it returns unnamed struct type
}
 
// result_type is contextual keyword understood by the compiler
var x : my_func.result_type  = my_func();

Here the function acts like a namespace for the return type.

Risk that someone mistakenly uses the type in inappropriate context is lower.

This should be possible:

fn foo() -> i32, f32 { ... }

var bar, baz = foo(); // type inference

It may be handy to ignore some return value

fn foo() -> i32, f32 { ... }

var bar, undefined = foo(); // type inference for bar

hasenj · 2017-09-27T04:39:21Z

Is it just me or the struct + error thing violates the maxim about

Only one obvious way to do things.

andrewrk · 2017-09-27T13:30:32Z

Can you give an example where it's not obvious which thing to do?

hasenj · 2017-09-28T05:54:41Z

The obvious thing (previously, I suppose) if you want to return multiple values is to use a struct.

Now you have two options: struct or multiple returns.

Suppose you have a function that returns two values, and later it evolves to also support errors. Now you have to rewrite the function and all calls to it so that it uses a struct. Maybe after a few times you decide to always use a struct and never use multiple return values.

Suppose the opposite: you use a struct because the function can return errors, but later it gets simplified and there are no errors anymore. Should you refactor it to return multiple values or leave it as-is?

PavelVozenilek · 2017-09-28T18:38:32Z

@hasenj: adding a struct increases number of "high level things" in the system.

One may be tempted to reuse return structure in different contexts, e.g. as member in some other structure. This discourages later changes.

Struct definition could be placed far away from its function. (Project rules may require such structuring - first define all constants, then the structures, last the functions.) It gets even better when the struct gives no hint of its intended purpose.

Having a struct also requires one to invent new name (could be solved by allowing function_name.return_type).

On the other hand, multiple return values is very local thing. It has no chance to affect unrelated code. It is always present where it is needed: at function definition and function invocations, and nowhere else. One is not temped to extend/reuse it for other purposes.

IMHO it should be preferred to structs/tuples.

hasenj · 2017-09-29T02:12:45Z

Project rules may require such structuring

The problem here is with arbitrary project rules.

I can also see another problem with multiple return values: it's not clear what is what (Just like with a regular tuple).

fn div(numerator: i32, denominator: i32) -> i32, i32

Without looking at the code, which value is the div and which is the mod?

I shall invoke other items from the zen:

Reduce the amount one must remember.

Favor reading code over writing code.

When you get a struct, the field name will clearly denote which item is which.

Avoid local maximums.

It might be easier to write the function once and use it once or twice. But can you imagine a project full of such functions?

It can be tempting to litter the code with multiple-value returning functions instead of properly defining the data structures that represent the problem and solution one is trying to build.

PavelVozenilek · 2017-09-29T10:49:02Z

@hasenj:

Project rules may require such structuring [of source file sections]

The problem here is with arbitrary project rules.

Yes, but this happens and the negative impact could be reduced a bit.

I can also see another problem with multiple return values: it's not clear what is what (Just like with a regular tuple).

Above I proposed optional named return values (adding the ability to manipulate individual values).

At call place returned value is assigned to a named variable. If one uses wrong name or wrong names order ... well, that's mistake like any other.

It might be easier to write the function once and use it once or twice. But can you imagine a project full of such functions?

Yes, I imagine that. Formal project rules kick in with full force:

// Mandatory project header template

//=== constants ===
...
// === types ===
...
// === functions ===
...

More seriously: "too many functions" is problem that should be solved on different level, by proper modularity, hiding the details as much as possible.

Multiple named values have their place: if there are only few of them (hard limit could be used, or some style guide or compiler check, per project) and when they make intuitive sense ( fn date() -> year : u32, month : u32, day : u32 ).

Structures are good if there's reuse, or if the data get too complex.

In C people often prefer multiple return values: all those out parameters by pointer, instead of defining return structure. Projects invent rules where to place these out parameters, tools are created to catch common bugs. This could happen to Zig too.

hasenj · 2017-09-29T13:10:59Z

What's the difference between a named tuple and a struct?

PavelVozenilek · 2017-09-29T13:27:55Z

@hasenj: no, I do not mean named tuple (which can be freely used in other places). I mean:

fn foo() -> ret_val1 : i32, ret_val2 : f64 { ... }

var x, y = foo();

The point is that the ret_val1 : i32, ret_val2 : f64 is tied to this function only, is predictably always at the right place, and does not require unique name.

PavelVozenilek · 2017-10-20T01:49:26Z

There is yet another use case for multiple return values: comptime expressions.

Setting a value using comptime is tricky (perhaps I didn't learn enough).

This works:

const x = comptime {
  var i : i32 = 99;
  i += 1;
  i
};

It is bit clumsy (avoid ; after last expression, don't forget ; after closing bracket) and, mainly, it does not allow to return more than one value. Yes, one can define a struct, but this makes design more complicated than it needs to be.

I imagine something as:

const x, y, z = comptime {
  ...
  i, j , k
};

hasenj · 2017-10-20T05:07:35Z

Some further questions to consider:

Can the multiple return include an error as one of the items?

 error DivByZero;
 fn div(numerator: i32, denominator: i32) -> (i32, i32, error) {
     if (denominator == 0) return (0, 0, error.DivByZero);
     return (numerator / denominator, numerator % denominator, null);
 }

Why can't a multiple return value also be wrapped/union-ed with an error value?

 error DivByZero;
 fn div(numerator: i32, denominator: i32) -> %(i32, i32) {
     if (denominator == 0) return error.DivByZero;
     return (numerator / denominator, numerator % denominator);
 }

I think the main issue I'm trying to raise is, why can't "tuples" be used outside the context of a function return? It seems like an asymmetry that can cause problems or confusion. One of which is the inability to union the return value with an error.

andrewrk · 2017-12-08T04:51:16Z

The questions brought up by @hasenj are resolved with #632, and since that's now accepted, I'm going to accept this proposal as well.

andrewrk · 2018-06-01T16:07:04Z

Removing accepted label as it conflicts with #208.

andrewrk · 2018-11-21T03:57:48Z

This proposal depends on #208 and #287 and would allow something like this:

const S = struct {field: i32};
var s: S = undefined;
var x, const y, s.field = blk: {
    break :blk .{foo(), bar(), baz() + 1};
};

InKryption · 2021-07-04T06:53:27Z

@billzez I'd just like to point out, let(.{ &a, &b }, .{b, a}); wouldn't allow for destructuring into const variables.

moosichu · 2021-07-04T10:26:26Z

Another option could be out parameters (like C# has), which does have some nice benefits as it can allow for APIs that can scope variable initialisation conditionally as well. Eg:

if(queue.tryDequeue(out const someVar)) {...}

jamii · 2022-10-05T14:39:45Z

(EDIT moved to #3805 (comment))

This change implements the following syntax into the compiler: ```zig const x: u32, var y, foo.bar = .{ 1, 2, 3 }; ``` A destructure expression may only appear within a block (i.e. not at comtainer scope). The LHS consists of a sequence of comma-separated var decls and/or lvalue expressions. The RHS is a normal expression. A new result location type, `destructure`, is used, which contains result pointers for each component of the destructure. This means that when the RHS is a more complicated expression, peer type resolution is not used: each result value is individually destructured and written to the result pointers. RLS is always used for destructure expressions, meaning every `const` on the LHS of such an expression creates a true stack allocation. Aside from anonymous array literals, Sema is capable of destructuring the following types: * Tuples * Arrays * Vectors A destructure may be prefixed with the `comptime` keyword, in which case the entire destructure is evaluated at comptime: this means all `var`s in the LHS are `comptime var`s, every lvalue expression is evaluated at comptime, and the RHS is evaluated at comptime. If every LHS is a `const`, this is not allowed: as with single declarations, the user should instead mark the RHS as `comptime`. There are a few subtleties in the grammar changes here. For one thing, if every LHS is an lvalue expression (rather than a var decl), a destructure is considered an expression. This makes, for instance, `if (cond) x, y = .{ 1, 2 };` valid Zig code. A destructure is allowed in almost every context where a standard assignment expression is permitted. The exception is `switch` prongs, which cannot be destructures as the comma is ambiguous with the end of the prong. A follow-up commit will begin utilizing this syntax in the Zig compiler. Resolves: ziglang#498

nektro · 2023-09-16T03:36:12Z

is this going to also support destructuring named structs by field name? i worry that if the answer is not eventually yes it might slightly pressure apis to use tuples more often and hurt readability in the long term

Implemented yesterday: ziglang/zig#498

This change implements the following syntax into the compiler: ```zig const x: u32, var y, foo.bar = .{ 1, 2, 3 }; ``` A destructure expression may only appear within a block (i.e. not at comtainer scope). The LHS consists of a sequence of comma-separated var decls and/or lvalue expressions. The RHS is a normal expression. A new result location type, `destructure`, is used, which contains result pointers for each component of the destructure. This means that when the RHS is a more complicated expression, peer type resolution is not used: each result value is individually destructured and written to the result pointers. RLS is always used for destructure expressions, meaning every `const` on the LHS of such an expression creates a true stack allocation. Aside from anonymous array literals, Sema is capable of destructuring the following types: * Tuples * Arrays * Vectors A destructure may be prefixed with the `comptime` keyword, in which case the entire destructure is evaluated at comptime: this means all `var`s in the LHS are `comptime var`s, every lvalue expression is evaluated at comptime, and the RHS is evaluated at comptime. If every LHS is a `const`, this is not allowed: as with single declarations, the user should instead mark the RHS as `comptime`. There are a few subtleties in the grammar changes here. For one thing, if every LHS is an lvalue expression (rather than a var decl), a destructure is considered an expression. This makes, for instance, `if (cond) x, y = .{ 1, 2 };` valid Zig code. A destructure is allowed in almost every context where a standard assignment expression is permitted. The exception is `switch` prongs, which cannot be destructures as the comma is ambiguous with the end of the prong. A follow-up commit will begin utilizing this syntax in the Zig compiler. Resolves: ziglang#498

andrewrk added enhancement Solving this issue will likely involve adding new logic or components to the codebase. proposal This issue suggests modifications. If it also has the "accepted" label then it is planned. labels Sep 26, 2017

andrewrk added this to the 0.2.0 milestone Sep 26, 2017

andrewrk added the accepted This proposal is planned. label Dec 8, 2017

andrewrk modified the milestones: 0.2.0, 0.3.0 Feb 28, 2018

andrewrk removed the accepted This proposal is planned. label Jun 1, 2018

andrewrk modified the milestones: 0.3.0, 0.4.0 Jul 18, 2018

andrewrk removed the enhancement Solving this issue will likely involve adding new logic or components to the codebase. label Nov 21, 2018

andrewrk changed the title ~~proposal: multiple block return values~~ add syntax to destructure array initialization lists Nov 21, 2018

andrewrk added the accepted This proposal is planned. label Feb 15, 2019

andrewrk modified the milestones: 0.4.0, 0.5.0 Feb 15, 2019

andrewrk modified the milestones: 0.5.0, 0.6.0 Aug 28, 2019

Snektron mentioned this issue Aug 20, 2021

Functions Pointers Snektron/vulkan-zig#19

Open

InKryption mentioned this issue Sep 17, 2021

@hide: a way to manually/prematurely end the scope of identifiers. #9792

Closed

InKryption mentioned this issue Oct 22, 2021

std.mem: add indexOfMin and indexOfMax #9915

Merged

andrewrk modified the milestones: 0.9.0, 0.10.0 Nov 23, 2021

andrewrk mentioned this issue Nov 30, 2021

make overflow arithmetic builtins return a tuple instead of using a pointer parameter and bool return value #10248

Closed

andrewrk modified the milestones: 0.10.0, 0.11.0 Apr 16, 2022

andrewrk modified the milestones: 0.11.0, 0.12.0 Apr 9, 2023

andrewrk modified the milestones: 0.13.0, 0.12.0 Jul 9, 2023

andrewrk closed this as completed in 88f5315 Sep 15, 2023

andrewrk modified the milestones: 0.13.0, 0.12.0 Sep 15, 2023

andrewrk added the accepted This proposal is planned. label Sep 16, 2023

mk12 added a commit to mk12/blog that referenced this issue Sep 16, 2023

Use destructuring!

8eb7920

Implemented yesterday: ziglang/zig#498

Vexu mentioned this issue Nov 30, 2023

std: use math overflow helpers instead of builtins #18165

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add syntax to destructure array initialization lists #498

add syntax to destructure array initialization lists #498

andrewrk commented Sep 26, 2017 •

edited

Loading

PavelVozenilek commented Sep 26, 2017

hasenj commented Sep 27, 2017

andrewrk commented Sep 27, 2017

hasenj commented Sep 28, 2017

PavelVozenilek commented Sep 28, 2017

hasenj commented Sep 29, 2017

PavelVozenilek commented Sep 29, 2017

hasenj commented Sep 29, 2017

PavelVozenilek commented Sep 29, 2017

PavelVozenilek commented Oct 20, 2017

hasenj commented Oct 20, 2017 •

edited

Loading

andrewrk commented Dec 8, 2017

andrewrk commented Jun 1, 2018

andrewrk commented Nov 21, 2018

InKryption commented Jul 4, 2021

moosichu commented Jul 4, 2021

jamii commented Oct 5, 2022 •

edited

Loading

nektro commented Sep 16, 2023

add syntax to destructure array initialization lists #498

add syntax to destructure array initialization lists #498

Comments

andrewrk commented Sep 26, 2017 • edited Loading

PavelVozenilek commented Sep 26, 2017

hasenj commented Sep 27, 2017

andrewrk commented Sep 27, 2017

hasenj commented Sep 28, 2017

PavelVozenilek commented Sep 28, 2017

hasenj commented Sep 29, 2017

PavelVozenilek commented Sep 29, 2017

hasenj commented Sep 29, 2017

PavelVozenilek commented Sep 29, 2017

PavelVozenilek commented Oct 20, 2017

hasenj commented Oct 20, 2017 • edited Loading

andrewrk commented Dec 8, 2017

andrewrk commented Jun 1, 2018

andrewrk commented Nov 21, 2018

InKryption commented Jul 4, 2021

moosichu commented Jul 4, 2021

jamii commented Oct 5, 2022 • edited Loading

nektro commented Sep 16, 2023

andrewrk commented Sep 26, 2017 •

edited

Loading

hasenj commented Oct 20, 2017 •

edited

Loading

jamii commented Oct 5, 2022 •

edited

Loading