Finalize opcode encodings #29

lars-t-hansen · 2019-02-13T08:13:30Z

As part of the effort to drive both this proposal and the bulk memory proposal toward shipping status, let's nail down the opcode encodings. (The bulk memory proposal depends on what we choose for ref.null and ref.func, since those are used to express passive element segments.)

The spec interpreter in this repo has some TODO comments around the opcode encodings, and some opcodes are missing from the interpreter at present, and all proposed opcodes are precious single-byte ones.

From the interpreter in this repo we have:

0x25 == table.get
0x26 == table.set
0xd0 == ref.null
0xd1 == ref.is_null
0xd2 == ref.func

From the bulk memory proposal we have these proposed codes:

0xfc 0x08 == memory.init
0xfc 0x09 == data.drop
0xfc 0x0a == memory.copy
0xfc 0x0b == memory.fill
0xfc 0x0c == table.init
0xfc 0x0d == elem.drop
0xfc 0x0e == table.copy

In addition we need opcodes in this proposal for table.grow, table.size, and (possibly) table.fill.

We are in somewhat short supply of single-byte opcodes so I propose that we (a) change the encoding of ref.func since is not likely to be a very common opcode, and (b) allocate prefixed opcodes also for the three table operations mentioned above, yielding the following table for the present proposal:

0x25 == table.get
0x26 == table.set

0xd0 == ref.null
0xd1 == ref.is_null

0xfc 0x0f == table.grow
0xfc 0x10 == table.size
0xfc 0x11 == table.fill

0xfc 0x20 == ref.func

with the idea that 0xfc 0x20 can be the start of the group for multi-byte gc/reftypes operations, and 0xd0 remains the start of the group for single-byte gc operations.

@rossberg @binji @lukewagner @titzer, opinions?

The text was updated successfully, but these errors were encountered:

rossberg · 2019-02-13T08:25:46Z

I wouldn't mind moving table.get/set to 0xfc as well, but I'm fine either way.

On the other hand I think ref.func should remain single-byte -- note that one of its uses will be as a constant instruction in generalised element segments, where it could be fairly frequent.

lars-t-hansen · 2019-02-13T08:48:43Z

I agree that making ref.func two-byte will bloat the passive element segments that contain functions, but given how regular such an element segment would look to a level-1 wasm compressor or even a generic compressor, I'd expect that use case not to be all that important for compact encoding (during transmission).

(No objection to moving table.get / table.set.)

rossberg · 2019-02-13T09:27:50Z

But why rely on external compression for cases where we can avoid it without notable drawback? In this case there really seems to be no benefit in doing so.

lars-t-hansen · 2019-02-13T09:45:22Z

The benefit would be saving a scarce single-byte opcode (but only that).

binji · 2019-02-13T16:42:41Z

I believe the MVP leaves us with 68 single-byte opcodes, assuming all opcodes from 0xf0 through 0xff are reserved for prefixes. Of those, a few are already being claimed by proposals: 5 for exception handling, 2 for tail calls, and 5 for sign-extension. That leaves 56 opcodes remaining: [0x14..0x19], [0x1c..0x1f], [0x25..0x27], [0xc5..0xef].

Making ref.func a single byte seems useful to me, and given that we still have this many single-byte opcodes, I'm not too concerned yet about using one more.

Keeping table.get and table.set at 0x25 and 0x26 is nice aesthetically since they're near global.get and global.set. But I'd be OK moving them too.

rossberg · 2019-02-13T16:45:48Z

Okay, I'd propose leaving all three as single-byte then.

alexcrichton · 2019-02-13T21:05:54Z

Would it be worth finalizing the table index encodings as well? It looks like the interpreter in this repo for table.get encodes the index as a varuint32 after the opcode but I think Firefox uses an 0x4 byte followed by varuint32 for nonzero table indices. (also for existing instructions like where to put the table index for call_indirect)

lars-t-hansen · 2019-02-14T06:28:14Z

Firefox is going to be updated shortly to follow the interpreter here. The flag byte is not useful, it's just a holdover from an older regime.

lars-t-hansen · 2019-02-14T12:24:56Z

Summarizing the discussion so far, the proposal is:

0x25 == table.get
0x26 == table.set

0xd0 == ref.null
0xd1 == ref.is_null
0xd2 == ref.func      (* modulo bikeshedding of the name *)

0xfc 0x0f == table.grow
0xfc 0x10 == table.size
0xfc 0x11 == table.fill

Additionally, the table arguments to all the table operations are simple varuint32 values that are always present, there are no flags. This includes the second operand to call_indirect.

lars-t-hansen · 2019-11-15T09:15:06Z

Landed and fixed everywhere.

…able.copy`. (#29) This would make it simpler to extend those instructions to support multiple memories/tables, and copying between different memories/tables. The current encoding has a single placeholder zero byte for those instructions, which allows extension to multiple memories/tables, but would require a more complicated encoding to add two immediate indices.

See #18, #29, and #36

binji mentioned this issue Feb 13, 2019

Add support for the reference types proposal WebAssembly/wabt#938

Merged

alexcrichton mentioned this issue Feb 14, 2019

Fill out support for table manipulation instructions bytecodealliance/wasmparser#91

Merged

lars-t-hansen mentioned this issue Apr 3, 2019

[spec/interpreter/test] Add table bulk instructions missing from bulk op proposal #35

Merged

lars-t-hansen closed this as completed Nov 15, 2019

rossberg pushed a commit that referenced this issue Nov 20, 2019

Two zero immediates for memory.copy and table.copy (#43)

178a7f6

See #18, #29, and #36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finalize opcode encodings #29

Finalize opcode encodings #29

lars-t-hansen commented Feb 13, 2019

rossberg commented Feb 13, 2019

lars-t-hansen commented Feb 13, 2019

rossberg commented Feb 13, 2019

lars-t-hansen commented Feb 13, 2019

binji commented Feb 13, 2019

rossberg commented Feb 13, 2019

alexcrichton commented Feb 13, 2019

lars-t-hansen commented Feb 14, 2019

lars-t-hansen commented Feb 14, 2019 •

edited

Loading

lars-t-hansen commented Nov 15, 2019

Finalize opcode encodings #29

Finalize opcode encodings #29

Comments

lars-t-hansen commented Feb 13, 2019

rossberg commented Feb 13, 2019

lars-t-hansen commented Feb 13, 2019

rossberg commented Feb 13, 2019

lars-t-hansen commented Feb 13, 2019

binji commented Feb 13, 2019

rossberg commented Feb 13, 2019

alexcrichton commented Feb 13, 2019

lars-t-hansen commented Feb 14, 2019

lars-t-hansen commented Feb 14, 2019 • edited Loading

lars-t-hansen commented Nov 15, 2019

lars-t-hansen commented Feb 14, 2019 •

edited

Loading