Implement shiftleft/shiftright by moving packet around #1032

Closed
wants to merge 6 commits

Conversation

@wingo (Contributor) commented Oct 14, 2016

Instead of memmoving a packet's data around, this PR moves the packet
pointer itself.

The only performance testing I did was "snabb lwaftr bench", where it
appears to be neutral to a slight win (relative to the previous strategy, in which we manually managed a headroom and data pointer per packet). I am hoping that the Hydra/R
scientists can help me out on this one.

This change to the packet structure has knock-on effects in two
significant ways. One is that the shiftleft, shiftright, and prepend
functions now return a new packet pointer and silently corrupt the old
one (they may have shifted the data around, but in any case they had to
update the "length" field at its new location). That's somewhat OK though.
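
To make the idea concrete, here is a minimal sketch of a pointer-moving shiftright. It is not the actual core.packet code: the struct layout, the packet_alignment value, and the headroom calculation are illustrative assumptions.

  local ffi = require("ffi")

  ffi.cdef[[
     void *memmove(void *dest, const void *src, size_t n);
     struct packet { uint16_t length; unsigned char data[10240]; };
  ]]

  local packet_alignment = 512 -- assumed allocation alignment / max headroom

  -- Headroom in front of the struct, assuming each packet is allocated at
  -- an address that is a multiple of packet_alignment.
  local function headroom (p)
     return tonumber(ffi.cast("uintptr_t", p) % packet_alignment)
  end

  local function shiftright (p, bytes)
     local length = p.length
     if bytes <= headroom(p) then
        -- Fast path: slide the whole struct back into the headroom.  The
        -- payload stays put and `bytes` of fresh space appears at the front
        -- of p.data; the old pointer (and the old header bytes, which now
        -- lie inside the data area) must not be used afterwards.
        p = ffi.cast("struct packet *", ffi.cast("char *", p) - bytes)
     else
        -- Slow path: not enough headroom, fall back to moving the payload.
        ffi.C.memmove(p.data + bytes, p.data, length)
     end
     p.length = length + bytes
     return p
  end

shiftleft would be the mirror image: within the headroom limit it moves the struct pointer forward past the dropped bytes and rewrites the length at the new location.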

The problem comes in the datagram library, which sometimes wants to
work on a user-supplied packet. The sequence was:

  local dgram = datagram:new(pkt)
  dgram:push(....)
  dgram:free()
  ...
  foo(pkt)

This pattern is no longer valid :( The reason is that the datagram
methods that mutate a packet often do so via packet.prepend(). The
datagram library updates its internal packet pointer, but the external
one that was initially passed in is no longer valid. So much of this
patch consists of visiting (hopefully) all of the users and updating them.
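
Concretely, callers now have to re-fetch the packet from the datagram after any mutating call. A sketch of the updated pattern, assuming the datagram object exposes its current packet via a packet() accessor (`header` is a placeholder argument):

  local dgram = datagram:new(pkt)
  dgram:push(header)    -- may replace the underlying packet pointer
  pkt = dgram:packet()  -- re-fetch the (possibly new) packet pointer
  dgram:free()
  ...
  foo(pkt)              -- safe: pkt is the datagram's current packet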

See also Igalia#407 and Igalia#481.

Note that this PR does not update the virtio code to use the new interfaces. (In the lwaftr branch, we use direct descriptors and push/pop the virtq tags using this mechanism.)

@lukego (Member) commented Oct 14, 2016

Exciting!

@lukego (Member) commented Oct 14, 2016

@wingo How about if I set up a Hydra job that benchmarks branch wingo/snabb#optimize compared with master and next? Then you could force-push experimental code like this that you want Hydra to benchmark before we upstream. If you create that branch and let me know, I will set up the jobset (the same offer goes for everybody else).

@lukego (Member) commented Oct 14, 2016

(btw @domenkozar and others are working on extending Hydra to be able to automatically test GitHub PRs. This will be nice.)

@wingo (Contributor, Author) commented Oct 14, 2016

@lukego done :) I assume you want it named "optimize" just so that I can test other things by force-pushing to that branch? Sounds good to me :) We definitely need to throw some science at this one!

@domenkozar (Member) commented Oct 14, 2016

Just for the record, the Hydra + GitHub PR integration prototype is done, but it depends on NixOS/nixpkgs#19396.

@lukego (Member) commented Oct 14, 2016

Groovy. I created jobset snabb-new-tests/optimize-murren that tests many configurations (all dpdk/qemu/etc.) but runs only a small number of iterations per scenario (3). This should provide a solid picture of the overall impact of the change, but we have to be cautious about slicing-and-dicing too much (the jobset testing next does 10x more iterations).

Have to revise the R reports soon. I would particularly like to slice-and-dice datasets into groups using tools that avoid false positives like xkcd 882 by correcting the confidence level based on how much the data is being "tortured." Visualizations are weak in this respect - easy to see patterns in randomness - and we probably need to include e.g. properly adjusted confidence intervals.

@lukego (Member) commented Oct 14, 2016

@domenkozar (Member) commented:

These are harmless, they are placeholder values for the nixpkgs architecture to know what packages to build. I'll remove them in master :)

@lukego (Member) commented Oct 14, 2016

The first report is up. I know that reading these things is a bit of an art, but it looks like the change is basically neutral and does not cause any performance regression there.

This was tested on the murren servers without a hardware NIC. I suppose it makes sense to also test on the lugano servers that do have hardware NICs in case, for example, the alignment of memory has implications for DMA.

(On reflection I believe that the 82599 requires DMA to be aligned to even-numbered addresses. If that is right then we might need to hold that invariant for struct packet and fall back to the copying mode in case the packet is offset by an odd amount.)
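
Concretely, such a guard could look something like this (a sketch on top of the earlier one; the dma_alignment constant and the headroom helper are assumptions, not the shipped code):

  local dma_alignment = 2  -- e.g. if the 82599 requires even DMA addresses

  -- Only take the pointer-moving fast path when the shift keeps the struct
  -- on an even address; otherwise fall back to copying the payload.
  local function can_move_pointer (p, bytes)
     return bytes % dma_alignment == 0 and bytes <= headroom(p)
  end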

@domenkozar (Member) commented:

@lukego Addressed in snabblab/snabblab-nixos@9b05d04

@wingo (Contributor, Author) commented Oct 14, 2016

Added minimum packet alignment and updated the wingo/optimize head too.

@wingo (Contributor, Author) commented Oct 14, 2016

Report with minimum alignment will be here: https://hydra.snabb.co/build/658140

@wingo (Contributor, Author) commented Oct 17, 2016

My conclusion is that, perf-wise, this doesn't regress any benchmark. The open question is its effect on lib.protocol.datagram et al. (cc @alexandergall). Are we OK with these changes? If so, let's merge; if not, let's talk :)

The context is that I'd like to get the lwaftr branch merged back to master early this week, and this is one of the few changes we made to core code.

@kbara (Contributor) commented Oct 17, 2016

This is looking nice! Minor nit: your ' -- No headroom for the shift; re-set the headroom to the default.' comment is slightly stale after the latest change: it's now no headroom, or a shift by an odd number of bytes.

@lukego (Member) commented Oct 17, 2016

Any chance of a selftest() function in core.packet to exercise the basic code paths?

Like:

  • allocate, free.
  • allocate, shift left within headroom, free.
  • allocate, shift left beyond headroom, free.
  • allocate, shift right within headroom, free.
  • allocate, shift right beyond headroom, free.

... and checking that free is putting the packets back into a canonical form i.e. resetting the headroom?
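
A sketch of what such a selftest could look like; it assumes the post-PR convention that shiftleft/shiftright return the new packet pointer, sets p.length directly for brevity, and the shift sizes are purely illustrative:

  function selftest ()
     local packet = require("core.packet")
     -- allocate, free
     packet.free(packet.allocate())
     -- allocate, shift right within headroom (pointer move), free
     local p = packet.allocate(); p.length = 100
     p = packet.shiftright(p, 14); assert(p.length == 114)
     packet.free(p)
     -- allocate, shift right beyond headroom (copying fallback), free
     p = packet.allocate(); p.length = 100
     p = packet.shiftright(p, 1024); assert(p.length == 1124)
     packet.free(p)
     -- allocate, shift left, free
     p = packet.allocate(); p.length = 100
     p = packet.shiftleft(p, 14); assert(p.length == 86)
     packet.free(p)
     -- a freshly (re)allocated packet should be back in canonical form
     p = packet.allocate(); assert(p.length == 0); packet.free(p)
  end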

@wingo (Contributor, Author) commented Oct 17, 2016

Good idea @lukego and nice catch @kbara, can do. What do you think of the datagram changes tho Luke? Any out-of-tree uses of the datagram library will be silently broken (like @alexandergall's vpn).

IMHO: breaking datagram users is unfortunate, but it is the right thing to do, as it ultimately makes encap and decap operations like VLAN tagging much cheaper; we should just own that decision.

@lukego (Member) commented Oct 17, 2016

For my part I am fine with the datagram library changes.

Causing breakage downstream is indeed regrettable. I would like to solve this by bringing more code (and tests and benchmarks) upstream so that we can take care of it while making core changes like this one. (I think this is the normal "selfish" reason that people upstream their code i.e. to share the responsibility for keeping it working.)

@wingo (Contributor, Author) commented Oct 17, 2016

Hydra run here: https://hydra.snabb.co/build/679729

@wingo (Contributor, Author) commented Oct 17, 2016

Well goll darnit it looks like that last change tanked performance. Will give it another go. Sadness.

@wingo (Contributor, Author) commented Oct 17, 2016

Actually the commit seemed to tank correctness, not performance :) I was thinking that since new_packet sets p.length, free_internal didn't have to. But the dual of free_internal is actually allocate, not new_packet :P
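
In other words (a sketch with assumed names; the freelist and headroom-reset helpers are hypothetical), free_internal has to restore the canonical state that allocate hands back out:

  local function free_internal (p)
     p = reset_headroom(p)        -- hypothetical helper: undo any pointer shift
     p.length = 0                 -- reset length here; allocate() will not
     freelist_add(packets_fl, p)  -- hypothetical freelist of reusable packets
  end

  local function allocate ()
     -- The dual of free_internal: it just pops a packet that is assumed to
     -- already be in canonical form.
     return freelist_remove(packets_fl) or new_packet()
  end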

@wingo (Contributor, Author) commented Oct 17, 2016

Perf results will be here: https://hydra.snabb.co/build/682109

@wingo (Contributor, Author) commented Oct 17, 2016

OK all good now, ptal @lukego

@lukego (Member) commented Oct 17, 2016

LGTM!

@wingo (Contributor, Author) commented Oct 17, 2016

thanks :) I will merge to wingo-next.

@kbara (Contributor) commented Oct 17, 2016

Looks worse on DPDK + configuration test 2.1.0 noind, which looks bimodal. Probably noise.

@wingo (Contributor, Author) commented Oct 17, 2016

@kbara yeah, hard to read that one indeed. The lower mode does seem to have lower probability tho

@lukego (Member) commented Oct 17, 2016

Relatedly: I recently came across the term "researcher degrees of freedom" for situations where the person interpreting the data can inadvertently make uncontrolled decisions that lead to seeing things that seem more significant than they are. I believe that I have created exactly such a situation with this version of the R report, which serves up a menu of sliced-and-diced graphs to choose from. Just by picking the most interesting graph from the report we are already on the way to seeing patterns in randomness. Have to think about a better presentation...

@kbara (Contributor) commented Oct 17, 2016

Obligatory: http://www.xkcd.com/1725/

wingo added a commit that referenced this pull request Oct 17, 2016
@kbara (Contributor) commented Oct 17, 2016

I think the current presentation is ok - we just need to bear in mind that if a couple of results look like outliers, they might well be noise, especially if they don't persist between runs.

I see a lot of value in visualizing fine-grained data, seeing where bimodal results occur, etc. No matter how we display it, we have to think about it, test whether it's reproducible at times, etc. Even boiling all of the output down to one number and no graphs wouldn't shield us from misinterpreting significance.

@wingo wingo closed this Oct 19, 2016
@wingo wingo deleted the packet-shift branch October 19, 2016 10:02
lukego pushed a commit to lukego/snabb that referenced this pull request Mar 22, 2018
Fix lj_auditlog double-inclusion and accidental globals