
Reorder structure fields to put commonly-used fields first: quick wins #4747

Closed
gilles-peskine-arm opened this issue Jul 1, 2021 · 4 comments · Fixed by #5189 or #5268
Labels: enhancement, good-first-issue (Good for newcomers), size-s (Estimated task size: small (~2d))

Comments


gilles-peskine-arm commented Jul 1, 2021

There are advantages to grouping the commonly-used fields of a structure together. On Cortex-M0, accessing a field that lies within the first 128 slots of its access size (p->x when offsetof(t, x) / sizeof(x) < 128, where sizeof(x) is 1, 2 or 4) uses less code than accessing a field beyond this boundary. On platforms with a cache, putting commonly-used fields in the same cache line improves cache utilization.
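
As a rough illustration of the condition above, a build-time check can verify that a frequently accessed field stays within the cheap region. This is a generic sketch, not code from Mbed TLS: the structure, the field names, and the use of a C11 static assertion are all made up for the example, and 128 is simply the bound quoted above.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical context structure: the hot field is placed first so that
 * offsetof(t, x) / sizeof(x) stays below the limit described above. */
typedef struct {
    uint32_t hot_counter;           /* accessed on every call */
    unsigned char rarely_used[512]; /* stand-in for rarely accessed state */
} example_ctx_t;

/* Build-time check that the hot field is still reachable with a short
 * immediate offset (128 is the bound quoted in the issue description). */
_Static_assert(offsetof(example_ctx_t, hot_counter) / sizeof(uint32_t) < 128,
               "hot_counter has drifted out of the cheap-access region");
```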

Anecdotal evidence suggests that in some structures, such as the SSL context, the placement of a new field can make hundreds of bytes of difference in the size of the library. In terms of effort per byte, this is a very nice win.

Note that when reordering fields, we should avoid creating holes due to padding on typical architectures, since these holes are both wasted RAM and wasted space in the “easy access” region.
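
As a generic sketch of the padding concern (not taken from any Mbed TLS header), on a typical 32-bit ABI where uint32_t needs 4-byte alignment:

```c
#include <stdint.h>

/* Poor ordering: 3 bytes of padding after flag_a and 3 bytes of trailing
 * padding after flag_b, so sizeof(padded_t) is typically 12. */
typedef struct {
    uint8_t  flag_a;
    uint32_t counter;
    uint8_t  flag_b;
} padded_t;

/* Reordered: no interior holes and only 2 bytes of trailing padding, so
 * sizeof(compact_t) is typically 8. */
typedef struct {
    uint32_t counter;
    uint8_t  flag_a;
    uint8_t  flag_b;
} compact_t;
```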

Hanno writes:

It was r649. The logic applied there, however, was not to move commonly used fields to the top, but rather to sort them by size from smallest to biggest. This helps because the maximum immediate offset is 127 * the size of the access, so 32-bit accesses have a larger max. offset than byte accesses. E.g. if you have a struct with 128 byte fields, 64 halfword fields and 32 word fields, you can access all of them from the base of the structure through an immediate offset if and only if you order them in precisely this way.
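
Below is a sketch of the smallest-to-biggest ordering described in the quote. The type and field names are invented, and the arrays merely stand in for many individual scalar fields of each size:

```c
#include <stdint.h>

/* Smallest-to-biggest ordering: byte-sized state first, then halfwords,
 * then words, so each group stays within the immediate-offset range of
 * its own access width (per the 127 * access-size limit quoted above). */
typedef struct {
    uint8_t  byte_state[128];    /* element offsets 0..127                 */
    uint16_t halfword_state[64]; /* element offsets 128..254 (even)        */
    uint32_t word_state[32];     /* element offsets 256..380 (word-aligned) */
} sorted_by_size_t;
```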

The goal of this issue is to identify and perform some quick wins. I expect to at least move a few fields around in mbedtls_ssl_context and mbedtls_ssl_config. The wins should be visible in the sizes reported by tests/scripts/all.sh build_arm_none_eabi_gcc_m0plus.

I also expect some ideas on how to progress further: how might we measure which fields are the most commonly used? Which structures are large enough for this to matter? Observations and suggestions should be recorded and filed as a follow-up covering the less quick wins.

This should be done before the 2.2x LTS since we try not to change the ABI in an LTS.

hanno-becker commented

Note: A first step should be ordering fields by size to get as many of them as possible accessible through an immediate offset from the base. Only if that leads to the conclusion that a single immediate offset cannot cover the whole structure is it necessary to resort to reasoning about frequency of use.

gilles-peskine-arm added the size-s (Estimated task size: small (~2d)) label on Jul 4, 2021

mpg commented Jul 5, 2021

This should be done before the 2.2x LTS since we try not to change the ABI in an LTS.

I'm not sure I agree with this reasoning. I obviously agree that this is not something we want to do in an LTS, but I disagree that it follows that we should do it in 2.2x before it becomes an LTS - we could also simply not do it in 2.2x and only do it in the 3.x line.

  1. Looking at this table, we can see that r649 "optimize structure" is near the middle of the table at 344 bytes saved (in the reference configuration of the baremetal branch; it would probably be more in the default configuration, to be fair). Why prioritize it over other changes that are higher in the table and that we also can't do post-LTS, such as making more things compile-time optional (AES decrypt, resumption, TLS as opposed to DTLS...) or other changes which allow larger savings?
  2. Currently the EPIC description reads "Things we want to do to facilitate maintenance, that would be too disruptive for an LTS branch." Again, I agree this issue satisfies the second criterion, but I don't think it satisfies the first at all. I don't think we should start adding improvements to the pre-LTS EPIC just because they seem nice to have, easy to do and we can't do them post-LTS. This is a slippery slope that could easily result in the EPIC growing endlessly. I think for pre-LTS work we should really focus on making maintenance easier, and keep improvements for the 3.x line.

gilles-peskine-arm (issue author) commented

Compared to other higher wins from #3535 (comment), this one breaks the ABI (many don't because they add new compile-time options), is broadly applicable (applies to every TLS user, as opposed to making features optional which only help users who don't need these features), doesn't add a maintenance burden (many of the items on that list are things we probably won't ever do because they make the code too complex), requires little prior knowledge (most of the others require surgical intervention inside the SSL stack), and has a result that's easy to review. So it's a good candidate for a new joiner, hobbyist or Friday afternoon activity. In terms of effort vs win, it beats everything else on that list.


mpg commented Jul 6, 2021

Those are all good points indeed. Considering these reasons I agree it makes sense to have this issue in the pre-LTS EPIC without it being the start of an endless feature creep.

mpg closed this as completed in #5268 on Dec 9, 2021