
core(lantern): always use flexible network ordering #14612

Merged · 15 commits · Apr 19, 2024
Conversation

brendankenny (Member) commented Dec 14, 2022

Remove `flexibleOrdering` from the simulator and always start requests as soon as they are able to start and a connection is available. Part of #14166

Simplifies simulator code and improves lantern accuracy (with a small change to the TTI and SI coefficients)

[image] (initial image from original PR - see above instead)


The lantern simulator has two modes for connection reuse, controlled by the boolean `flexibleOrdering` (see the sketch after this list):

  • if `flexibleOrdering: false` (the default), any network request that was observed on a warm connection (`connectionReused: true`) can only start once that connection is warm in the simulation, and the `NetworkNode` is kept in the queue until whichever request was first on that connection in the unthrottled real load has completed in the simulator
  • if `flexibleOrdering: true`, `connectionReused` is not considered when deciding when network requests should start
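
A minimal sketch of what that check amounts to, assuming hypothetical helper names (`canStartRequest`, `connectionPool`, etc.) rather than the actual simulator API:

```js
/**
 * Illustrative only: whether a network request may start under each ordering
 * mode. `connectionPool` and its methods are assumed names for this sketch.
 */
function canStartRequest(networkNode, connectionPool, flexibleOrdering) {
  const record = networkNode.record;

  // Inflexible ordering: a request observed on a reused connection must wait
  // until the request that originally warmed that connection has completed in
  // the simulation.
  if (!flexibleOrdering && record.connectionReused &&
      !connectionPool.isWarm(record.connectionId)) {
    return false;
  }

  // In either mode, the request still needs a free connection to its origin.
  return connectionPool.acquireAvailableConnection(record) !== null;
}
```

With `flexibleOrdering: true` (the only behavior after this PR), `connectionReused` is ignored and a request starts as soon as it has no pending dependencies and a connection to its origin is free.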

Originally the observed connection-reuse ordering was required; `flexibleOrdering` came later, when we explicitly wanted to make requests earlier than they had occurred, as if they had been preloaded.

However, the default `flexibleOrdering: false` is a little bit weird today. It's basically an additional dependency constraint on the graph, though one not represented through the main dependency mechanism. In an ideal world, if there needs to be a dependency relationship between two requests for improved accuracy, we should make it explicit.

We're also simulating the whole page load on a different-speed machine; there's no reason to idle a connection waiting for the request that was observed to be first on that connection when other requests could be made sooner (that works against the whole reason for simulating). That's the ideal world again, though, and we may be missing key simulator features that the inflexible ordering has been making up for.

When testing to see what additional work it would take to remove inflexible ordering, I was prepared for accuracy to drop and additional dependency-graph relationships to be needed to counteract that. However, the lantern tests actually showed small improvements in accuracy in multiple audits—highest in p90 SI—except a small decrease in accuracy in p50 TTI and p95 SI.

The lantern coefficients for SI and TTI are super old, so I re-computed them from the full 9-run lantern dataset, then scaled back the changes considerably because I'm worried about overfitting to the test data. Looking at the goals laid out in #5120, TTI stays at `optimistic + pessimistic === 1` while favoring the pessimistic estimate slightly more now, and SI moves very slightly toward a total of 1 but, more importantly, can finally use an intercept of 0 (the last of the metrics to do so).
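
For reference, a lantern metric's final estimate is a blend of its optimistic and pessimistic simulations; roughly like this, with placeholder coefficient values rather than the ones landed here:

```js
// Sketch of the optimistic/pessimistic blend used by the lantern metrics.
// These coefficient values are illustrative placeholders only.
const COEFFICIENTS = {intercept: 0, optimistic: 0.4, pessimistic: 0.6};

function blendEstimates(optimisticTimeInMs, pessimisticTimeInMs, coefficients = COEFFICIENTS) {
  return coefficients.intercept +
    coefficients.optimistic * optimisticTimeInMs +
    coefficients.pessimistic * pessimisticTimeInMs;
}

// e.g. blendEstimates(3000, 5000) === 0 + 0.4 * 3000 + 0.6 * 5000 === 4200
```

Keeping `optimistic + pessimistic` at (or near) 1 with an intercept of 0 means the blended estimate is essentially a weighted average of the two simulated times rather than relying on a fitted offset.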

Combined, these two changes make major improvements to the SI accuracy. Most of that is due to the coefficient/intercept change, but it means we can simplify the simulator code and remove an artificial constraint while maintaining or improving simulator accuracy.

connorjclark (Collaborator) commented Dec 14, 2022

Nice work!

> However, the lantern tests actually showed small improvements in accuracy in multiple audits—highest in p90 SI—except a small decrease in accuracy in p50 TTI and p95 SI.

I take it that this statement is limited to metrics, without saying anything about other audits, correct?

I'm curious how you assess the changes in audits that went from inflexible to flexible. Since inflexible was the default value, from just looking at the PR diff we can only see the no-op changes from `uses-http2` and `preload-lcp-image`; nothing should have changed there. For the non-metric audits whose simulation behavior did change (are there any?), can we assess how this change impacted the results? Or do we not particularly care, and should we just limit our scope to the metrics when assessing changes to the simulator?

brendankenny (Member, Author) commented

Yeah, it's limited to the metrics. In theory, if the opportunity is the difference between the LCP of the original graph and the LCP of a modified graph, then any improvement in LCP accuracy should improve that estimate. But that's not necessarily true in practice (the dbw failure in this run looks like a good example; I need to compare the before/after to see if we were relying on coincidence before or if there's a real regression).
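
As a rough sketch of that relationship (not the audits' actual implementation; `simulator.simulate()` returning `{timeInMs}` is assumed context here):

```js
// Illustrative only: an opportunity's wasted-ms estimate as the difference
// between simulating the observed graph and simulating a graph with the
// optimization applied, clamped at zero.
function estimateWastedMs(simulator, observedGraph, optimizedGraph) {
  const observedLcpMs = simulator.simulate(observedGraph).timeInMs;
  const optimizedLcpMs = simulator.simulate(optimizedGraph).timeInMs;
  return Math.max(observedLcpMs - optimizedLcpMs, 0);
}
```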

We don't have any automated method for evaluating opportunity accuracy, but we should consider how we could build one in 2023.

We should also update the lantern test data. Somehow it's already been three years since you updated it last.

connorjclark (Collaborator) commented Dec 14, 2022

> We should also update the lantern test data. Somehow it's already been three years since you updated it last.

Ooooh boy that will be a fun exercise! 3 years of software and product churn, and the cloud, what could go wrong?

connorjclark (Collaborator) commented

We can add this for 10.0, just need to update the branch. Sorry it didn't get merged earlier.

connorjclark added the 11.0 cranked up to eleven label Jul 10, 2023
connorjclark removed the 11.0 cranked up to eleven label Jul 26, 2023
paulirish added this to the v12.0 milestone Oct 5, 2023
paulirish (Member) commented

huh we still want this right?

connorjclark (Collaborator) commented

Yeah, it needs some TLC though (conflicts).

I wanted to have the lantern traces updated before merging, but an analysis of this PR's effects could be done post hoc, so no need to wait on that.

```diff
@@ -82,7 +82,7 @@ describe('Render blocking resources audit', () => {
     const settings = {throttlingMethod: 'simulate', throttling: mobileSlow4G};
     const computedCache = new Map();
     const result = await RenderBlockingResourcesAudit.audit(artifacts, {settings, computedCache});
-    expect(result.numericValue).toEqual(316);
+    expect(result.numericValue).toEqual(0);
```
Collaborator commented:


TODO: verify this or change test.

Collaborator commented:


`const {nodeTimings} = simulator.simulate(fcpGraph);` in render-blocking-resources.js's `estimateSavingsWithGraphs` gives 2884 for script.js, whereas before it was 2100. AFAICT no other numbers in this method varied. This is enough to make the "before inline" FCP estimate always be higher than the "after inline" estimate for this trace.

The test does otherwise fail if we comment out the amp-specific handling in render-blocking-resources.js, and I think the change in the FCP estimate is to be expected from this PR, so updating the expectation to 0 here seems like the right approach.
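
For anyone re-verifying that, a hypothetical snippet along the lines of the check described above (property names such as `node.record` are assumptions about the simulation result's shape, not necessarily the exact API):

```js
// Sketch: inspect the simulated end time of script.js in the FCP graph,
// reported above as ~2100 ms before this PR and ~2884 ms after.
function logScriptTiming(simulator, fcpGraph) {
  const {nodeTimings} = simulator.simulate(fcpGraph);
  for (const [node, timing] of nodeTimings) {
    if (node.type === 'network' && node.record.url.endsWith('script.js')) {
      console.log(node.record.url, timing.startTime, timing.endTime);
    }
  }
}
```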

connorjclark (Collaborator) commented

Last remaining item is the smoke failure that only occurs in the bundled test (odd): `yarn test-bundle byte-efficiency`
