feat(gatsby-source-drupal): Use the collection count from JSON:API extras to enable parallel API requests for cold builds #32883
Conversation
…tras to construct URLs

Otherwise, we have to wait to start querying each page until the previous one finishes. This change lets us query all pages in parallel. So instead of fetching one collection page at a time, we can fetch up to the maximum concurrency (default 20). For a test site with ~3200 entities, this PR dropped sourcing time from ~14s to 4s.
@KyleAMathews - should we include an addition to the README to help people discover this update? i.e. push people to enable the proper JSON:API Extras settings?
I've asked a few questions here
Yeah definitely. I'll add it once we do some more real-world testing to verify this change.
This looks great to me! Just one small comment nit.
…tras to enable parallel API requests for cold builds (#32883)

* feat(gatsby-source-drupal): Use the collection count from JSON:API extras to construct URLs. Otherwise, we have to wait to start querying each page until the previous one finishes. This change lets us query all pages in parallel. So instead of fetching one collection page at a time, we can fetch up to the maximum concurrency (default 20). For a test site with ~3200 entities, this PR dropped sourcing time from ~14s to 4s.
* Use new browser-based URL parser
* Comment code more
* Use the page size the site has set instead of assuming 50
* Use the original type that's set as that's always there
* Log out updates while sourcing
* Encourage people to enable this setting in the README
* Update gatsby-node.js
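Two of the commits above (using the site's configured page size instead of assuming 50, and reading the collection count) suggest how the first response can drive everything else. The sketch below is a rough illustration, not the plugin's actual code: it assumes JSON:API Extras' "Include count in collection queries" option adds a total count to the response's `meta` section, and the helper name is hypothetical.

```javascript
// Hypothetical helper: derive pagination info from the first collection
// page's JSON body. Assumes JSON:API Extras exposes the total entity
// count at `meta.count` (only present when that option is enabled).
function getCollectionInfo(firstPageJson) {
  const count = Number(firstPageJson.meta?.count ?? 0); // total entities in the collection
  const pageSize = firstPageJson.data.length;           // page size the site has set
  const pageCount = pageSize > 0 ? Math.ceil(count / pageSize) : 1;
  return { count, pageSize, pageCount };
}
```

With `count` and `pageSize` known after a single request, the remaining page URLs can be constructed up front rather than discovered one `links.next` at a time.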
Otherwise, we have to wait to start querying each page until the previous one finishes. This change lets us query all collection pages in parallel. So instead of fetching one collection page at a time, we can fetch up to the maximum concurrency (default 20).
For a test site with ~3200 entities and a warm Drupal cache (and no CDN cache), this PR dropped sourcing time from ~14s to 4s.
On a very large production Drupal site (~600k entities), fetching time for a cold build dropped from 2 hours to 30 minutes 🚀
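The idea described above can be sketched roughly as follows. This is not the plugin's actual implementation; the helper names are hypothetical, and it assumes JSON:API-style `page[offset]`/`page[limit]` query parameters and a known total count from JSON:API Extras.

```javascript
// Precompute every collection page URL from the total count and page size,
// instead of walking `links.next` serially.
function buildPageUrls(baseUrl, totalCount, pageSize) {
  const pageCount = Math.ceil(totalCount / pageSize);
  const urls = [];
  for (let i = 0; i < pageCount; i++) {
    const url = new URL(baseUrl); // browser-style URL parser (also in Node)
    url.searchParams.set("page[offset]", String(i * pageSize));
    url.searchParams.set("page[limit]", String(pageSize));
    urls.push(url.toString());
  }
  return urls;
}

// Minimal concurrency pool: run at most `limit` tasks at once,
// preserving result order by index.
async function runWithConcurrency(tasks, limit) {
  const results = new Array(tasks.length);
  let next = 0;
  async function worker() {
    while (next < tasks.length) {
      const i = next++; // claim the next task index synchronously
      results[i] = await tasks[i]();
    }
  }
  const workers = Array.from(
    { length: Math.min(limit, tasks.length) },
    () => worker()
  );
  await Promise.all(workers);
  return results;
}
```

Usage would look something like `runWithConcurrency(urls.map((u) => () => fetchPage(u)), 20)`, where 20 matches the default maximum concurrency mentioned above.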
TODOs