Download all the files associated with a package from a CDN #235
base: main
Conversation
Feedback very much noted here that loading all files might not be preferable! I'm going to take another stab at adding some graph metadata to the download API, which would then provide comprehensive package metadata that could be used to do selective downloading. I think this might be preferable. If you're interested in trying it out, I hope to have something together this weekend.
Hi @guybedford! In what cases might there be files that haven't been captured in `staticDeps` and `dynamicDeps`? I think having the download API with graph metadata capabilities would be fantastic (thanks for taking on the feedback!), but it would be quite nice if the same logic used to generate a comprehensive list of required dependency files could be applied elsewhere too.
I've taken a look into sprockets to see if I could make a similar change to what I did for Propshaft (rails/propshaft#181), but looking into the codebase, I've not been able to identify where I would make such a change (if there is a place that it would fit in sprockets at all!). Given that Rails 8 will be moving to propshaft anyway, it might not be a concern, but if anyone does know of a simple way to achieve a similar change, please let me know!
@Caleb-T-Owens to explain the difference - when preloading you only care about the exact graph that was traced - that means only those modules from the package that are used are traced, and only under those environment conditions that are set. On the other hand, a full trace of a package that works for different environment conditions and for different internal modules being imported is a slightly larger list, but still a smaller list than the full file list of the package. The benefit of using the latter for the download API is that the downloaded package folder can be cached across uses.
@guybedford Thanks for explaining. I'll move over to the download API tomorrow in anticipation of this option.
This is mostly covered by other cases, so I'm happy removing it. Ideally we'd use one HTTP/2 session to speed up the downloads.
Update! I've moved over to the download API.
As discussed, I implemented a new `exclude` option.
@guybedford Hi! Thanks for being so fast with that update. Something that would be nice, though not strictly needed, is if I could POST directly to the download endpoint.
@Caleb-T-Owens thanks for the quick feedback too, both changes are added now.
@Caleb-T-Owens if you're still finding there are too many files, I'd be interested to hear that too - we could possibly also add an env option to this exclude feature as well, which would filter to the needed conditional environments of the package (e.g. perhaps you don't want to download certain builds).
@guybedford I think I could do with some more context about what the best env option is for generate. At the minute, we're using `env: ["browser", "module"]`, which in most cases only needs the one built file (which has been built in the package developer's build step). For example, with alpine.js we are provided with a single built entrypoint file. When I use the download API for alpine.js, we get quite a bit of extra stuff provided. Seemingly, these are all the files that have been built out of your RollupJS and other steps; however, when trying different envs, I couldn't get it to give me the entrypoints of those files. Is there a better env setting that we should be using for development/production to get those particular entry points? I guess what I'm trying to wrap my head around is: if we're downloading all these extra files and adding them to our version control, we need a good justification for having them, and I can see the ability to make a second importmap (or alter our existing one) being part of that.
Sorry for the delay here @Caleb-T-Owens - moved apartments and had two work trips. So for the alpine.js case - this package does not have a package.json "exports" field. When JSPM finds a package without an "exports" field, it exposes the package's files as subpaths, which is why the download includes them all. For JSPM, we could add an override to its package.json that defines an explicit "exports" field. Alternatively, we could support a download variant that only downloads the main entry point for a package instead of subpaths - but packages that do rely on other subpathing would then break when used against that mechanic (and subpaths are actually a best practice for breaking up libraries into smaller chunks of code). Hope that explains the situation a bit better? I'm open to download API adjustments, and I'd also be more than happy to guide you through the override process if you'd like to override the alpinejs configuration here specifically.
@guybedford Thanks for the explanation, that makes sense. I don't think we'd want to cut it down more and potentially break other packages.
Skypack seems abandoned:

- skypackjs/skypack-cdn#365
- skypackjs/skypack-cdn#362

importmap-rails switched its logic to always download: rails/importmap-rails#217. But it doesn't work for complex, bigger packages. There's a WIP PR to address this: rails/importmap-rails#235. So I decided to use the new importmap-rails approach where I can, and switch to jsDelivr where I have to. I've pinned to major versions, and jsDelivr should take care of everything else. I've also updated the rake task to check for major version updates.
@Caleb-T-Owens, @guybedford what do we need to be able to finish this? It keeps biting me in the butt, and since work was already done here, I figured it's best to use this issue. Always happy to pair on the problem.
@mhenrixon the download API for JSPM is public and stable at https://jspm.org/cdn/api#download. And for cases like the package previously discussed, it is always possible to explicitly define the "exports" to reduce the file count or provide a JSPM-specific package override. I don't have the bandwidth to work directly on the integration side here, but do just let me know if I can help further at all.
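For anyone wiring up an integration, here is a minimal sketch of how a client might address that download endpoint. The `/download/<pkg>` route shape and the `exclude` query parameter are assumptions for illustration only; the authoritative contract is the JSPM API documentation linked above.

```ruby
require "uri"

# Hypothetical helper: builds the URI for a JSPM download request.
# Route shape and query parameter names are illustrative assumptions;
# see https://jspm.org/cdn/api#download for the actual contract.
def build_download_uri(packages, exclude: [])
  uri = URI("https://api.jspm.io/download/#{packages.join(',')}")
  uri.query = URI.encode_www_form("exclude" => exclude.join(",")) unless exclude.empty?
  uri
end

puts build_download_uri(["alpinejs@3.13.3"])
# prints https://api.jspm.io/download/alpinejs@3.13.3
```

Keeping URI construction pure like this makes it easy to unit test the client without hitting the network.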
This is not yet ready for review
The main focus
The main focus of this PR is to resolve the issue around packages that are composed of multiple files. Before this change, importmap-rails only looked for the entrypoint JS file and downloaded that. When it was pinning CDN links, this worked just fine because browsers would be able to resolve the relative imports inside the CDN correctly.
Now that we are downloading the entrypoint and it's hosted on our server's domain, when the browser tries to perform a relative import, it's unable to find the file.
The solution to this problem is to download all the supporting files for each package.
My choice of API
To accomplish this, there were two main options:
Firstly, we could make use of the new download API as suggested by DHH
Looking into this API, it turns out that it was returning far more files than were necessary. It appears to provide all the files that are not excluded by the package's .npmignore file. (In some of the libraries that I looked at, it was including CJS, UMD, and other non-ESM module systems.)
In an ideal world, we would only download files that get used by the library.
My second option was to use the `staticDeps` and `dynamicDeps` properties provided by the generate API that we are already using. These properties only include the files that get imported by the JS libraries, and as we already make this API call, it seemed ideal to make use of it.
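As a rough illustration of how those properties could be consumed, here is a sketch against a trimmed-down, hypothetical response shape (the real generate API response carries more fields, and the module URL below is just an example):

```ruby
require "json"

# Illustrative only: a cut-down response in roughly the shape the generate
# API returns, where staticDeps/dynamicDeps are flat lists of module URLs.
response = JSON.parse(<<~JSON)
  {
    "staticDeps": ["https://ga.jspm.io/npm:alpinejs@3.13.3/dist/module.esm.js"],
    "dynamicDeps": []
  }
JSON

# The union of both lists is every file the traced graph actually imports.
files = (response.fetch("staticDeps") + response.fetch("dynamicDeps")).uniq
puts files
```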
Update Fri 26/01/2024:
After a conversation with @guybedford, I've switched over to using the download API. This is because `staticDeps` and `dynamicDeps` can be too restrictive and may not provide enough files for our use case. @guybedford has said that he will be looking into safely cutting down the number of files provided by the download API, so we can hopefully avoid pulling down non-ESM JS files.
The new package class
While implementing the new downloads, I found that the download logic was very interwoven with the packager logic. I've landed on what I think is a fairly reasonable division of responsibility: a package is responsible for resolving the folder names and doing the downloading and removals, and a packager is responsible for manipulating `importmap.rb`.
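A minimal sketch of that division, with illustrative names rather than the exact classes in this PR:

```ruby
# Sketch only: a Package resolves where its downloaded files live; a
# Packager produces the importmap.rb edits. Class names, the vendor path,
# and the pin format are illustrative assumptions.
class Package
  attr_reader :name, :version

  def initialize(name, version, vendor_path: "vendor/javascript")
    @name = name
    @version = version
    @vendor_path = vendor_path
  end

  # Folder that holds every downloaded file for this package.
  def folder
    File.join(@vendor_path, name)
  end
end

class Packager
  # Pin line the packager would write into importmap.rb.
  def pin_for(package)
    %(pin "#{package.name}" # @#{package.version})
  end
end

pkg = Package.new("alpinejs", "3.13.3")
puts pkg.folder                  # prints vendor/javascript/alpinejs
puts Packager.new.pin_for(pkg)   # prints pin "alpinejs" # @3.13.3
```

The appeal of the split is that file-system concerns and importmap concerns can then change independently.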
I'd be interested in hearing about people's thoughts on this and would be more than happy to squash it back down into the packager class if we're not ready to take on a new abstraction.
The new JspmApi class
After starting to use the download API from JSPM today, I noticed a bunch of duplicated code between the packager class and the package class. To reduce this, I centralised the network-related code into one `JspmApi` class.
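A rough sketch of the centralisation idea, assuming hypothetical route names (`/generate`, `/download/...`) rather than the real JSPM API contract:

```ruby
require "net/http"
require "json"
require "uri"

# Sketch only: every network call funnels through one class, so HTTP, JSON
# encoding, and error handling live in a single place. The routes and
# request shapes here are illustrative assumptions.
class JspmApi
  BASE = "https://api.jspm.io"

  # Pure helper, kept separate so it can be tested without a network.
  def uri_for(path)
    URI("#{BASE}#{path}")
  end

  def generate(install:, env:)
    post("/generate", { install: install, env: env })
  end

  def download(packages)
    post("/download/#{packages.join(',')}", {})
  end

  private

  # One shared spot for the HTTP round-trip and error handling.
  def post(path, body)
    response = Net::HTTP.post(uri_for(path), body.to_json, "Content-Type" => "application/json")
    raise "JSPM API error: #{response.code}" unless response.is_a?(Net::HTTPSuccess)
    JSON.parse(response.body)
  end
end
```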
OK - so I've got a folder of downloaded JS but my browser is failing to import the modules
The big culprit here is the fingerprinting/digesting done by sprockets and propshaft.
The issue that we run into is how relative imports are handled. When a browser encounters a relative import, it resolves the URL relative to the current file. Both sprockets and propshaft, however, append a fingerprint to the end of every JS filename when serving it, but don't update the import statements inside the JS. This results in a situation where we have a file in our filesystem in the right location with the right name, but propshaft refuses to serve it.
Even though it does mean that this is currently not much of an improvement over what we've got, I think this is something that should not hold back this PR, as it's a limitation of two other packages (propshaft and importmaps).
I have made a PR for propshaft (rails/propshaft#181) that will transform all the JS `import` and `export` statements to use the digested URLs.
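To make the idea concrete, here is an illustrative sketch of that kind of transform (not the actual propshaft implementation; the digest value and helper name below are made up): rewrite relative specifiers inside import/export statements using a map from logical filename to digested filename.

```ruby
# Hypothetical digest map: logical filename => fingerprinted filename.
DIGESTS = { "utils.js" => "utils-3a94b2c1.js" }

# Rewrite "./file" specifiers in import/export statements to their
# digested names; specifiers with no digest entry are left unchanged.
def rewrite_imports(source, digests)
  source.gsub(/((?:import|export)[^"']*["'])\.\/([^"']+)(["'])/) do
    "#{Regexp.last_match(1)}./#{digests.fetch(Regexp.last_match(2), Regexp.last_match(2))}#{Regexp.last_match(3)}"
  end
end

js = %(import { debounce } from "./utils.js";)
puts rewrite_imports(js, DIGESTS)
# prints import { debounce } from "./utils-3a94b2c1.js";
```

With a transform like this in the asset pipeline, the browser's relative import resolves to a filename the server actually knows how to serve.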