Skip to content
This repository has been archived by the owner on Feb 9, 2021. It is now read-only.

Downloading pre-built Ruby Versions at runtime #44

Open
MSP-Greg opened this issue Dec 30, 2019 · 61 comments
Open

Downloading pre-built Ruby Versions at runtime #44

MSP-Greg opened this issue Dec 30, 2019 · 61 comments
Labels
enhancement New feature or request

Comments

@MSP-Greg
Copy link
Contributor

MSP-Greg commented Dec 30, 2019

@bryanmacfarlane

With #42 closed, I thought I'd continue here.

If one views this repo's purpose as accessing pre-built Rubies located in 'hostedtoolcache', then it is functioning as expected.

For quite a while, issues have been posted that imply that what's contained in 'hostedtoolcache' is not meeting users needs.

Given that, and given that GitHub employs several people active in the Ruby community, are there any plans for changing the current setup?

The Ruby community can certainly come up with solutions, but some at point, we/they will have some needs from Actions to make it function reasonably well. An example would be MSYS2 for Windows extension gem CI, and also building Ruby itself. Installing MSYS2 from scratch is time consuming.

If this isn't the palace to discuss this, please suggest where...

@MSP-Greg
Copy link
Contributor Author

Ping @ioquatix @eregon @tenderlove

Sorry for the ping...

@MSP-Greg
Copy link
Contributor Author

A Ruby build zip is +/- 10 MB. One thought is to have the Ruby community build Rubies and host them as packages in some repo, then provide an action that downloads and installs them...

@eregon
Copy link
Contributor

eregon commented Dec 30, 2019

I was thinking something along the same line.
We could have a separate GitHub action that just builds various Ruby versions for all virtual environments available in GitHub Action, for instance using ruby-build or ruby-install.
Then this action could just download the result from that action's repository release downloads.

@eregon
Copy link
Contributor

eregon commented Dec 31, 2019

cc @clupprich this might interest you too (as the author of https://github.com/clupprich/ruby-build-action).
I think a more convenient caching/building mechanism would be quite nice for actions, instead of a cache per repository.

@bryanmacfarlane
Copy link
Member

bryanmacfarlane commented Dec 31, 2019

Yes, avoiding having consumers build it every time would be great. Hosted machines are a fresh VM every job.

Building ruby, making it globally available and then having the setup-ruby pull the binaries just in time would offer the ability to pin to any version flexibly.

Things to note:

  • The actions/virtual-environments images effectively do this not as flexibly. I believe they only lay down a couple versions into the cache which is why I believe the defaults push a major version binding. They do it at factory disk creation (not users build time) which is an advantage but it can't possibly lay down all versions.
  • If you build and publish ruby binaries to a globally available location it would be nice to have a json file with all the metadata so the action can do semver and match versions. Nodejs does this and the setup-node action uses it.
  • self-hosted runners will benefit from job to job if downloaded into the tool cache by tool, version, platform
  • The recommended pattern in the tool actions is to match the semver supplied against the cache dir first and then against the cloud. This adds resiliency for availability / load. e.g. nodejs was having download issues and it affected all workflows since the factory disk tool cache didn't have enough versions (12 was published).
  • If you have a globally available set of bits, a CDN would be best. Nodejs didn't have a CDN as of the time 12 was released and the load caused issues.

That would mean you have to create, manage a CDN. That is unless you can lean on another packaging service like GPR etc. for a universal type package of bits. One other option would be to use releases in the ruby repo and attach binaries (see actions/runner where we do that with most recent release). The binary packages in a release are backed by CDN.

Hopefully, all that makes sense. Let me know how I can help.

@MSP-Greg
Copy link
Contributor Author

@bryanmacfarlane

Thanks for the response. I've assumed that binaries/builds would be stored in a repo's releases.

I've maintained a Ruby master MinGW for a few years for CI use. I've had it running as a cron job, three times a day. One question about using GH repo releases for storage.

The current actions/upload-release-asset isn't really clear about how to update an item to an existing release.

Will there be issues with a repo's Actions job trying to download the binary when the cron job is updating the release build binary? Maybe I should ask the question there...

@eregon
Copy link
Contributor

eregon commented Dec 31, 2019

I made a quick prototype (just a proof of concept at this point) to build Ruby versions for each GitHub virtual environment: https://github.com/eregon/ruby-install-builder/releases/tag/builds

Then I think downloading that and unpacking that for the same virtual environment should just work.

One potential issue is virtual environments/images do change over time, so that might break things if e.g., a new libssl version is used.

It's really easy for Linux and macOS, but probably quite a bit different for Windows, where I think we should likely use MSP-Greg's Windows Ruby builds.

@eregon
Copy link
Contributor

eregon commented Jan 1, 2020

I worked a bit more on this idea, and https://github.com/eregon/use-ruby-action now works with MRI, JRuby and TruffleRuby, with exact versions, on Ubuntu and macOS.
It just downloads and extracts an archive, so it's just a couple seconds to set it up.

Would an approach like that make sense for actions/setup-ruby?

@eregon eregon mentioned this issue Jan 2, 2020
@bryanmacfarlane
Copy link
Member

@eregon - thanks, I'll have a look today.

@ioquatix
Copy link

ioquatix commented Jan 2, 2020

I reviewed use-ruby-action and ruby-install-builder and I think we could adopt them. They seem well put together, have good separation of concerns, and should be reliable and "scalable" (i.e. support different versions of Ruby) into the future.

The only thing I'd suggest is perhaps supporting more Ruby versions (trivial) and perhaps having a suggested template for users to follow, not sure if this exists or not. Ideally, the default template should be super simple - I wish users don't need to specify Ruby versions, but instead could write something like: all supported releases which right now would be MRI 2.4 - 2.7, JRuby and TruffleRuby, rather than mucking around with environments. Because most gems should just use this by default.

@ioquatix
Copy link

ioquatix commented Jan 2, 2020

Sorry, my bad,I just saw https://github.com/eregon/use-ruby-action#usage which shows the usage - so my only point is perhaps just some way to make it easier for users to have a default build matrix which includes all current released versions of Ruby across all latest supported OS distributions.

@eregon
Copy link
Contributor

eregon commented Jan 2, 2020

The matrix needs to list all combinations explicitly for the workflow to understand it AFAIK, so I think it can't really be much better than:
https://github.com/eregon/ruby-install-builder/blob/f967c81fdf7097dc33fbf6ac5adeab551ae9edaa/.github/workflows/build.yml#L32-L34
Copying here for convenience:

      matrix:
        os: [ 'ubuntu-16.04', 'ubuntu-18.04', 'macos-latest' ]
        ruby: [ 'ruby-2.4.9', 'ruby-2.5.7', 'ruby-2.6.5', 'ruby-2.7.0', 'truffleruby-19.3.0', 'jruby-9.2.9.0' ]
# This also works and is a fair bit shorter:
        ruby: [ '2.4.9', '2.5.7', '2.6.5', '2.7.0', 'truffleruby', 'jruby' ]

We could accept ruby-2.4 and ruby-2.4.x (setup-ruby supports this) of course.

@ioquatix
Copy link

ioquatix commented Jan 2, 2020

What about adding -head variants?

@eregon
Copy link
Contributor

eregon commented Jan 2, 2020

What follows is just my quick opinion.
First, I think testing against MRI master is of little value, it regularly breaks and there is no CI before pushing. Also, features and changes on master are often reverted/tweaked/fixed, so it would cause a lot of failures for CI of gems.
I think JRuby doesn't really recommend testing against master, the CI is not always green either.
For TruffleRuby I think it could be useful because all tests must pass before merging to master, but we would first need nightly builds of TruffleRuby (oracle/truffleruby#1483, building TruffleRuby entirely from source is more challenging than downloading an existing release and just recompiling the openssl extension).

In practice it's probably not too hard since I think ruby-install supports building ruby-head. Not sure about jruby-head, there is https://www.jruby.org/nightly but that doesn't give a single link.

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 2, 2020

@eregon

JFYI, re ruby master, ruby-loco is set up that only the most recent passing build is used when downloading. All test suites are run from the install folder, and I also check that bin files are working correctly, or at least reporting a version (rake, gem, bundler, etc).

Granted, API changes are a different matter...

@eregon
Copy link
Contributor

eregon commented Jan 2, 2020

@MSP-Greg Indeed, if we want to support MRI head, we'd need something like that on all platforms. I think in TravisCI they had a hook triggering only if ruby's own CI passed, we could have something like that.

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 2, 2020

@eregon

ruby-loco is currently built on AppVeyor, but I expect to move it to Actions. I started with the ruby/ruby MinGW build and it's passing. Note that it's running spec tests from the install folder. Working on mswin now. Can't get test-all to pass.

Once I get ruby-loco moved, I'll try using similar code for Ubuntu & MacOS. Might have some questions though...

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 3, 2020

@bryanmacfarlane

Assuming you're not often coding in Ruby, maybe you can direct me in the correct direction.

Ubuntu and macOS have compile tools as part of the OS. Ruby on Windows is built with gcc, and uses the MSYS2 system. Also, any Ruby extension gems (gems that have c or c++ code) require the MSYS2 system for compiling on CI.

Currently, the Windows images have three copies of MSYS2, all of which are embedded in the installed Ruby versions. And, yes, three versions is a waste.

Anyway, the question is whether GitHub will reconsider this decision and add an independent MSYS2 install in future image updates. AppVeyor has an extensive set of installed MSYS2 packages (see https://ci.appveyor.com/project/MSP-Greg/appveyor-ruby). I'm not looking for all of that, but the base system and gcc would certainly be a good start.

For other disucssions, see:
https://git.luolix.topmunity/t5/GitHub-Actions/Windows-MSYS2-Ruby/m-p/30885
actions/runner-images#30

Anyway, MSYS2 is also used by langauges/applications other than Ruby.

So, where do we inquire about this?

@bryanmacfarlane
Copy link
Member

bryanmacfarlane commented Jan 7, 2020

For questions / issues on what's on the VM images (msys2 etc.), you're best to drive issues here: https://github.com/actions/virtual-environments

There's multiple issues covering the what versions of ruby are available so I think the discussion should probably be centralized there for ruby versions: https://github.com/actions/virtual-environments/issues?utf8=%E2%9C%93&q=is%3Aissue+is%3Aopen+ruby

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 7, 2020

Ok, the below is a response to something you've remove from the above post, but it may be helpful for some...

Regarding msys2, you will also have to consider what happens on self-hosted machines. setup-ruby needs to work in isolation so maybe it would fault in the single msys2??

First of all, Standard Windows Rubies (SWR's) are stand-alone, they do not need MSYS2 to run the code that they contain. All required non-Windows dlls are packaged in bin/ruby_builtin_dlls.

SWR's have OS specific code to locate the MSYS2 install, and they will find it if it's installed at C:\msys64.

I can explain more, but I've worked with a lot of repos/gems on Windows, with multiple Ruby versions, and I only have one MSYS2 install.

The one issue is that if one is compiling extension gems, one needs to be aware of library package versions. This issue is common to all OS's, an example would be OpenSSL. Older versions of Ruby use OpenSSL 1.0.2 (or earlier), newer versions use 1.1.1. If one is compiling an extension gem using OpenSSL (common with socket/web server gems), one should use the same version of OpenSSL that the host language version (Ruby) uses.

Lastly, AppVeyor has used an stand-alone MSYS2 install for a long time, and everyone seems able to work with it...

@bryanmacfarlane
Copy link
Member

bryanmacfarlane commented Jan 7, 2020

Thanks. Yeah, I removed because I realized it wasn't a runtime thing.

If the user / workflow is also compiling extension gems then they are bringing the need for msys2 so for self-hosted where they bring their own machine it would be on them to install it like any other software on a self hosted machine. Regarding the hosted images, yes, I believe it makes sense for those to have a stand alone install since it's a clean machine every time and you wouldn't want to have to pull that JIT. And for that, the virtual-environments repo is the best place to drive that.

Hope that makes sense ...

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 7, 2020

Thank you. I've worked with MSYS2 for a while, and there is no way to install it from scratch without taking quite a while, at least in terms of CI time frames.

Ruby is one thing, as they're between 10 & 30 MB compressed, MSYS2 is an order or two in magnitude larger.

EDIT: See actions/runner-images#229

@eregon
Copy link
Contributor

eregon commented Jan 7, 2020

@bryanmacfarlane wrote:

@eregon - thanks, I'll have a look today.

Did you have a chance to take a look at https://github.com/eregon/use-ruby-action?
What do you think of that approach?

I think one of the major advantages is it can make new releases of Ruby available within hours.
Relying on the toolcache probably means many days (or weeks?) to use new Ruby releases.
That action also already supports JRuby and TruffleRuby.

How do you see the future of this action? Using only the toolcache, or prebuilt rubies like use-ruby-action or a combination of both?

I think in the short time it would be nice to merge #27.
That would be the first sign for me that this repo isn't inactive.

And then I think we should add a section in the README about alternatives and the status of this action.
I think this is urgent because so many actions are now created to work around the limitations of this action.
And there is no point to duplicate efforts, rather we should join efforts in one good action.
Some example actions I could find:
https://github.com/MSP-Greg/actions-ruby by @MSP-Greg
https://github.com/clupprich/ruby-build-action by @clupprich
https://github.com/masa-iwasaki/setup-rbenv by @masa-iwasaki
https://github.com/eregon/use-ruby-action by @eregon (me)

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 7, 2020

I ran a quick 'info' workflow (see the Ruby Info step) from https://github.com/eregon/use-ruby-action, and speed is quick enough to not generate complaints.

See:

https://github.com/MSP-Greg/use-ruby-action-info/runs/374992625

@bryanmacfarlane
Copy link
Member

bryanmacfarlane commented Jan 7, 2020

Thanks. In general, getting tools JIT (as well as caching) is a good direction for this action to move.

I think this is a product question beyond just this action since our current approach is tool cache on the virtual environments and we're working with that team as well.

Adding @chrispat and @ethomson for product feedback. FYI @alepauly from VMs.

My thoughts:

  • It shouldn't be an either / or between virtual environments and pulling JIT. The VM should have some common ones and JIT pull what's not there (today it's just what's on the VM). This is what we do for node.
  • If we do pull JIT, it should be official (signed?) binaries from ruby/ruby. Metadata and downloads shouldn't come from a personal repo. For example, with node, it's an official signed distribution: https://nodejs.org/dist/index.json
  • We have specifically chosen to not overload installers. Our pattern is one setup action per versionable tool to cache by name, version, arch. e.g. setup-node has explicitly chosen to not overload npm and yarn even though they're both node and we had those discussions. They both have independent versions just like ruby and jruby have independent versions. The actions org should maintain the ruby one and the TruffleRuby owner can an action. It's OK to have actions/setup-ruby, jruby/setup-jruby and oracle/setup-truffleruby in the marketplace.
  • On a related note of overload, each release doesn't need to publish every version of ruby. If the build came from ruby/ruby (or wherever ruby is built), a single version * n platforms could be with each release
  • We would need arm versions as well.
  • We happily take contributions via PRs from forks but you have to be in github / actions to be a contrib / maintainer / admin on a repo in the actions org so that's a pretty hard stop. If an external org took over maintaining it (e.g. setup-ruby goes to ruby/ruby) it would move.

@chrispat
Copy link
Member

chrispat commented Jan 7, 2020

I have no fundamental issue with having the setup tasks handle more environment configuration which can include downloading a runtime.

One of the issues we have seen in the past with Ruby and Python is making sure things work consistently across Windows, MacOS and Linux with respect to the abiliy to download the runtime on demand. Becuase of this we have chosen to cache Ruby and Python only and not offer the download model. However, if there is a way to maintain the action in such a way that it can download Ruby on the fly for Windows, MacOS and Linux I am happy to discuss taking that PR.

As far as caching is concerned, we do have plans to break the cache module out as part of the toolkit so it can be used by other actions to add that feature automatically.

@bryanmacfarlane bryanmacfarlane changed the title Actions & Ruby Downloading pre-built Ruby Versions at runtime Jan 7, 2020
@bryanmacfarlane bryanmacfarlane added the enhancement New feature or request label Jan 7, 2020
@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 8, 2020

@ethomson

Thank you. Can't speak for the others, but I'm very happy with what you've said.

Throughout the beta until now, discussions have sometimes gotten messy, as many actions are so tightly bound to the images that there wasn't one place to discuss changes. Moving to an 'install on demand' model works for Ruby, as the binaries/builds are small enough to not hinder 'quick' CI.

I hate to ask, but what about MSYS2? It's needed to build Windows Ruby itself, and it's needed to build Windows extension gems. At present, users and some custom actions link to one of the embedded installs, but that's rather brittle. As mentioned, a current 'base', 'base-devel', and gcc would allow people to avoid the time it takes for a fresh install, and updating with specific packages doesn't take too long. Ruby builds generally use the current packages/tools.

@ethomson
Copy link
Contributor

ethomson commented Jan 8, 2020

I hate to ask, but what about MSYS2?

If this were a perfect world, what would you want us to do here? msys2 preinstalled on the Windows runners in a predictable place? c:\msys2 or something along those lines?

My fuzzy recollection -- and it's been a long time since I've wrangled with native extensions on Windows -- is that there's a particular msys2 version that we should be using? Is that right, or could we just get away with always installing the latest version of msys2? If not, does every Ruby version use a different msys2 version?

@MSP-Greg
Copy link
Contributor Author

MSP-Greg commented Jan 8, 2020

If this were a perfect world, what would you want us to do here?

Stock. Lots of it. Sorry.

As to location, C:\msys64 would be the preferred location. Windows Ruby builds 2.4 and later have OS specific code to search for it there.

it's been a long time since I've wrangled with native extensions on Windows

Depends on how log ago. Ruby 2.3 and earlier used a custom package of MSYS. Ruby 2.4 and later are using standard MSYS2. A big difference.

is that there's a particular msys2 version

I've got a custom action that updates all the MSYS2 gcc tools, and everything is building & testing fine. So, preference would be the current MSYS2 with base, base-devel, and mingw-w64-x86_64-gcc installed, then pacman -Syuu until it's updated. The MSYS2 'installer' or archive is not updated often, so one does need to update the packages.

As to anything beyond that, libyaml, zlib, openssl, ragel, etc. But, anything much more than a base install might encourage a lot of requests for package X, Y or Z.

A list of what packages AppVeyor has installed is here:
https://ci.appveyor.com/project/MSP-Greg/appveyor-ruby

I'm going to stay away from whether 32 bit should be installed. I rarely use it or test on it, but there are some people using 32 bit Ruby with old legacy apps/dlls/ActiveX/OLE things on Windows.

ping @larskanis

@eregon
Copy link
Contributor

eregon commented Jan 10, 2020

@ethomson wrote:

Yeah, I'm happy with eregon/ruby-install-builder basically becoming actions/setup-ruby v2.

That's great to hear.

Would this repo both builds Rubies and sets them up then?
I think in that case we'll probably end up with a separate branch to trigger new builds (or some other mechanism to trigger building new Rubies manually).

https://github.com/eregon/ruby-install-builder/releases now has all Ruby 2.3.0-2.7.0 releases, it was fairly easy to add them incrementally to a single GitHub release.

@MSP-Greg
Copy link
Contributor Author

It would complicate things, but we could leave actions/setup-ruby to specifically use only versions in the toolcache, and make eregon/ruby-install-builder something like actions/install-ruby. Just a thought, no strong opinion...

@eregon
Copy link
Contributor

eregon commented Jan 10, 2020

@MSP-Greg I think that would cause confusion. My comment was about building Rubies and using them is a fairly different thing, which is why I split it in two repositories for my account. But it doesn't matter, we can use the same repository and two different branches or so.

@ioquatix
Copy link

It would be awesome if I could also point it at a specific ruby version, e.g git@github.com:ioquatix/ruby#thread-selector. Is that going to be possible?

@eregon
Copy link
Contributor

eregon commented Jan 11, 2020

It would be awesome if I could also point it at a specific ruby version, e.g git@github.com:ioquatix/ruby#thread-selector. Is that going to be possible?

That's a different feature (and it would be hard to make it efficient). The pre-built Rubies mentioned here are only for official releases.
Just like I think a builder for -head versions doesn't belong here.

If you really want this, I think that repo should build its own releases and then maybe we could have a way to specify a release URL to download some Ruby from.

@eregon
Copy link
Contributor

eregon commented Jan 11, 2020

I created #48 with the details about how to get started building Rubies in this repository (since I guess not everyone here wants all the details).

@bryanmacfarlane
Copy link
Member

#49

eregon added a commit to eregon/actions-setup-ruby that referenced this issue Jan 17, 2020
* Many Ruby users will want something like that as the issues on this
  repository make very clear.
* Once actions#44 is addressed
  we can remove this.
@MSP-Greg
Copy link
Contributor Author

@ethomson

Re MSYS2, see actions/runner-images#30. I posted a sample script for installation of MSYS2. Installs it, updates it, installs a few compile tools, both 64 & 32. Installs to what I believe is considered the normal location, C:/msys64.

It doesn't have all the packages needed to build Ruby, but it has all the compile tools. I can certainly add more packages, I'm not sure what opinion is on that.

Having a more complete version in one location would be much better than the current out-of-date versions in four locations.

Off-topic, but Ruby compiles fine with VS2019. Obviously there's issues with things like fork, signals, UNIXSockets, some thread issues, etc. A few years ago, ruby/ruby ran almost no tests on Windows builds. Now, MinGW & mswin (msvc) are fully tested.

Thanks, Greg

eregon added a commit to eregon/actions-setup-ruby that referenced this issue Jan 18, 2020
* Many Ruby users will want something like that as the issues on this
  repository make very clear.
* Once actions#44 is addressed
  we can remove this.
eregon added a commit to eregon/actions-setup-ruby that referenced this issue Jan 23, 2020
* Many Ruby users will want something like that as the issues on this
  repository make very clear.
* Once actions#44 is addressed
  we can remove this.
@bryanmacfarlane
Copy link
Member

I updated the ADR with another option that would work across self hosted and other platforms and only cost more on first cache miss. It also aligns with the only officially supported distribution mechanism

#49

@eregon
Copy link
Contributor

eregon commented Feb 6, 2020

FYI, eregon/use-ruby-action has moved to ruby/setup-ruby.
Which means that action (and the builder repos) are now part of the official Ruby organization on GitHub.

@bryanmacfarlane
Copy link
Member

@eregon - Awesome. I'll follow up soon

@bryanmacfarlane
Copy link
Member

@ethomson and @chrispat

@MSP-Greg
Copy link
Contributor Author

I'm ok to close...

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

7 participants