Support for EC2 Spot Best practices - Diversification of instances using ABS #1400

ruecarlo · 2021-11-09T15:32:20Z

Currently, the lambda in charge of scaling up nodes (including Spot configurations) uses the RunInstances API to create a Spot instance. Spot instances are spare capacity, so the availability of any single instance type may be limited. The best approach when using Spot is to diversify across a set of instance types that qualify for the workload, and to use one of the APIs that allows for that diversification.

The suggestion here is to replace the RunInstances call with its drop-in replacement: EC2 Fleet in instant mode with the Spot capacity-optimized allocation strategy. EC2 Fleet allows diversification and still provides a synchronous API. It adheres to Spot best practices by selecting the Spot instance types that minimise the frequency of interruptions for the workload. More examples here.
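As a rough illustration of the suggestion above, here is a minimal sketch of the request such a call could take. The interfaces are local stand-ins that mirror a subset of the EC2 CreateFleet request fields. The function name `buildFleetRequest`, the launch template name, the instance types and the subnet IDs are all hypothetical, not taken from this repository.

```typescript
// Local types mirroring a subset of the EC2 CreateFleet request (not the SDK types).
interface FleetOverride {
  InstanceType: string;
  SubnetId: string;
}

interface CreateFleetRequest {
  Type: 'instant';
  LaunchTemplateConfigs: {
    LaunchTemplateSpecification: { LaunchTemplateName: string; Version: string };
    Overrides: FleetOverride[];
  }[];
  SpotOptions: { AllocationStrategy: 'capacity-optimized' };
  TargetCapacitySpecification: {
    TotalTargetCapacity: number;
    DefaultTargetCapacityType: 'spot' | 'on-demand';
  };
}

// Hypothetical helper: diversify across every instance type / subnet combination.
function buildFleetRequest(
  launchTemplateName: string,
  instanceTypes: string[],
  subnetIds: string[],
  count: number = 1,
): CreateFleetRequest {
  const overrides: FleetOverride[] = [];
  for (const instanceType of instanceTypes) {
    for (const subnetId of subnetIds) {
      overrides.push({ InstanceType: instanceType, SubnetId: subnetId });
    }
  }
  return {
    // 'instant' makes the call synchronous, like RunInstances.
    Type: 'instant',
    LaunchTemplateConfigs: [
      {
        LaunchTemplateSpecification: { LaunchTemplateName: launchTemplateName, Version: '$Default' },
        Overrides: overrides,
      },
    ],
    SpotOptions: { AllocationStrategy: 'capacity-optimized' },
    TargetCapacitySpecification: {
      TotalTargetCapacity: count,
      DefaultTargetCapacityType: 'spot',
    },
  };
}
```

The returned object would be passed to the SDK's create fleet call. The more overrides the request lists, the more capacity pools the allocation strategy can choose from.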

Another thing to consider in the implementation is the newly released attribute-based instance selection.
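With attribute-based instance selection, a fleet override describes the required vCPU and memory ranges instead of naming an instance type. The sketch below shows one way such an override could be built. The types are local stand-ins for the `InstanceRequirements` part of the CreateFleet request, and the helper name, subnet ID and numeric ranges are hypothetical.

```typescript
// Local types mirroring a subset of the CreateFleet override shape (not the SDK types).
interface InstanceRequirements {
  VCpuCount: { Min: number; Max?: number };
  MemoryMiB: { Min: number; Max?: number };
}

interface AttributeBasedOverride {
  SubnetId: string;
  // An override carries either InstanceType or InstanceRequirements, not both,
  // so this type deliberately has no InstanceType field.
  InstanceRequirements: InstanceRequirements;
}

// Hypothetical helper: describe the workload by size rather than by instance type.
function requirementsOverride(
  subnetId: string,
  minVcpu: number,
  maxVcpu: number,
  minMemoryMiB: number,
): AttributeBasedOverride {
  if (minVcpu > maxVcpu) {
    throw new Error('minVcpu must not exceed maxVcpu');
  }
  return {
    SubnetId: subnetId,
    InstanceRequirements: {
      VCpuCount: { Min: minVcpu, Max: maxVcpu },
      MemoryMiB: { Min: minMemoryMiB },
    },
  };
}
```

For example, `requirementsOverride('subnet-a', 2, 4, 4096)` asks for any instance type with 2 to 4 vCPUs and at least 4 GiB of memory, leaving the choice of type to EC2.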


npalm · 2021-12-11T12:46:53Z

I did a short experiment, see gist. It creates the instances but causes 2 issues:

1. It also tries to launch instances in the default VPC, which fails since the required security group is not available in the default VPC. I have not figured out how to avoid this attempted creation. The result of createFleet contains 1 instance in a valid subnet, but also the error mentioned before.
2. Creating the SSM properties does not work. I have not investigated this at all yet.
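Because an instant fleet can return launched instances and errors in the same response, as observed in the experiment above, the caller probably needs to handle partial success. Below is a minimal sketch of splitting such a response. The types mirror a subset of the CreateFleet response, and the helper name `summarizeFleetResponse` is hypothetical.

```typescript
// Local types mirroring a subset of the EC2 CreateFleet response (not the SDK types).
interface FleetResponse {
  Instances?: { InstanceIds?: string[] }[];
  Errors?: { ErrorCode?: string; ErrorMessage?: string }[];
}

interface FleetOutcome {
  instanceIds: string[];
  errors: string[];
}

// Hypothetical helper: collect launched instance IDs and errors side by side,
// so one failed override does not hide instances that did start.
function summarizeFleetResponse(response: FleetResponse): FleetOutcome {
  const instanceIds: string[] = [];
  for (const group of response.Instances ?? []) {
    for (const id of group.InstanceIds ?? []) {
      instanceIds.push(id);
    }
  }
  const errors = (response.Errors ?? []).map(
    (e) => `${e.ErrorCode ?? 'Unknown'}: ${e.ErrorMessage ?? ''}`,
  );
  return { instanceIds, errors };
}
```

A caller could then register the runners for `instanceIds` and log `errors` as warnings, rather than treating the whole call as failed.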

npalm · 2021-12-11T12:50:11Z

Replacing the runInstance API call with createFleet also means the dynamic launch templates can be removed. Finding the right Spot (or on-demand) instances should be moved to this API call.

terraform-aws-github-runner/modules/runners/main.tf

Line 57 in dbba705

count = length(local.instance_types)

npalm · 2021-12-11T12:52:55Z

Currently we specify in the launch template, via the market_type option, whether we create a Spot or an on-demand instance. Maybe we should also move this logic to the API call for creating a fleet.
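One way this could look, as a sketch only: the market type becomes the default target capacity type of the fleet request instead of a launch template setting. The type and function names below are hypothetical; the field names follow the `TargetCapacitySpecification` part of the CreateFleet request.

```typescript
type MarketType = 'spot' | 'on-demand';

// Local type mirroring a subset of the CreateFleet request (not the SDK type).
interface TargetCapacitySpecification {
  TotalTargetCapacity: number;
  DefaultTargetCapacityType: MarketType;
}

// Hypothetical helper: express the former market_type option per API call.
function targetCapacityFor(marketType: MarketType, count: number): TargetCapacitySpecification {
  if (!Number.isInteger(count) || count < 1) {
    throw new Error('count must be a positive integer');
  }
  return {
    TotalTargetCapacity: count,
    DefaultTargetCapacityType: marketType,
  };
}
```

With this shape, a single launch template can serve both Spot and on-demand runners, since the choice is made at call time.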

npalm · 2021-12-21T20:25:22Z

I have a local working POC ready and will implement it asap. It will replace the loop for creating instances and add a fallback to on-demand instances.
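The fallback mentioned above could be sketched roughly as follows. This is not the POC itself, only an assumed shape: `launch` is a hypothetical injected function that wraps the fleet call for a given market, and the rule "fall back when Spot returned no instances" is an assumption made for illustration.

```typescript
type Market = 'spot' | 'on-demand';

interface FleetResult {
  instanceIds: string[];
}

// Assumed rule: retry on-demand only if a Spot request produced no instances.
function shouldFallBackToOnDemand(
  requested: Market,
  result: FleetResult,
  fallbackEnabled: boolean,
): boolean {
  return fallbackEnabled && requested === 'spot' && result.instanceIds.length === 0;
}

// `launch` is a hypothetical wrapper around the create fleet call.
async function launchWithFallback(
  launch: (market: Market) => Promise<FleetResult>,
  fallbackEnabled: boolean,
): Promise<FleetResult> {
  const first = await launch('spot');
  if (shouldFallBackToOnDemand('spot', first, fallbackEnabled)) {
    return launch('on-demand');
  }
  return first;
}
```

Keeping the decision in a small pure function makes it easy to unit test and to swap for a different rule later, for example one based on specific error codes.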

npalm added the good first issue and help wanted labels Nov 9, 2021

npalm mentioned this issue Dec 2, 2021

Quesiton: Runner based on label #73

Closed

ScottGuymer self-assigned this Dec 10, 2021

npalm assigned npalm and unassigned ScottGuymer Dec 21, 2021

npalm mentioned this issue Dec 23, 2021

feat: Replace run instance API by create fleet API #1556

Merged

npalm linked a pull request Dec 23, 2021 that will close this issue

feat: Replace run instance API by create fleet API #1556

Merged

npalm mentioned this issue Dec 31, 2021

Feat/support fleet api #1576

Closed

npalm closed this as completed in #1556 Jan 5, 2022
