Economy of scale #183
@yaron2 are you interested in making a change? It would certainly be possible to make the interceptor and scaler multi-tenant if you're interested in that.
I am certainly interested in that, but it's not so much about "me" being interested as it is about every user: without a multi-tenant interceptor and external scaler, I don't see any real benefit to running the HTTP add-on, unless I'm missing something obvious?
Autoscaling your HTTP workloads would be one major benefit that persists even though you can't (currently) scale the interceptor or external scaler to zero. Regardless, yes, it would certainly be more resource-efficient to run multi-tenant interceptors and scalers.
Yeah, I'm talking specifically about the scale-to-zero part. I haven't seen 1:N autoscaling so far, but that can be achieved with the Prometheus scaler today, albeit in a more cumbersome way.
Got it. Yeah, if you're looking to truly scale resources to zero on your cluster, you couldn't use the HTTP add-on. Would you be interested in helping design and/or build these multi-tenant components?
I could probably help with design, for sure.
I haven't thought about it from that angle, and that definitely makes sense; we should go multi-tenant for sure.
@yaron2 We want to support 1:N scaling as well because we don't want to force people to use Prometheus. For example, my customers always use Azure Monitor and should not have to run Prometheus because of that.
Yeah, I understand; I totally support doing 1:N scaling. We should just make sure this is done in a way that isn't too costly in cluster resources.
@yaron2 I've begun a draft design doc: https://hackmd.io/@arschles/mutitenant-keda-http-addon. There are a few places where it's a bit rough; the biggest one is pushing routing table updates to interceptors and verifying that they've updated their in-memory copy. Let me know what you think.
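For illustration, here is a minimal sketch of what an interceptor-side routing table could look like, assuming host-based routing and a monotonically increasing version number that a control plane can poll to verify each interceptor has applied the latest pushed update. All names here are hypothetical and not the add-on's actual API:

```python
import threading


class RoutingTable:
    """Hypothetical in-memory routing table held by each interceptor.

    Maps incoming Host headers to backend Service names. The version
    counter lets a control plane confirm that this interceptor has
    applied the most recently pushed update.
    """

    def __init__(self):
        self._lock = threading.RLock()
        self._routes = {}   # host -> backend service name
        self._version = 0

    def apply_update(self, routes, version):
        """Apply a pushed update, ignoring stale (older-version) pushes."""
        with self._lock:
            if version > self._version:
                self._routes = dict(routes)
                self._version = version

    def lookup(self, host):
        """Resolve a Host header to a backend, or None if unrouted."""
        with self._lock:
            return self._routes.get(host)

    @property
    def version(self):
        with self._lock:
            return self._version


# The control plane pushes an update, then polls each interceptor's
# version to verify its in-memory copy has caught up.
table = RoutingTable()
table.apply_update({"app1.example.com": "app1-svc"}, version=1)
print(table.lookup("app1.example.com"))  # app1-svc
print(table.version)                     # 1
```

The version check is what makes the "verify they've updated" step cheap: the control plane only needs to compare integers, not diff whole tables.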
@ajanth97 raised a concern about the
@arschles I added a few comments on the document too; see if they're helpful.
Thanks @khaosdoctor!
From what I can see, for every new deployment that needs to be scaled down to zero, two additional deployments are created: an interceptor and an external scaler.
Since these deployments are not themselves scaled to zero and consume cluster resources, the benefit of scale to zero is greatly reduced at best (when the user deployment consumes as much as or more than the interceptor and external scaler combined) or completely negated at worst (when it consumes less).
In addition, when the app is not scaled to zero, a per-app interceptor and external scaler take resources that could otherwise be used to schedule new deployments, and they sit idle unless scale to zero actually occurs.
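To make the concern concrete, a back-of-the-envelope comparison of the per-app model versus a shared multi-tenant pair. The resource figures are invented purely for illustration:

```python
# Hypothetical CPU requests in millicores; not real measurements.
APP_CPU = 100          # one user workload while running
INTERCEPTOR_CPU = 50   # one interceptor deployment
SCALER_CPU = 50        # one external scaler deployment


def per_app_overhead(num_apps):
    """CPU permanently held when every app gets its own interceptor
    and external scaler (neither of which scales to zero)."""
    return num_apps * (INTERCEPTOR_CPU + SCALER_CPU)


def multi_tenant_overhead():
    """CPU held by a single shared interceptor/scaler pair that
    serves all apps."""
    return INTERCEPTOR_CPU + SCALER_CPU


# With 10 apps scaled to zero, the per-app model still holds
# 10 * (50 + 50) = 1000m, exactly what the 10 idle apps would
# have consumed, so scale to zero saves nothing in this scenario.
apps = 10
print(per_app_overhead(apps))    # 1000
print(multi_tenant_overhead())   # 100
print(apps * APP_CPU)            # 1000
```

Under these assumed numbers, the multi-tenant design reduces the always-on overhead from linear in the number of apps to a constant, which is the "economy of scale" the issue title refers to.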
These observations are based on the design doc and my understanding of the code after a brief review, so please excuse any misunderstandings.
/cc @tomkerkhove