How different PodAutoScaler configure RPS in Autoscale #5975

andyyin · 2019-11-08T11:41:37Z

In what area(s)?

/area autoscale

switch pa.Metric() { case autoscaling.RPS: total = config.RPSTargetDefault tu = config.TargetUtilization default: // Concurrency is used by default total = float64(pa.Spec.ContainerConcurrency) // If containerConcurrency is 0 we'll always target the default. if total == 0 { total = config.ContainerConcurrencyTargetDefault } tu = config.ContainerConcurrencyTargetFraction }

Ask your question here:

Why RPS is handled differently than Concurrency. If so, different podAutoscaler, you can not set rps separately

The text was updated successfully, but these errors were encountered:

andyyin · 2019-11-08T11:42:43Z

@yanweiguo

markusthoemmes · 2019-11-08T12:17:31Z

RPS is not subject to the containerConcurrency setting so in this case, we don't handle anything related to that in the RPS branch.

The diff of ContainerConcurrencyTargetFraction vs. TargetUtilization is simple that we're going to deprecate the former for the latter eventually. Semantically they are the same.

Does that answer your questions?

yanweiguo · 2019-11-08T17:15:14Z

Yeah as @markusthoemmes said, TargetUtilization is going to replace ContainerConcurrencyTargetFraction for concurrency usage.

However I have questions: are we going to support multiple metrics for KPA? How do we support custom metrics autoscaling? We may have to change the knobs.

andyyin · 2019-11-09T01:00:39Z

RPS is not subject to the containerConcurrency setting so in this case, we don't handle anything related to that in the RPS branch.

The diff of ContainerConcurrencyTargetFraction vs. TargetUtilization is simple that we're going to deprecate the former for the latter eventually. Semantically they are the same.

Does that answer your questions?

No, maybe I didn't explain it. I mean the current value (total and tu) of RPS is from autoscaler.Config. This configuration is a global configuration. All Revisons are the same value. How do I set different values for different Revision settings?

andyyin · 2019-11-09T01:01:46Z

Yeah as @markusthoemmes said, TargetUtilization is going to replace ContainerConcurrencyTargetFraction for concurrency usage.

However I have questions: are we going to support multiple metrics for KPA? How do we support custom metrics autoscaling? We may have to change the knobs.

This is another question for me. In the actual production environment, multiple metrics for KPA will be more reasonable.

yanweiguo · 2019-11-09T02:09:27Z

RPS is not subject to the containerConcurrency setting so in this case, we don't handle anything related to that in the RPS branch.
The diff of ContainerConcurrencyTargetFraction vs. TargetUtilization is simple that we're going to deprecate the former for the latter eventually. Semantically they are the same.
Does that answer your questions?

No, maybe I didn't explain it. I mean the current value (total and tu) of RPS is from autoscaler.Config. This configuration is a global configuration. All Revisons are the same value. How do I set different values for different Revision settings?

You can use the following annotations on revision to override:

autoscaling.knative.dev/metric: rps 
autoscaling.knative.dev/target: 200 
autoscaling.knative.dev/targetUtilizationPercentage: 70

The override happens here:

serving/pkg/reconciler/autoscaling/resources/target.go

Line 58 in 7b6e76d

target = math.Max(1, math.Min(target, annotationTarget*tu))

andyyin · 2019-11-09T03:09:03Z

RPS is not subject to the containerConcurrency setting so in this case, we don't handle anything related to that in the RPS branch.
The diff of ContainerConcurrencyTargetFraction vs. TargetUtilization is simple that we're going to deprecate the former for the latter eventually. Semantically they are the same.
Does that answer your questions?

No, maybe I didn't explain it. I mean the current value (total and tu) of RPS is from autoscaler.Config. This configuration is a global configuration. All Revisons are the same value. How do I set different values for different Revision settings?

You can use the following annotations on revision to override:
autoscaling.knative.dev/metric: rps 
autoscaling.knative.dev/target: 200 
autoscaling.knative.dev/targetUtilizationPercentage: 70 
The override happens here:

serving/pkg/reconciler/autoscaling/resources/target.go

Line 58 in 7b6e76d

target = math.Max(1, math.Min(target, annotationTarget*tu))

I understand this, but math.Min(target, annotationTarget*tu) has a limit, meaning that the final value may not be annotationTarget*tu, but config.RPSTargetDefault*config.TargetUtilization. This means that config.RPSTargetDefault and config.TargetUtilization are difficult to configure a suitable value. I think RPSTargetDefault is more suitable in pa.Spec, similar to pa.Spec.ContainerConcurrency.

andyyin · 2019-11-11T08:38:23Z

@yanweiguo

markusthoemmes · 2019-11-11T10:00:43Z

@andyyin you are right indeed! This is a bug! See #5991 for a potential fix.

andyyin added the kind/question Further information is requested label Nov 8, 2019

markusthoemmes mentioned this issue Nov 11, 2019

Make sure target annotations can exceed the configured default values. #5991

Merged

knative-prow-robot closed this as completed in #5991 Nov 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How different PodAutoScaler configure RPS in Autoscale #5975

How different PodAutoScaler configure RPS in Autoscale #5975

andyyin commented Nov 8, 2019

andyyin commented Nov 8, 2019

markusthoemmes commented Nov 8, 2019

yanweiguo commented Nov 8, 2019

andyyin commented Nov 9, 2019

andyyin commented Nov 9, 2019 •

edited

Loading

yanweiguo commented Nov 9, 2019

andyyin commented Nov 9, 2019 •

edited

Loading

andyyin commented Nov 11, 2019

markusthoemmes commented Nov 11, 2019

How different PodAutoScaler configure RPS in Autoscale #5975

How different PodAutoScaler configure RPS in Autoscale #5975

Comments

andyyin commented Nov 8, 2019

In what area(s)?

Ask your question here:

andyyin commented Nov 8, 2019

markusthoemmes commented Nov 8, 2019

yanweiguo commented Nov 8, 2019

andyyin commented Nov 9, 2019

andyyin commented Nov 9, 2019 • edited Loading

yanweiguo commented Nov 9, 2019

andyyin commented Nov 9, 2019 • edited Loading

andyyin commented Nov 11, 2019

markusthoemmes commented Nov 11, 2019

andyyin commented Nov 9, 2019 •

edited

Loading

andyyin commented Nov 9, 2019 •

edited

Loading