
Cache ListRecords result #178

Closed
ideahitme opened this issue Apr 27, 2017 · 15 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature.
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@ideahitme

ideahitme commented Apr 27, 2017

Currently each iteration of the synchronisation loop requires external-dns to fetch the full list of records from the DNS provider. In the general case this can be avoided by caching the records in memory (a rough sketch follows at the end of this comment). We can define a cache with a lease period, which is refreshed in two scenarios:

  1. The lease period expires.
  2. Creating records fails due to a name overlap.

In the general case this should greatly reduce API usage, especially when the create API rarely fails (which never happens if the DNS provider is used solely by a single instance of external-dns). For a stable, moderately active cluster (with few ingresses/services being created or modified), external-dns would reduce its interaction with the DNS provider to the bare minimum: on average one request per lease period.

This behaviour could be made pluggable via a command-line flag, e.g. --enable-cache.
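A minimal sketch of the idea, assuming a hypothetical `CachedProvider` wrapper; the `Provider` interface and `Record` type below are illustrative stand-ins, not the actual external-dns types:

```go
// Hypothetical sketch only: the Provider interface and Record type are
// illustrative stand-ins, not the actual external-dns API.
package cache

import (
	"sync"
	"time"
)

// Record is a simplified stand-in for a DNS record.
type Record struct {
	Name, Type, Target string
}

// Provider is a stand-in for the real DNS provider abstraction.
type Provider interface {
	Records() ([]Record, error)
	ApplyChanges(changes []Record) error
}

// CachedProvider answers Records() from memory until the lease expires or a
// change submission fails (e.g. because of a name overlap).
type CachedProvider struct {
	Provider
	Lease time.Duration

	mu        sync.Mutex
	records   []Record
	refreshed time.Time
}

// Records returns the cached records while the lease is valid and only falls
// back to the wrapped provider when the cache is empty or expired.
func (c *CachedProvider) Records() ([]Record, error) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.records != nil && time.Since(c.refreshed) < c.Lease {
		return c.records, nil // cache hit: no request to the DNS provider
	}
	recs, err := c.Provider.Records()
	if err != nil {
		return nil, err
	}
	c.records, c.refreshed = recs, time.Now()
	return recs, nil
}

// ApplyChanges forwards the changes and, on failure (scenario 2 above), drops
// the cache so the next loop iteration re-lists from the provider.
func (c *CachedProvider) ApplyChanges(changes []Record) error {
	if err := c.Provider.ApplyChanges(changes); err != nil {
		c.mu.Lock()
		c.records = nil
		c.mu.Unlock()
		return err
	}
	return nil
}
```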

@ideahitme ideahitme added the kind/feature Categorizes issue or PR as related to a new feature. label Apr 27, 2017
@ideahitme
Author

/cc @linki @justinsb @iterion @hjacobs thoughts? I think this could potentially be useful in view of the AWS rate-limiting issues.

@linki
Member

linki commented Apr 27, 2017

In general 👍 But how does this differ from setting the --interval to a higher value?

@ideahitme
Author

Setting --interval to a higher value would mean users have to wait longer for their records to be created. With the cache enabled we would "try our luck" by creating (not upserting) the records, and if that fails, refresh the cache. With a lease period of 1 hour and an interval of 1 minute, each minute we would only need to send a ChangeRecords request to the DNS provider if something has changed according to the cache (without actually polling for records first). Compared to the current implementation, over one hour we could make as few as 1/60 of the requests.
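To make the arithmetic above concrete, here is a rough sketch of how the loop could use such a cache, reusing the hypothetical `CachedProvider` and `Record` from the sketch in the opening comment (the `plan` helper is a placeholder, not the real external-dns planning logic):

```go
// Rough sketch of the reconciliation loop with the cache enabled. With
// interval=1m and Lease=1h, the provider is listed at most once per hour
// unless an ApplyChanges call fails and invalidates the cache.
func syncLoop(p *CachedProvider, desired func() []Record, interval time.Duration) {
	for range time.Tick(interval) {
		current, err := p.Records() // usually served from the in-memory cache
		if err != nil {
			continue
		}
		missing := plan(desired(), current) // compute changes against cached state
		if len(missing) == 0 {
			continue // nothing to do: no request sent to the DNS provider
		}
		// Create (not upsert); a failure clears the cache so the next
		// iteration re-lists and re-plans against the provider's real state.
		_ = p.ApplyChanges(missing)
	}
}

// plan is a trivial placeholder diff: desired records not present in current.
func plan(desired, current []Record) []Record {
	seen := make(map[string]bool, len(current))
	for _, r := range current {
		seen[r.Name+"/"+r.Type] = true
	}
	var out []Record
	for _, r := range desired {
		if !seen[r.Name+"/"+r.Type] {
			out = append(out, r)
		}
	}
	return out
}
```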

@linki
Member

linki commented Apr 27, 2017

sgtm 👍

@hjacobs
Contributor

hjacobs commented Apr 27, 2017

@ideahitme what would you use as the default TTL for the cache? I think 1 hour is far too long, as it would mean that "manually" changed or deleted records would only be "restored" after one hour. IMHO the system should strive for correctness, i.e. Kubernetes state should reflect real DNS state. I guess something in the range of minutes is good enough, e.g. we could reduce the default interval to 30s and have a cache TTL of 300s (5 minutes):

  • the reduced default interval makes sure that any changes in Kubernetes (mostly triggered by users) are quickly synced to DNS ➡️ users get quick feedback
  • the 5 min cache makes sure that we stay within (AWS) API rate limits
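For illustration only, a hedged sketch of how these two knobs might be wired up; --interval exists today, --enable-cache is the flag proposed in this issue, and --cache-ttl is a hypothetical name for the lease, with the defaults suggested above:

```go
// Hypothetical flag wiring: --interval already exists in external-dns,
// --enable-cache is the flag proposed in this issue, and --cache-ttl is an
// illustrative name for the lease (defaults per this comment: 30s / 5m).
package main

import (
	"flag"
	"fmt"
	"time"
)

var (
	interval    = flag.Duration("interval", 30*time.Second, "time between sync-loop iterations")
	enableCache = flag.Bool("enable-cache", false, "serve provider records from an in-memory cache")
	cacheTTL    = flag.Duration("cache-ttl", 5*time.Minute, "how long cached records are treated as fresh")
)

func main() {
	flag.Parse()
	// With these defaults a quiet zone drops from ~120 list calls per hour
	// (one per 30s iteration) to ~12 (one per 5-minute lease).
	fmt.Println("interval:", *interval, "cache:", *enableCache, "ttl:", *cacheTTL)
}
```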

@ideahitme
Author

@hjacobs yes, 1 hour is just an example :D but even a TTL of 5 minutes would be a huge win: it minimises the number of potential clashes (after manual changes) and significantly reduces the number of AWS API requests.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 21, 2019
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels May 21, 2019
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@raravena80

raravena80 commented Sep 30, 2019

/reopen

@raxod502-plaid

The feature was never implemented, so it was inappropriate for this issue to be closed. ExternalDNS still badly needs a cache; with a large number of records in a hosted zone, it will max out Route 53 rate limits every time the sync loop runs.

@tehlers320

/reopen

does that work @raxod502-plaid ?

@k8s-ci-robot
Contributor

@tehlers320: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

does that work @raxod502-plaid ?

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

lou-lan pushed a commit to lou-lan/external-dns that referenced this issue May 11, 2022
…-sigs` (kubernetes-sigs#178)

* Change import paths from GoogleContainerTools -> kubernetes-sigs

* Replace all remaining occurances GoogleContainerTools -> kubernetes-sigs

With this commit the krew-index is also switched to kubernetes-sigs
@raxod502-plaid

does that work @raxod502-plaid ?

Sorry, would you mind clarifying what you're asking?
