[CCR] Auto follow patterns #33007

martijnvg · 2018-08-21T03:07:07Z

Tasks

Description

Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices.

The auto follow patterns are managed via a put auto follow api:

PUT /_ccr/_autofollow/{{remote_cluster}}
{
   "leader_index_pattern": ["logs-*"], 
   "follow_index_pattern": "{{leader_index}}-copy",
   "max_concurrent_read_batches": 2
   ... // other optional parameters
}

The follow index name used defaults the the leader index name. In certain cases (e.g. follow an index in the same cluster) this is unwanted and the follow_index_pattern parameter can be used to pick a different name.

This api will also support other parameters (max_concurrent_read_batches etc.) that the create_and_follow api supports. These parameters will be used instead of the defaults when the auto follow feature is invoking the create_and_follow api.

and delete auto follow api:

DELETE /_ccr/_autofollow/{{remote_cluster_alias}}

The auto follow patterns are stored as custom metadata in the cluster state.

The follow cluster should have a component that periodically checks the cluster states of multiple leader clusters (depends on the number of remote cluster aliases being followed) whether new indices have been created that match with patterns specified in the put autofollow api. If that is the case then this component invokes the create_and_follow api for each matching new index. The follow cluster will use the cluster state api to fetch cluster states from leader clusters. How often this component checks remote clusters for newly created indices dependents on the a poll interval setting (‘ccr.auto_follow.poll_interval’).

The component needs to keep track for what indices it already invoked the create_and_follow api for. The UUID of these indices should also be saved in the auto follow custom metadata. If a new a new pattern is added then the component should not follow existing indices matching this pattern, only indices created after this pattern was added to the auto_follow_patterns setting. This is achieved by including the index UUID of already created indices to the autofollow custom metadata (without actually following these indices). The component also need to keep track of indices in leader cluster that were auto followed and then removed. These index uuids need to be pruned in the custom index metadata.

The component can be implemented by a simple task that runs on the elected master node. In the background it schedules a task (ThreadPool#schedule(...)) that checks whether new leader indices need to be followed in remote clusters.

Relates to #30086 -

The text was updated successfully, but these errors were encountered:

elasticmachine · 2018-08-21T03:07:08Z

Pinging @elastic/es-distributed

martijnvg · 2018-08-21T14:41:46Z

I've updated the description of this issue to remove the fact that auto follow patterns are stored as dynamic cluster settings and use dedicated apis to manage auto follow patterns instead.

When auto follow patters are updated then existing leader indices matching with these patterns need to marked as followed (it is expected that only newly created indices should be followed automatically). In order to do this remote calls need to be made and there is no opportunity to do this in when a settings update consumer gets executed. So it is better to manage auto follow patters via dedicated apis.

I hoped that dynamic cluster settings would be enough, but this turned out to be not the case.

Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the the cluster state of remote clusters if there new leader indices that match with configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to elastic#33007

Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the the cluster state of remote clusters if there new leader indices that match with configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to #33007 Co-authored-by: Jason Tedor jason@tedor.me

Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the the cluster state of remote clusters if there new leader indices that match with configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to #33007 Co-authored-by: Jason Tedor <jason@tedor.me>

Relates to elastic#33007

The following stats are being kept track of: 1) The total number of times that auto following a leader index succeed. 2) The total number of times that auto following a leader index failed. 3) The total number of times that fetching a remote cluster state failed. 4) The most recent 256 auto follow failures per auto leader index (e.g. create_and_follow api call fails) or cluster alias (e.g. fetching remote cluster state fails). Each auto follow run now produces a result that is being used to update the stats being kept track of in AutoFollowCoordinator. Relates to elastic#33007

Relates to #33007

…cs (#33684) The following stats are being kept track of: 1) The total number of times that auto following a leader index succeed. 2) The total number of times that auto following a leader index failed. 3) The total number of times that fetching a remote cluster state failed. 4) The most recent 256 auto follow failures per auto leader index (e.g. create_and_follow api call fails) or cluster alias (e.g. fetching remote cluster state fails). Each auto follow run now produces a result that is being used to update the stats being kept track of in AutoFollowCoordinator. Relates to #33007

GET /_ccr/auto_follow/stats Returns: ``` { "number_of_successful_follow_indices": ... "number_of_failed_follow_indices": ... "number_of_failed_remote_cluster_state_requests": ... "recent_auto_follow_errors": [ ... ] } ``` Relates to elastic#33007

Relates to elastic#33007

GET /_ccr/auto_follow/stats Returns: ``` { "number_of_successful_follow_indices": ... "number_of_failed_follow_indices": ... "number_of_failed_remote_cluster_state_requests": ... "recent_auto_follow_errors": [ ... ] } ``` Relates to #33007

Relates to #33007

GET /_ccr/auto_follow/stats Returns: ``` { "number_of_successful_follow_indices": ... "number_of_failed_follow_indices": ... "number_of_failed_remote_cluster_state_requests": ... "recent_auto_follow_errors": [ ... ] } ``` Relates to #33007

…te cluster and replaced poll interval setting with a hardcoded poll interval. The hard coded interval will be removed in a follow up change to make use of cluster state API's wait_for_metatdata_version. Originates from elastic#35895 Relates to elastic#33007

…ster (#36031) and replaced poll interval setting with a hardcoded poll interval. The hard coded interval will be removed in a follow up change to make use of cluster state API's wait_for_metatdata_version. Before the auto following was bootstrapped from thread pool scheduler, but now auto followers for new remote clusters are bootstrapped when a new cluster state is published. Originates from #35895 Relates to #33007

Changed AutofollowCoordinator makes use of the wait_for_metadata_version feature in cluster state API and removed hard coded poll interval. Originates from elastic#35895 Relates to elastic#33007

The auto follow coordinator keeps track of the UUIDs of indices that it has followed. The index UUID strings need to be cleaned up in the case that these indices are removed in the remote cluster. Relates to elastic#33007

The auto follow coordinator keeps track of the UUIDs of indices that it has followed. The index UUID strings need to be cleaned up in the case that these indices are removed in the remote cluster. Relates to #33007

…36264) Changed AutofollowCoordinator makes use of the wait_for_metadata_version feature in cluster state API and removed hard coded poll interval. Originates from #35895 Relates to #33007

The AutoFollowCoordinator should be resilient to the fact that the follower index has already been created and in that case it should only update the auto follow metadata with the fact that the follower index was created. Relates to elastic#33007

…36264) Changed AutofollowCoordinator makes use of the wait_for_metadata_version feature in cluster state API and removed hard coded poll interval. Originates from #35895 Relates to #33007

For each remote cluster the auto follow coordinator, starts an auto follower that checks the remote cluster state and determines whether an index needs to be auto followed. The time since last auto follow is reported per remote cluster and gives insight whether the auto follow process is alive. Relates to elastic#33007 Originates from elastic#35895

) For each remote cluster the auto follow coordinator, starts an auto follower that checks the remote cluster state and determines whether an index needs to be auto followed. The time since last auto follow is reported per remote cluster and gives insight whether the auto follow process is alive. Relates to #33007 Originates from #35895

…with soft deletes disabled Currently if a leader index with soft deletes disabled is auto followed then this index is silently ignored. This commit changes this behaviour to mark these indices as auto followed and report an error, which is visible in auto follow stats. Marking the index as auto follow is important, because otherwise the auto follower will continuously try to auto follow and fail. Relates to elastic#33007

…with soft deletes disabled (#36886) Currently if a leader index with soft deletes disabled is auto followed then this index is silently ignored. This commit changes this behavior to mark these indices as auto followed and report an error, which is visible in auto follow stats. Marking the index as auto follow is important, because otherwise the auto follower will continuously try to auto follow and fail. Relates to #33007

The AutoFollowCoordinator should be resilient to the fact that the follower index has already been created and in that case it should only update the auto follow metadata with the fact that the follower index was created. Relates to #33007

martijnvg · 2018-12-24T09:32:58Z

All tasks have been implemented 🎉

martijnvg added Meta :Distributed Indexing/CCR Issues around the Cross Cluster State Replication features labels Aug 21, 2018

martijnvg self-assigned this Aug 21, 2018

martijnvg mentioned this issue Aug 24, 2018

[CCR] Added auto follow patterns feature #33118

Merged

elasticmachine mentioned this issue Sep 6, 2018

Introduce cross-cluster replication #30086

Closed

29 tasks

jasontedor mentioned this issue Sep 7, 2018

Add license checks for auto-follow implementation #33496

Merged

martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Sep 7, 2018

[CCR] Make auto follow patterns work with security

e1a0438

Relates to elastic#33007

martijnvg mentioned this issue Sep 7, 2018

[CCR] Make auto follow patterns work with security #33501

Merged

martijnvg mentioned this issue Sep 13, 2018

[CCR] Changed AutoFollowCoordinator to keep track of certain statistics #33684

Merged

martijnvg added a commit that referenced this issue Sep 17, 2018

[CCR] Make auto follow patterns work with security (#33501)

481f8a9

Relates to #33007

martijnvg added a commit that referenced this issue Sep 17, 2018

[CCR] Make auto follow patterns work with security (#33501)

b3962c1

Relates to #33007

martijnvg mentioned this issue Sep 18, 2018

[CCR] Add auto follow stats api #33801

Merged

martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Sep 19, 2018

[CCR] Add get auto follow pattern api

94b5c02

Relates to elastic#33007

martijnvg mentioned this issue Sep 19, 2018

[CCR] Add get auto follow pattern api #33849

Merged

martijnvg added a commit that referenced this issue Sep 24, 2018

[CCR] Add get auto follow pattern api (#33849)

2795ef5

Relates to #33007

martijnvg added a commit that referenced this issue Sep 24, 2018

[CCR] Add get auto follow pattern api (#33849)

c6ea38f

Relates to #33007

martijnvg mentioned this issue Nov 26, 2018

[CCR] Refactore auto follow coordinator #35895

Closed

martijnvg mentioned this issue Nov 29, 2018

Refactor AutoFollowCoordinator to track leader indices per remote cluster #36031

Merged

martijnvg mentioned this issue Dec 5, 2018

[CCR] Change AutofollowCoordinator to use wait_for_metadata_version #36264

Merged

martijnvg mentioned this issue Dec 9, 2018

[CCR] Clean followed leader index UUIDs in auto follow metadata #36408

Merged

martijnvg mentioned this issue Dec 12, 2018

[CCR] AutoFollowCoordinator and follower index already created #36540

Merged

martijnvg mentioned this issue Dec 12, 2018

[CCR] Add time since last auto follow fetch to auto follow stats #36542

Merged

martijnvg mentioned this issue Dec 20, 2018

[CCR] Report error if auto follower tries auto follow a leader index with soft deletes disabled #36886

Merged

martijnvg closed this as completed Dec 24, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CCR] Auto follow patterns #33007

[CCR] Auto follow patterns #33007

martijnvg commented Aug 21, 2018 •

edited

Loading

elasticmachine commented Aug 21, 2018

martijnvg commented Aug 21, 2018

martijnvg commented Dec 24, 2018

[CCR] Auto follow patterns #33007

[CCR] Auto follow patterns #33007

Comments

martijnvg commented Aug 21, 2018 • edited Loading

Tasks

Description

elasticmachine commented Aug 21, 2018

martijnvg commented Aug 21, 2018

martijnvg commented Dec 24, 2018

martijnvg commented Aug 21, 2018 •

edited

Loading