
Optimised prefix pattern per shard for remote store data and metadata files for higher throughput #12567

Closed
ashking94 opened this issue Mar 8, 2024 · 1 comment
Assignees
Labels
enhancement Enhancement or improvement to existing feature or request Storage:Performance Storage:Resiliency Issues and PRs related to the storage resiliency v2.14.0

Comments

@ashking94
Member

Is your feature request related to a problem? Please describe

With the remote store feature, we upload two kinds of files to the remote store - data and metadata - for both the translog and segments. We have #5854, which allows buffering requests and uploading them every 650 ms (default value). This works well in steady state. However, I have faced issues when running performance tests with a single index and a high number of shards.

The current path structure looks like this:

!__Base path
    !__Index path - <Base_path>/<index_uuid>/
        !__Shard path - <Index_path>/<shard_id>
            !__Segment path - <Shard_path>/segments
                !__files path - <Segment_path>/data
                    |__segments_<N>__<file-gen>
                    |__<N>.si__<file-gen>
                    |__<N>.cfe__<file-gen>
                    |__<N>.cfs__<file-gen>
                !__metadata path - <Segment_path>/metadata
                    |__metadata_path_<file-gen>_version
                !__lock path - <Segment_path>/lock_files
                    |__metadata_path_<file-gen>_version.v2_lock

            !__Translog path - <Shard_path>/translog
                !__data path - <Translog_path>/data
                    |__primary-term
                        |__translog-<gen>.tlog
                        |__translog-<gen>.ckp
                !__metadata path - <Translog_path>/metadata
                    |__metadata_prefix_gen_version

Notice that the physical layout of the data matches its logical layout. Storage providers enforce per-prefix limits on the number of GETs, PUTs, DELETEs, and LISTs, and this structure concentrates all of an index's traffic under a single fixed prefix. Those limits become a bottleneck when an index has too many shards.
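To make the bottleneck concrete, here is a minimal Python sketch (the key-builder function and the sample values are hypothetical, not OpenSearch code) showing that under the current layout every object for an index falls under one leading prefix:

```python
# Hypothetical illustration of the current fixed layout: all keys for
# an index share the same leading prefix, so provider-side per-prefix
# request limits apply to the index's entire traffic.

def current_segment_data_key(base_path, index_uuid, shard_id, file_name):
    """Build a remote-store key using the current fixed layout."""
    return f"{base_path}/{index_uuid}/{shard_id}/segments/data/{file_name}"

base_path = "my-cluster"      # illustrative value
index_uuid = "ab12cd34"       # illustrative value

keys = [current_segment_data_key(base_path, index_uuid, shard, "_0.cfs__1")
        for shard in range(100)]

# Every key starts with the same prefix, so request throughput cannot
# spread across prefixes no matter how many shards the index has.
common_prefix = f"{base_path}/{index_uuid}/"
print(all(k.startswith(common_prefix) for k in keys))  # True
```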

Describe the solution you'd like

A prefix pattern that is accepted by multiple repository providers such as AWS S3 and GCP Cloud Storage. The general recommendation from these providers is to spread data across as many prefixes as possible, which allows them to scale request throughput better.

The proposed prefix pattern is:

hash(data, index_uuid, shard_id, translog|segment)/<Base-path>/<index-uuid>/<shard-id>  → data
hash(md, index_uuid, shard_id, translog|segment)/<Base-path>/<index-uuid>/<shard-id>  → metadata

With the above prefix pattern, the prefixes are random-looking yet deterministic. For each combination of translog-data, translog-metadata, segment-data, and segment-metadata, the path is fixed and remains the same throughout the shard's life.
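A minimal Python sketch of the proposed scheme (the hash function, digest length, and sample values are illustrative assumptions; the actual implementation may choose differently):

```python
import hashlib

def hashed_prefix_key(kind, base_path, index_uuid, shard_id, category):
    """Build a remote-store key with a hashed leading prefix.

    kind: 'data' or 'md'; category: 'translog' or 'segment'.
    The hash input is fixed for a given (kind, index_uuid, shard_id,
    category) tuple, so the prefix is random-looking yet deterministic
    for the lifetime of the shard.
    """
    hash_input = f"{kind},{index_uuid},{shard_id},{category}".encode()
    # Truncated hex digest as the leading prefix (illustrative choice).
    prefix = hashlib.sha256(hash_input).hexdigest()[:8]
    return f"{prefix}/{base_path}/{index_uuid}/{shard_id}"

k1 = hashed_prefix_key("data", "my-cluster", "ab12cd34", 0, "segment")
k2 = hashed_prefix_key("data", "my-cluster", "ab12cd34", 0, "segment")
k3 = hashed_prefix_key("md", "my-cluster", "ab12cd34", 0, "segment")

print(k1 == k2)  # True: same inputs always yield the same path
# k3 carries a different leading prefix, so segment data and segment
# metadata for the same shard spread across separate prefixes.
```

Because the hash covers both the file kind (data vs. metadata) and the store category (translog vs. segment), the four streams for one shard land under four distinct prefixes, each of which the provider can scale independently.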

This recommendation is documented by multiple cloud providers:

  1. GCP Storage - https://cloud.google.com/storage/docs/request-rate#ramp-up
  2. AWS S3 - https://repost.aws/knowledge-center/http-5xx-errors-s3

Related component

Storage:Performance

Describe alternatives you've considered

No response

Additional context

No response

@ashking94 ashking94 added enhancement Enhancement or improvement to existing feature or request untriaged labels Mar 8, 2024
@ashking94 ashking94 self-assigned this Mar 8, 2024
@ashking94 ashking94 changed the title [Feature Request] Optimised prefix pattern per shard for remote store data and metadata files for higher throughput Optimised prefix pattern per shard for remote store data and metadata files for higher throughput Mar 8, 2024
@ashking94 ashking94 added the Storage:Resiliency Issues and PRs related to the storage resiliency label Mar 18, 2024
@ashking94
Member Author

Marking this closed since the feature has been successfully completed.
