Mas i350 selectivesync #1794

martinsumner · 2021-09-14T09:43:02Z

Add selective sync to Riak KV. Prior to this change either all vnodes sync'd a PUT (i.e. flushed to disk as part of accepting the PUT) or no vnodes. With no sync the flush decision is controlled by either the database (bitcask, eleveldb) or the operating system (leveled).

This changes allows for three options:

backend - rely on backend configuration as now
all - force a sync on all vnodes regardless of backend configuration
one - force a sync only on the coordinating vnode

The sync option can be set on a per-bucket basis, and over-written on a per-write basis. This provides much more flexibility.

Previously customer who wanted to make sure data was flushed to disk in a least one place for some buckets, had to set backend sync settings to sync, and sync to all vnodes on every write on every bucket. Throughput between sync and non-sync configurations have been discovered to be large even with hardware acceleration (e.g. flash-backed write caches) - see martinsumner/leveled#350

backend (use backend's config value) one (sync co-ordinating write only, or backend config value if set to sync there) all (sync all writes irrespective of backend config value)

Fixed error in spec on riak_kv_wm_object:malformed_custom_param

Normalised sync_on_write as atom

…50-selectivesync

to xref/dialyzer/eunit pass

Default bucket types required

martinsumner · 2021-09-14T09:52:23Z

This PR adds selective sync support only with leveled and eleveldb backends (there is a backend_capability called flush_put and only backends with this capability receive non-standards PUT requests).

Adding support for the bitcask backend is trivial, as the bitcask:sync/1 just needs to be called after a put when flush_put/4 is used. This would have no change to any existing bitcask user, as the default bucket configuration is to revert to backend as now.

@martincox - would you be OK if bitcask support was added to this PR? Normally I try and avoid bitcask changes, to avoid any risk for you guys. The only change required will be in riak_kv_bitcask_backend (not in the actual bitcask repo). Although you may not use the feature, it might be easier to implement across all backends rather than manage the ongoing caveat that it only works in some backends

martinsumner · 2021-09-14T09:53:04Z

basho/riak#1080

Sync straight after a PUT, if strategy is not already to o_sync.

martinsumner · 2021-09-15T16:53:12Z

basho/riak_test#1358

ThomasArts

Some smaller comments, but did not find serious issues

src/riak_kv_bucket.erl

ThomasArts · 2021-09-22T12:30:07Z

src/riak_kv_eleveldb_backend.erl

+    %% Setup write options...
+    WriteOpts2 = case Sync of
+        true ->
+            lists:keyreplace(sync,1,WriteOpts, {sync,Sync});


Observe that keyreplace does not add sync in this parameter is missing in the WriteOpts. Is that really what you want?

This is not equal to keydelete(sync, 1, WriteOpts) ++ [{sync, Sync}], which intuitively one probably wants.

This relies on the presence of sync in the eleveldb cuttlefish:

https://github.com/basho/eleveldb/blob/develop-3.0/priv/eleveldb.schema#L32-L39

which then becomes part of the write_opts at startup:

https://github.com/basho/riak_kv/blob/mas-i350-selectivesync/src/riak_kv_eleveldb_backend.erl#L633-L635

So it will always be there, base don the current logic. However, this does appear to be quite a brittle chain.

src/riak_kv_eleveldb_backend.erl

ThomasArts · 2021-09-22T12:32:37Z

src/riak_kv_eleveldb_backend.erl

+                 {ok, state()} |
+                 {error, term(), state()}.
+flush_put(Bucket, PrimaryKey, IndexSpecs, Val, State) ->
+    Sync = true,


check comment in leveled_backend, probably just write the boolean as argument

src/riak_kv_leveled_backend.erl

ThomasArts · 2021-09-22T13:12:08Z

src/riak_kv_util.erl

+expand_sync_on_write(default, BucketProps) ->
+    normalize_value(get_bucket_option(sync_on_write, BucketProps));
+expand_sync_on_write(Value, _BucketProps) ->
+    Value.


should you not normalise here as well?

I think normalize_value may be unnecessary in both cases due to validation at the API/riak_kv_bucket

src/riak_kv_vnode.erl

ranisen and others added 11 commits July 25, 2017 11:59

Added sync_on_write bucket property with values:

fcbe341

backend (use backend's config value) one (sync co-ordinating write only, or backend config value if set to sync there) all (sync all writes irrespective of backend config value)

Changes to calulate and pass sync value through to eleveldb backend.

af8940f

Converted selective sync solution to a new flush_put capability

2326fa3

Cleaned up modification of writeops list

33bc225

Fixed dialyzer ignore warnings for clean develop-2.2 branch.

921b914

Fixed error in spec on riak_kv_wm_object:malformed_custom_param

Addressed code review issues.

cd42b2e

Fixed sync_on_write put option not being passed through

62f0022

Normalised sync_on_write as atom

Merge remote-tracking branch 'ramensen/rs-selective-sync' into mas-i3…

c87d44b

…50-selectivesync

Resolve merge issues

f48a0f4

to xref/dialyzer/eunit pass

Add leveled support for selective sync

73fc641

Switch to update riak_core

877d68c

Default bucket types required

Add flush_put support for selective-sync

6072756

Sync straight after a PUT, if strategy is not already to o_sync.

ThomasArts reviewed Sep 22, 2021

View reviewed changes

Update following review

921cce0

martinsumner merged commit 174a737 into develop-3.0 Oct 6, 2021

martinsumner deleted the mas-i350-selectivesync branch October 6, 2021 10:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mas i350 selectivesync #1794

Mas i350 selectivesync #1794

martinsumner commented Sep 14, 2021

martinsumner commented Sep 14, 2021

martinsumner commented Sep 14, 2021

martinsumner commented Sep 15, 2021

ThomasArts left a comment

ThomasArts Sep 22, 2021

martinsumner Sep 23, 2021

ThomasArts Sep 22, 2021

ThomasArts Sep 22, 2021

martinsumner Sep 23, 2021

Mas i350 selectivesync #1794

Mas i350 selectivesync #1794

Conversation

martinsumner commented Sep 14, 2021

martinsumner commented Sep 14, 2021

martinsumner commented Sep 14, 2021

martinsumner commented Sep 15, 2021

ThomasArts left a comment

Choose a reason for hiding this comment

ThomasArts Sep 22, 2021

Choose a reason for hiding this comment

martinsumner Sep 23, 2021

Choose a reason for hiding this comment

ThomasArts Sep 22, 2021

Choose a reason for hiding this comment

ThomasArts Sep 22, 2021

Choose a reason for hiding this comment

martinsumner Sep 23, 2021

Choose a reason for hiding this comment