
Pytests overhaul #569

Merged: 16 commits into dragonflydb:main on Jan 9, 2023
Conversation

@dranikpg (Contributor) commented Dec 17, 2022

Big testing overhaul


A lot of the new features that we introduce for replication, snapshotting, compression, serialization and cancellation don't have proper tests. Many of those features are also hard to test in general, under load and for corner cases.

So far the pytests have done a good job of uncovering bugs, but they used only simple commands and a single database.

The new DflySeeder can issue command sequences that converge to a target number of keys and oscillate around it once reached. It supports all main data types (strings, lists, sets, hsets, zsets) and 10 incremental commands (but can be extended to any number).

It allows creating captures on the master instance (which is expected to work faultlessly) and then comparing them to the state of different instances, showing any differences if needed.

It's designed to be efficient (fully async, parallel work on multiple dbs, pipelined requests), so Python's performance is not the bottleneck.

Example:

# Create a seeder with a target number of keys (100k) of the specified size (200), working on 5 dbs
seeder = DflySeeder(keys=100_000, value_size=200, dbcount=5)

# Stop when we are within 5% of the target number of keys (i.e. above 95_000),
# because it's probabilistic and we might never reach exactly 100_000
await seeder.run(target_deviation=0.05)

# Run 3 iterations (full batches) in stable state
await seeder.run(target_times=3)

# Create a capture
capture = await seeder.capture()

# Compare capture to replica on port 1112
assert await seeder.compare(capture, port=1112)
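
The target_deviation criterion above reflects how the seeder converges. As a rough illustration of the idea (not the actual DflySeeder internals; names and probabilities are made up), the generator can bias towards key-creating commands while below the target and balance creating and deleting commands once it gets close, so the key count oscillates around the target:

import random

def choose_action(current_keys, target_keys, target_deviation=0.05):
    # Far below the target: mostly issue commands that create keys
    if current_keys < target_keys * (1 - target_deviation):
        grow_probability = 0.85
    else:
        # Near the target: grow and shrink with equal probability,
        # so the key count oscillates around the target
        grow_probability = 0.5
    return 'grow' if random.random() < grow_probability else 'shrink'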

fixes #530

Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
@@ -0,0 +1,245 @@
import asyncio
Collaborator:
can you add here what it does?

Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
@dranikpg force-pushed the pytest-generator branch 2 times, most recently from 412e627 to c158593 on December 24, 2022 17:47
@dranikpg changed the title from "EXPERIMENT: Pytest data generator" to "Pytests overhaul" on Dec 24, 2022
Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
@@ -99,6 +105,16 @@ def create(self, **kwargs) -> DflyInstance:
self.instances.append(instance)
return instance

def start_all(self, instances):
Contributor:
Maybe we can use Python testcontainers instead of running them as subprocesses (if I understand correctly, that's what happens here: it spins up DF subprocesses). They rejected my PR for supporting DF, but you can start a DF container with the right parameters.

Contributor Author:
This is inconvenient (increased memory requirements) and requires rebuilding a container just to test a change.

assert False, str(e)
def gen_test_data():
for i in range(10):
yield "key-"+str(i), "value-"+str(i)
Contributor:
nitpick - yield f"key-{i}", f"value-{i}"
BTW, why did you remove gen_test_data?

Contributor Author:
Because it's now the only place where it would be used... Maybe I should keep it, though.

def gen_test_data(n, start=0, seed=None):
for i in range(start, n):
yield "k-"+str(i), "v-"+str(i) + ("-"+str(seed) if seed else "")
async def wait_available_async(client: aioredis.Redis):
Contributor:
In what cases is this useful? From what I know, await blocks the current task until the awaited function returns. So when will the iteration take place?

Contributor Author:
This is for waiting until an instance becomes available for queries (i.e. exits the LOADING state). It's indeed supposed to block the whole time, because the test has nothing else to do except wait for the instance to become available for comparing data.
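
As an illustration of that idea (a minimal sketch, assuming the instance answers PING with a LOADING error while it is still loading; the exact exception types and the real implementation may differ):

import asyncio
import aioredis

async def wait_available_async(client: aioredis.Redis):
    # Block until the instance exits the LOADING state and starts serving queries
    while True:
        try:
            await client.ping()
            return
        except (aioredis.ResponseError, aioredis.ConnectionError) as e:
            # Keep polling only while the error indicates the LOADING state
            if "LOADING" not in str(e):
                raise
        await asyncio.sleep(0.01)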

client = aioredis.Redis(port=port, db=target_db)
return DataCapture(await self._capture_entries(client, keys))

async def compare(self, initial_capture, port=6379):
Contributor:
You have self.port, so why not default this to self.port instead of 6379?

Comment on lines 158 to 164
('LPOP {k}', ValueType.LIST),
#('SADD {k} {val}', ValueType.SET),
#('SPOP {k}', ValueType.SET),
('HSETNX {k} v0 {val}', ValueType.HSET),
('HINCRBY {k} v1 1', ValueType.HSET),
#('ZPOPMIN {k} 1', ValueType.ZSET),
#('ZADD {k} 0 {val}', ValueType.ZSET)
Contributor Author:
The stable state currently has issues with set and zset commands (for example, SPOP pops different values), so I commented them out for now to let the tests run.

tests/dragonfly/utility.py (resolved review thread)
@romange (Collaborator) commented Dec 31, 2022 via email

Signed-off-by: Vladislav <vlad@dragonflydb.io>
Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
@dranikpg marked this pull request as ready for review January 9, 2023 07:42
@dranikpg requested review from romange and adiholden January 9, 2023 09:33
@dranikpg (Contributor Author) commented Jan 9, 2023

They now seem to pass consistently.

It takes about 3 minutes to run them fully on my machine. I reduced the tests a little, because when something fails, it usually does so already on the medium-sized ones. Otherwise testing would take an eternity 😄

Some parts are commented out; those are the commands we don't support yet. They work on Redis, though, and once we support them, we'll just uncomment those parts (SPOP, for example).

romange previously approved these changes Jan 9, 2023

@romange (Collaborator) left a comment
Vlad, it's an amazing addition to our tests and to our testing methodology!
You really raise the bar for our testing quality. I gave you a few readability comments.

tests/README.md Outdated
@@ -15,6 +15,8 @@ You can override the location of the binary using `DRAGONFLY_PATH` environment v
### Custom arguments

- use `--gdb` to start all instances inside gdb.
- use `--df arg=val` to pass custom arguments to all dragonfly instances.
Collaborator:
can you provide a full command instead of a single option?

Contributor Author:
What does a full command mean? You can use it multiple times, like `--df logtostdout --df proactor_threads=2`; I'll add this info.
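
For example, a full invocation might look like this (illustrative only; the exact pytest entry point depends on how the suite is launched locally):

pytest dragonfly --df logtostdout --df proactor_threads=2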

tests/dragonfly/utility.py: three outdated review threads (resolved)
Signed-off-by: Vladislav Oleshko <vlad@dragonflydb.io>
@dranikpg (Contributor Author) commented Jan 9, 2023

Pushed fixes to your comments and some more last-minute ones.

@dranikpg requested a review from romange January 9, 2023 13:01
@dranikpg merged commit 5ef8454 into dragonflydb:main Jan 9, 2023
@dranikpg deleted the pytest-generator branch February 27, 2023 16:39
Closes: Testing data generator for pytests
3 participants