StreamManager: Add mechanism to close the request iterator #6263

verult · 2023-08-26T01:51:06Z

This PR builds on top of #6253 . Please review the last two commits.

This fix stemmed from the issue that requests were not sent after a stream breaks and reopens. The root cause is that the request iterator from the previous stream is still running, and although the Quantum Engine client doesn't actively yield from the iterator, the iterator still dequeues from the request queue behind the scenes.

This PR adds a dedicated stop signal to be sent to the request queue to signal that the iterator should stop. In addition to the issue above, this also addresses the TODO that the request iterator should be closed upon stream closing in order to send a half close to the server.

To make this fix work, I also made the request queue local to the execution and stream coroutines. Otherwise, once a user stops the manager, the queue is cleared in the duet thread while the stream coroutine tries to send a stop signal to the queue in the asyncio thread, leading to a race condition. This address another TODO.

@maffoo @wcourtney

codecov · 2023-08-26T02:59:50Z

Codecov Report

Patch coverage: 100.00% and project coverage change: -0.01% ⚠️

Comparison is base (deedb45) 97.88% compared to head (8548258) 97.88%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #6263      +/-   ##
==========================================
- Coverage   97.88%   97.88%   -0.01%     
==========================================
  Files        1104     1104              
  Lines       95760    95819      +59     
==========================================
+ Hits        93735    93790      +55     
- Misses       2025     2029       +4

Files Changed	Coverage Δ
cirq-google/cirq_google/engine/stream_manager.py	`100.00% <100.00%> (ø)`
...q-google/cirq_google/engine/stream_manager_test.py	`100.00% <100.00%> (ø)`

... and 1 file with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

maffoo · 2023-08-29T22:03:06Z

cirq-google/cirq_google/engine/stream_manager.py

+        self._executor.submit(self._init_request_queue).result()
+
+    async def _init_request_queue(self) -> None:
+        await asyncio.sleep(0)


Why this sleep? It's not needed if the only point is to make the function async, since async def is enough for that and you don't actually need an await (note this is different from generator functions which do need at least one yield to be recognized as a generator).

It's not needed if the only point is to make the function async

Did not realize that, thanks!

maffoo · 2023-08-29T22:23:00Z

cirq-google/cirq_google/engine/stream_manager.py

@@ -121,6 +121,11 @@ def __init__(self, grpc_client: quantum.QuantumEngineServiceAsyncClient):
        # interface.
        self._response_demux = ResponseDemux()
        self._next_available_message_id = 0
+        self._executor.submit(self._init_request_queue).result()


It looks like self._request_queue is only ever accessed on the duet side (in StreamManager.submit). Is this to ensure that it gets bound to the correct event loop when constructed? If so, I would suggest just submitting a job to construct the queue but then assigning it here:

# Construct queue in executor to ensure it binds to the correct event loop self._request_queue = self._executor.submit(self._make_request_queue).result() async def _make_request_queue(self) -> asyncio.Queue[QuantumRunStreamRequest]: return asyncio.Queue()

But would be good to check that this is needed. In the current implementation asyncio.Queue doesn't bind its event loop until either get or put are called so would be fine to construct it on the duet side and pass to the executor to use, but this may have changed in python 3.10 when the loop arg was removed.

Nice, thanks for the suggestion, updated.

It indeed fails if the queue were constructed in the duet thread.

maffoo · 2023-08-29T22:24:47Z

cirq-google/cirq_google/engine/stream_manager.py

 async def _request_iterator(
    request_queue: asyncio.Queue,
 ) -> AsyncIterator[quantum.QuantumRunStreamRequest]:
    """The request iterator for Quantum Engine client RPC quantum_run_stream().

    Every call to this method generates a new iterator.
    """
-    while True:
-        yield await request_queue.get()
+    while (request := await request_queue.get()) != StreamManager._STOP_SIGNAL:


I'd suggest using None as the stop signal instead, then can simplify this a bit:

Suggested change

while (request := await request_queue.get()) != StreamManager._STOP_SIGNAL:

while request := await request_queue.get():

Would also be good to explicitly type the queue:

self._request_queue: asyncio.Queue[Optional[QuantumRunStreamRequest]] = ...

Updated. I kept the _STOP_SIGNAL constant (but set it to None) because the call to request_queue.put() is more explicit about what the signal means.

cirq-google/cirq_google/engine/stream_manager.py

verult requested review from maffoo and wcourtney August 26, 2023 01:51

verult requested review from vtomole, cduck and a team as code owners August 26, 2023 01:51

CirqBot added the size: L 250< lines changed <1000 label Aug 26, 2023

maffoo reviewed Aug 29, 2023

View reviewed changes

verult requested a review from maffoo September 1, 2023 00:02

maffoo approved these changes Sep 1, 2023

View reviewed changes

cirq-google/cirq_google/engine/stream_manager.py Outdated Show resolved Hide resolved

cirq-google/cirq_google/engine/stream_manager.py Outdated Show resolved Hide resolved

verult enabled auto-merge (squash) September 7, 2023 22:03

verult disabled auto-merge September 7, 2023 22:04

verult added 5 commits September 8, 2023 22:54

Add a signal to stop the request iterator

df8b78e

Make request_queue local to asyncio coroutines

3aa31ec

Added missing raises docstring

5378cff

Addressed maffoo's comments

0bd13bf

Addressed maffoo's nits

7a7a7a9

verult force-pushed the stream-client/request-iterator-stop branch from 7734b59 to 7a7a7a9 Compare September 8, 2023 22:55

verult added 2 commits September 11, 2023 18:45

Fix failing stream_manager_test after merging

65eb644

Fix format

8548258

verult merged commit 6c14cfa into quantumlib:master Sep 11, 2023
35 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

StreamManager: Add mechanism to close the request iterator #6263

StreamManager: Add mechanism to close the request iterator #6263

verult commented Aug 26, 2023

codecov bot commented Aug 26, 2023 •

edited

Loading

maffoo Aug 29, 2023

verult Aug 31, 2023

maffoo Aug 29, 2023

verult Aug 31, 2023

maffoo Aug 29, 2023

verult Aug 31, 2023

	while (request := await request_queue.get()) != StreamManager._STOP_SIGNAL:
	while request := await request_queue.get():

StreamManager: Add mechanism to close the request iterator #6263

StreamManager: Add mechanism to close the request iterator #6263

Conversation

verult commented Aug 26, 2023

codecov bot commented Aug 26, 2023 • edited Loading

Codecov Report

maffoo Aug 29, 2023

Choose a reason for hiding this comment

verult Aug 31, 2023

Choose a reason for hiding this comment

maffoo Aug 29, 2023

Choose a reason for hiding this comment

verult Aug 31, 2023

Choose a reason for hiding this comment

maffoo Aug 29, 2023

Choose a reason for hiding this comment

verult Aug 31, 2023

Choose a reason for hiding this comment

codecov bot commented Aug 26, 2023 •

edited

Loading