Split up `SyncClient` and fix bandwidth tracking #85

bjester · 2020-07-02T23:24:49Z

Summary

Splits up the SyncClient class into a base class and two classes for each sync direction, PullClient and PushClient. This will enable us to better handling any database locking that should occur during the sync stages. I've left a small wrapper class in place for SyncClient such that this change should be backwards compatible for now.

This also fixes bandwidth tracking by attempting to track the response body size if there isn't a Content-Length header. It also now tracks the headers to get a better estimate of the entire request/response size.

Additionally, this fixes request exception handling in the SessionWrapper. Firstly, it was casting errors to strings which the errors being captured don't have a __str__ method defined, so it now simply logs the error class name. Also, it's more defensive on accessing the response from the error since it was always assuming it existed and that it was JSON.

Lastly, this adds a condition to only dequeue from the API to delete a transfer session when the session actually had records. This avoids a lengthy, serialize and then deserialize within the dequeue function when it's unnecessary.

TODO

Have tests been written for the new code?
Has documentation been written/updated?
New dependencies (if any) added to requirements file

Reviewer guidance

This will be most easily tested when integrated with Kolibri

Issues addressed

Related to learningequality/kolibri#7125

codecov-commenter · 2020-07-02T23:30:53Z

Codecov Report

Merging #85 into master will increase coverage by 1.62%.
The diff coverage is 90.30%.

@@            Coverage Diff             @@
##           master      #85      +/-   ##
==========================================
+ Coverage   82.85%   84.47%   +1.62%     
==========================================
  Files          37       37              
  Lines        2385     2442      +57     
  Branches      301      303       +2     
==========================================
+ Hits         1976     2063      +87     
+ Misses        309      280      -29     
+ Partials      100       99       -1

…is JSON

…rialize

jamalex

Looks great! One of my main questions was addresses in the latest commit (the JSON-serialization of the client_fsic field), so it seems basically good to go from my side, just some small suggestions and then one thing around the cache key that seems important.

morango/sync/session.py

jamalex · 2020-07-06T22:30:23Z

morango/sync/session.py

-        except TypeError:
-            pass
+        self.bytes_sent += _length_of_headers(prepped.headers)
+        self.bytes_sent += _headers_content_length(prepped.headers)


When we're doing the sending, do we assume the content length header will always be set, and hence this would never be 0?

No we aren't assuming it's always set. If the request has no body, the requests helper will not add it. So I believe this should be okay.

Right, sorry, what I mean is: can we safely assume that if the request has a body, the Content-Length header will have already been added, so that we wouldn't miss counting that transfer amount? I know on the receiving side we handle the case of the missing header (in case it was... removed by a proxy or something?), but here I'm guessing it's OK, as we have more control (request is being initiated locally).

The missing header on the response side is for when the response is chunked. The request body can be chunked too, but for the sync, none of the requests are chunked because we give the whole body. So if there is a body, then it would have the content length header. I can add comments so that's clear.

morango/sync/syncsession.py

bjester added 2 commits July 2, 2020 16:14

Split sync client into two individual client classes for push/pull

4e31602

Pass in filter to deserialize also

6e5de74

Add backward compatible wrapper, fix other issues

50b3f73

bjester mentioned this pull request Jul 3, 2020

'database is locked' error while import/exporting facility and trying to view device info page learningequality/kolibri#7125

Closed

Add more tests and fix broken ones

95719db

bjester marked this pull request as ready for review July 6, 2020 17:19

Fix bandwith tracking and add more tests

5074b82

bjester changed the title ~~Split up SyncClient~~ Split up SyncClient and fix bandwidth tracking Jul 6, 2020

Black formatting

36a6ca1

bjester requested a review from jamalex July 6, 2020 21:38

bjester mentioned this pull request Jul 6, 2020

Separate morango pull/push clients with better write locking and better cancellation handling learningequality/kolibri#7251

Merged

9 tasks

bjester requested a review from rtibbles July 7, 2020 15:56

bjester added 2 commits July 8, 2020 09:32

RequestException/HTTPError has no __str__, and don't assume response …

fae4bf1

…is JSON

If no records transferred, skip dequeue due to lengthy serialize/dese…

e83ebb2

…rialize

bjester added enhancement PR: needs review labels Jul 8, 2020

bjester added 2 commits July 8, 2020 11:52

Add conditional to allow server timeout

fc45a42

Fix issues with push

51623f2

jamalex requested changes Jul 9, 2020

View reviewed changes

bjester added 5 commits July 13, 2020 17:22

Review updates, simplification

7a0797e

Fix session wrapper test and formatting

87bcbb7

Reformatting

90565b3

Actually remove library as was intended

5ff2575

Remove unused Filter import

bd1f8b8

bjester requested a review from jamalex July 14, 2020 17:56

bjester merged commit 0814681 into learningequality:master Jul 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split up `SyncClient` and fix bandwidth tracking #85

Split up `SyncClient` and fix bandwidth tracking #85

bjester commented Jul 2, 2020 •

edited

Loading

codecov-commenter commented Jul 2, 2020 •

edited

Loading

jamalex left a comment

jamalex Jul 6, 2020

bjester Jul 9, 2020

jamalex Jul 12, 2020

bjester Jul 13, 2020

Split up SyncClient and fix bandwidth tracking #85

Split up SyncClient and fix bandwidth tracking #85

Conversation

bjester commented Jul 2, 2020 • edited Loading

Summary

TODO

Reviewer guidance

Issues addressed

codecov-commenter commented Jul 2, 2020 • edited Loading

Codecov Report

jamalex left a comment

Choose a reason for hiding this comment

jamalex Jul 6, 2020

Choose a reason for hiding this comment

bjester Jul 9, 2020

Choose a reason for hiding this comment

jamalex Jul 12, 2020

Choose a reason for hiding this comment

bjester Jul 13, 2020

Choose a reason for hiding this comment

Split up `SyncClient` and fix bandwidth tracking #85

Split up `SyncClient` and fix bandwidth tracking #85

bjester commented Jul 2, 2020 •

edited

Loading

codecov-commenter commented Jul 2, 2020 •

edited

Loading