🎨 Reduces response time of catalog/services listing entrypoint #5273
Conversation
Codecov Report
@@ Coverage Diff @@
## master #5273 +/- ##
========================================
+ Coverage 87.2% 88.7% +1.4%
========================================
Files 1308 1213 -95
Lines 53550 49548 -4002
Branches 1170 1024 -146
========================================
- Hits 46749 43960 -2789
+ Misses 6552 5367 -1185
+ Partials 249 221 -28
Flags with carried forward coverage won't be shown.
What do these changes do?
This PR analyses and provides a temporary solution for incident #5267, which reports very long delays in the GET /catalog/services entrypoint. In addition, the front-end can at times spawn up to 17 parallel calls to this entrypoint (probably due to some retry mechanism).

We are well aware that the original design does not scale with the number of services. A proper resolution of this issue therefore requires a redesign that incorporates at least pagination and lighter item objects with bounded fields.
For the moment we decided to go with a strategic TTL cache that reduces the overhead of post-processing every service item in the webserver. Note that the webserver does not just forward the service object provided by the catalog service: it also computes and adds some extra information (e.g. units) to it. Profiling reveals that replace_service_inputs incurs a considerable computation overhead, so the recent increase in the number of services caused the large delays reported in #5267. These are the benchmark results and profiler output before the cache was added


and this is after
Finally here we can see several calls to the entrypoint. After the first, subsequent calls take much less time
Regarding the empty lists, I also noticed that these come directly in the response of the catalog service (and not from the webserver). This is probably due to an issue with the caches on the multiple replicas there. Nonetheless, this point has not been addressed here.
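To illustrate the caching strategy: the PR uses cachetools' TTLCache for sync functions, so entries are recomputed only after they expire. The sketch below reimplements that idea with the standard library only; the decorator, the service key, and the body of replace_service_inputs are all hypothetical stand-ins for the real webserver code.

```python
import time
from functools import wraps


def ttl_cache(ttl: float, maxsize: int = 128):
    """Minimal stand-in for cachetools.TTLCache + @cached (sketch only)."""

    def decorator(fn):
        store = {}  # key -> (expiry timestamp, cached value)

        @wraps(fn)
        def wrapper(*args):
            now = time.monotonic()
            hit = store.get(args)
            if hit and hit[0] > now:
                return hit[1]  # still fresh: skip the expensive call
            value = fn(*args)
            if len(store) >= maxsize:
                store.pop(next(iter(store)))  # naive eviction of oldest entry
            store[args] = (now + ttl, value)
            return value

        return wrapper

    return decorator


calls = []  # records how often the expensive body actually runs


@ttl_cache(ttl=60.0)  # assumed TTL; the real value lives in the PR's code
def replace_service_inputs(service_key):
    # Stand-in for the expensive post-processing (e.g. attaching units).
    calls.append(service_key)
    return service_key.upper()


replace_service_inputs("simcore/services/comp/solver")  # computed
replace_service_inputs("simcore/services/comp/solver")  # served from cache
```

With the cache in place, the second call within the TTL window returns immediately, which is why subsequent calls to the entrypoint are much faster in the profiles above.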
Details
- cachetools to cache sync functions (we have aiocache to cache async functions)
- pytest-benchmark to benchmark replace_service_inputs
- msgpack, mainly to remove the log warning message of some libraries (e.g. aiocache) that use it as a default if it is available

Related issue/s
How to test
Driving test
services/web/server/tests/unit/isolated/test_catalog_models.py
Dev Checklist
DevOps