Slow performance after docker upgrade #7667
Labels
Affects: Operations
Affects the IA DevOps folks
Module: Solr
Issues related to the configuration or use of the Solr subsystem. [managed]
Priority: 0
Fix now: Issue prevents users from using the site or active data corruption. [managed]
Theme: Performance
Issues related to UI or Server performance. [managed]
Theme: Provisioning
Type: Post-Mortem
Log for when having to resolve a P0 issue
Summary
tl;dr:
apt-get install apparmor
Site is extremely slow, solr auto-restarting frequently, 503s for merge queue (EDIT: some of these issues preceded docker upgrade)
We checked on
At ~11am PT @cclauss performed a docker upgrade on
ol-home0
re: #7626 (comment). The upgrade was completed successfully however no containers were running.Initially, we suspected ~performance issues with ol-www0? We ran a
sudo docker restart openlibrary_web_nginx_1 openlibrary_web_haproxy_1 on ol-www0
which seemed to work for a moment.We
ssh
'ed tool-home0
and noticed no containers running viadocker ps
. Identified this was related to the recent docker upgrade. When we tried to manually restart containers:And failed with error:
We asked our newest team member ChatGPT for help and didn't get great suggestions however, we found a hint in docker/for-linux#1199 which suggested
apt install AppArmor
. After a restart, and re-running of the commands above, the systems were back up!#openlibrary
https://github.com/internetarchive/openlibrary/wiki/Production-Service-Architecture#performing-upgrades
Steps to close
Affects:
label applied?The text was updated successfully, but these errors were encountered: