-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GC fails after deleting many tags from same repository #15807
Comments
this is potentially related to #12948 which we already hit in the past, but looks like there's no progress there, so hoping there's some new insight. |
The performance issue was happended at the distribution side to lookup and remove tags, we will do some investigation on this, but no specific plan so far. |
Thanks @wy65701436, is there a workaround? I was thinking to delete these artifacts from I realize this is sort of a corner case, but I can imagine that others may end up in a similar situation, so hopefully you folks can find some cycles soon to take a look at fixing this 🙏🏼 |
@wy65701436 on another instance of Harbor we run, we noticed that a repository with ~4000 tags takes about 2 minutes to delete a single manifest. It seems to me like there's a significant performance issue during GC for repositories that have several thousand tags and more. for example:
it also appears that these operations are sequential, perhaps some form of parallelism can be introduced to speed this up? though I think the root constraint needs to be addressed to be able to support any significant deployments. |
An option to resolve this problem: the artifact in the distribution is untagged, the tag is managed in the harbor core side. |
since in v2.5, we introduce the skip for deletion failure. This could workaround for the timeout. |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
still relevant |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
still an issue |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
definitely not stale |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
I am getting same error with 2.10.2. |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
not stale. |
This issue is being marked stale due to a period of inactivity. If this issue is still relevant, please comment or remove the stale label. Otherwise, this issue will close in 30 days. |
not stale |
|
@wy65701436 We faced same issue after moved to GCS, any PR or possible workaround here? |
@dkulchinsky do you mind that we close this issue and use #12948 to track the performance problem if they are the same. We are seeing some users reported the same performance problem on artifact deletion of GC. I will continue updating at #12948. |
Expected behavior and actual behavior:
We deleted over 20,000 tags from a repository (these tags are auto generated during our periodic CI job to test the registry and CI), we expected to run GC to get the related blobs and manifests cleaned up, but now GC fails consistently with the following error:
we caught the following log in the registry.
DELETE
request:and the
500
error ~20 minutes later:Steps to reproduce the problem:
Versions:
Please specify the versions of following systems.
Additional context:
The text was updated successfully, but these errors were encountered: