-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Decide deployment services check #714
Conversation
- /bin/sh | ||
- -c | ||
- > | ||
{{ if .Values.clickhouse.enabled }} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Don't need Clickhouse either?
This just checks the current status of services to determine if the pod can come up, so it doesn't mean we'll never connect to postgres/pgbouncer. I think instead we'd want something like an "OR" check between redis and postgres? If either is up, we're good to go. |
Good idea, not clear enough about things here, would want @PostHog/team-infra to give their input 😅 |
@@ -170,6 +170,5 @@ spec: | |||
securityContext: {{- omit .Values.decide.securityContext "enabled" | toYaml | nindent 12 }} | |||
{{- end }} | |||
initContainers: | |||
{{- include "_snippet-initContainers-wait-for-service-dependencies" . | indent 8 }} | |||
{{- include "_snippet-initContainers-wait-for-migrations" . | indent 8 }} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Decide still makes queries to PostgreSQL right? So we still need to wait for them to have been applied?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I might have misunderstood earlier & mislead the ff team ... this would only apply on new container setup, not on restart? If that's true and considering we do rolling deploys this is fine to keep as is. Sorry.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah if you don't include the migration check, you'll end up with a SELECT all, the, fields, including, ones, that, don't, exist, yet FROM posthog_persons
which will error and if we're lucky fall back to redis, or just straight error out. Either way I don't think that's desired.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just confirmed that init containers only run when the pod is created
pods never restart, only the containers within them
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok, then we should investigate, why FF went down, when PG was unavailable (as this was the initial hypothesis iirc and based on what Frank said it's not the case)
**Initial draft changes so discussion can be done here
Description
based on https://posthog.slack.com/archives/C05034MPYA0/p1679596107221869, the decide pods still rely postgres because of the startup checks. This means that if postgres is down, decide pods will be down and error out regardless of caching
Discussion points