-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Observability Patterns #6
Comments
@ticean & @rosshadden I've added a user story and a naive deliverable for distributed tracing in the Gateway. Looking to yall in defining acceptance criteria and guiding implementation since this is a little outside of my skill set. |
@griggheo wanted to loop you in on this issue and the tracing conversation |
Thanks Nat, I'll pitch in when necessary but this looks good so far. |
Do we require "distributed tracing"? Would local tracing be enough to get started? The gql server libs might offer a local tracing option out of the box? Is it enough to see counts of requests, fails, how long they are taking, byte size, etc? I think we can get all of this with local tracing. This kind of tracing wouldn't give insight into why those backend requests took a long time. We'd have to investigate that separately. Distributed tracing implies an external tracing server (ex Jaeger). We'd also need to implement that kind of tracing in the backend apps to get the "distributed" benefits. |
Yes, I think it makes sense to start with local tracing vs a more heavy-duty distributed tracing solution. |
@ticean no i don't think we need to require distributed tracing for this and looks like we might be able to leverage the built in Apollo tracing tools to do local tracing. |
Will need to create individual more granular tickets |
Establish Observability Patterns
The Federated Gateway service needs established observability patterns to help users understand how the service is operating.
User Stories
Deliverable
.env
log level configuration.health
API endpoint that returns service status..env
variable.Acceptance Criteria
health
endpoint and get a200
status.The text was updated successfully, but these errors were encountered: