-
Notifications
You must be signed in to change notification settings - Fork 115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[APM] Surface Slow transactions and errors correlation analysis #302
Comments
Design update - Dec 22, 2020 I have been meaning to share a design update for a while on this project since it's been developing quite fast over the past months or so. We have been polishing a POC implementation of the significant terms which is available in 7.11 (feature-flagged). Additionally, we have been redesigning basically all of the Transactions views to align them with the new Service overview, and add new capabilities beyond just the significant terms feature, but also new distribution charts and a dedicated trace view. The placeholder implementation issues have already been created due to planning and I will begin to fill these out with the details for significant terms and the redesigned transactions list. |
Closing in favor of implementation issues elastic/kibana#86477, elastic/kibana#86783, and elastic/kibana#86478 |
Summary of the problem (If there are multiple problems or use cases, prioritize them)
As a user I want to quickly view interesting data about my slow transactions or errors.
User stories
As a DevOps/SRE engineer at my company, responsible for keeping our production systems running I’d like to understand why certain traces are slow. I'd like to see if there is metadata/tags with these traces that is different from the normal traces which could point me to the root cause of these slow transaction.
Examples of this could be particular hardware (host / pod) that is failing and therefore resulting in slow transactions. Another reason could be a set of users (based on ip address or region) who are physically far away from the nearest datacenter and is therefore experiencing increased latency.
The text was updated successfully, but these errors were encountered: