This repository provides a quick tool to profile user queries on Kibana (and, indirectly, on Elasticsearch).
It was made to quickly analyze how users query Kibana (and Elasticsearch) on a 6.8 cluster. The result was great, and very helpful!
Note: not tested on versions other than 6.8, especially since Kibana 7.7, which uses async search.
Note: installing the Elastic Node.js APM agent on Kibana can also provide great insights out of the box! Here is a great blog post and another one on this topic!
First, we need to capture traffic. Here, we used Packetbeat, installed on the Elasticsearch node queried by Kibana, to capture all traffic coming from Kibana to Elasticsearch (not the indexing traffic coming from data ingestion).
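A minimal Packetbeat sketch could look like this (options vary a bit across versions, so double-check against the Packetbeat 6.8 docs; the output host is a placeholder):

```yaml
packetbeat.interfaces.device: any

packetbeat.protocols:
- type: http
  ports: [9200]                           # traffic hitting Elasticsearch
  send_request: true                      # we need the request side
  include_body_for: ["application/json"]  # keep the JSON bodies posted by Kibana

output.elasticsearch:
  hosts: ["monitoring-host:9200"]         # placeholder: ship to a separate cluster
```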
Then, simply create a saved search in Discover to display:
- @timestamp, in the form `October 5th 2021, 20:59:56.052`, in UTC
- path, for instance `/myindex/_search`
- responsetime (integer, in ms)
- http.request.body, i.e. the posted body, like `{"size": 0,"query": {"range": {"event.end": {"gte": "now-24h","lt": "now"}}}}`
The time range should be one day, midnight to midnight.
Once the search is saved, export it in CSV and name the file `export.csv`.
Note: if the file is too big, increase the `xpack.reporting.csv.maxSizeBytes` setting (10 MB by default). If you can't, simply chunk the export, by hour for instance, naming the files `export-NN.csv`.
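For instance, in `kibana.yml` (the value is in bytes; 50 MB here is an arbitrary choice):

```yaml
# kibana.yml
xpack.reporting.csv.maxSizeBytes: 52428800  # 50 MB (default is 10485760, i.e. 10 MB)
```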
Answer: a mess :-/
The body part can be proper JSON... or not! `_msearch` usually concatenates several JSON objects (NDJSON). The date range format also varies: sometimes date math like `now-10m`, sometimes epoch milliseconds, sometimes a plain Unix timestamp in seconds...
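To give an idea, here is a rough sketch of the kind of normalization the script has to do (not the actual script code; function names are illustrative):

```python
import json
import re
import time

def parse_bodies(raw_body):
    """A plain _search body is one JSON document; an _msearch body
    concatenates several JSON objects (NDJSON), one per line."""
    try:
        return [json.loads(raw_body)]
    except json.JSONDecodeError:
        return [json.loads(line) for line in raw_body.splitlines() if line.strip()]

def to_epoch_seconds(bound):
    """Normalize a range bound: 'now-10m' date math, epoch milliseconds,
    or a plain Unix timestamp in seconds."""
    if isinstance(bound, str) and bound.startswith("now"):
        m = re.match(r"now-(\d+)([smhd])", bound)
        if not m:                      # plain "now"
            return time.time()
        per_unit = {"s": 1, "m": 60, "h": 3600, "d": 86400}
        return time.time() - int(m.group(1)) * per_unit[m.group(2)]
    value = float(bound)
    return value / 1000 if value > 1e12 else value  # 13-digit values are milliseconds
```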
In its simplest form, it works by simply placing the py file in the same directory as the exported CSV file(s) and running `python kibana-user-profiling.py`.
This exports a new `kibana-user-profiling.csv` file.
Inside the script, several hard-coded values may need adapting:
- epoch dates are shifted to UTC+2 (Paris time!)
- year parsing only takes '2021' into account, so if you're in 2022...
- several paths are excluded, like `_xpack`, `_security`, rollup indices, `internal.*` indices, etc. You may want to keep them.
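As an illustration, this is the kind of constant to look for (names are made up for this sketch; the script's actual variables may differ):

```python
TZ_OFFSET_HOURS = 2        # epoch dates shifted to UTC+2 (Paris time)
EXPECTED_YEAR = "2021"     # the @timestamp parser assumes this year
EXCLUDED_PATH_PREFIXES = ("/_xpack", "/_security", "/internal.")  # plus rollup indices, etc.

def keep_path(path):
    """Drop internal/administrative calls, keep user queries."""
    return not path.startswith(EXCLUDED_PATH_PREFIXES)
```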
Yeah I know, Excel... Python could work all the way to the visualization, but I really like Excel! Do not open the CSV file as is; this won't work. Instead, use Data > Get Data (or New Query) > From CSV. Once imported, the table has 12 columns.
I added a few columns afterwards, but they could of course be computed by the py script (see the sketch after this list):
- a query type, being 'dashboard' for `_msearch` queries, 'count' for `_count` ones and 'search' for `_search` ones
- a start age in hours, computed as the difference between the timeEnd epoch and the timeStart epoch, divided by 3600
- a viz type (for instance a date histogram) and a query filter (for instance a term filter), parsed from the body using a VB macro, but I didn't finish it... :-(
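A minimal Python sketch of the first two columns (function names are mine, not the script's):

```python
def query_type(path):
    """'dashboard' for _msearch, 'count' for _count, 'search' for everything else."""
    if path.endswith("/_msearch"):
        return "dashboard"
    if path.endswith("/_count"):
        return "count"
    return "search"

def start_age_hours(time_start_epoch, time_end_epoch):
    """Difference between the two range bounds, in hours."""
    return (time_end_epoch - time_start_epoch) / 3600
```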
From there, I created several reports like (a pandas equivalent of the pivot tables is sketched after this list):
- a histogram of the number of requests, per age (displayed hereunder)
- the number of queries per index (pivot table, indices in rows, query types in columns, a simple count as values)
- the average response time per index (pivot table, indices in rows, query types in columns, the average response time as values)
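If you would rather stay in Python after all, a rough pandas equivalent of the two pivot tables could look like this (assuming the CSV has columns named `index`, `query_type` and `responsetime`; adjust to the actual headers):

```python
import pandas as pd

df = pd.read_csv("kibana-user-profiling.csv")

# number of queries per index and query type
counts = df.pivot_table(index="index", columns="query_type",
                        values="responsetime", aggfunc="count")

# average response time (ms) per index and query type
avg_rt = df.pivot_table(index="index", columns="query_type",
                        values="responsetime", aggfunc="mean")

print(counts)
print(avg_rt)
```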
If your histogram looks very crushed, like mine here, you should customize the bins.
Add a sheet and define new bins; mine are 0.2, 0.3, 3, 48, 120, 480 hours, and beyond.
Then use the `FREQUENCY` function, like `=FREQUENCY('data'!M2:M8225;'freq range'!B2:B8)`.
I also added a cumulated % column (a running sum of the frequencies divided by the total).
Here you go:
You can build a histogram out of it:
In my example, this analysis showed that 95% of requests target data less than 5 days old, and 97% less than 30 days old. Quite a hint to play with ILM and resizing! Happy profiling!
- Author: Vincent Maury
- Contributor: a great partner of Elastic that cannot be shared ;)
- License: Apache 2.0